Extract Embedded Images from Google Drive Documents with VLM Run Agent

Name: Extract Embedded Images from Google Drive Documents with VLM Run Agent
Availability: InStock
Rating: 0.4 (1 reviews)
Author: Shahrear

🧾 Image Extraction Pipeline (Google Drive + VLM Run + n8n)

⚙️ What This Workflow Does

This workflow automates the process of extracting images from uploaded documents in Google Drive using the VLM Run Execute Agent, then downloads and saves those extracted images into a designated Drive folder.

🧩 Requirements Google Drive OAuth2 credentials**
VLM Run API credentials** with Execute Agent access
A reachable n8n Webhook URL (e.g., /image-extract-via-agent)

⚡Quick Setup

Configure Google Drive OAuth2 and create upload folder and folder for saving extracted images. Install the verified VLM Run node by searching for VLM Run in the node list, then click Install. Once installed, you can start using it in your workflows. Add VLM Run API credentials for document parsing.

⚙️ How It Works

Monitor Uploads – The workflow watches a specific Google Drive folder for new file uploads (e.g., receipts, reports, or PDFs).
Download File – When a file is created, it’s automatically downloaded in binary form.
Extract Images (VLM Run) – The file is sent to the VLM Run Execute Agent, which analyzes the document and extracts image URLs via its callback.
Receive Image Links (Webhook) – The workflow’s Webhook node listens for the agent’s response containing extracted image URLs.
Split & Download – The Split Out node processes each extracted link, and the HTTP Request node downloads each image.
Save Image – Finally, each image is uploaded to your chosen Google Drive folder for storage or further processing.

💡Why Use This Workflow

Manual image extraction from PDFs and scanned files is repetitive and error-prone. This pipeline automates it using VLM Run, a vision-language AI service that:

Understands document layout and structure Handles multi-page and mixed-content files Extracts accurate image data with minimal setup. For example- the output contains URLs to extracted images

{ "image_urls": [ "https://vlm.run/api/files/img1.jpg", "https://vlm.run/api/files/img2.jpg" ] }

Works with both images and PDFs

🧠 Perfect For Extracting photos or receipts from multi-page PDFs
Archiving embedded images from reports or invoices
Preparing image datasets for labeling or ML model training

🛠️ How to Customize

You can extend this workflow by: Adding naming conventions or folder structures based on upload type Integrating Slack/Email notifications when extraction completes Including metadata logging (file name, timestamp, source) into Google Sheets or a database Chaining with classification or OCR workflows using VLM Run’s other agents

⚠️ Community Node Disclaimer

This workflow uses community nodes (VLM Run) that may need additional permissions and custom setup.

0

Downloads

0

Views

7.41

Quality Score

beginner

Complexity

Category:AI & Machine Learning

Author:Shahrear(View Original →)

Created:10/17/2025

Updated:12/18/2025

Related Templates

AI SEO Readability Audit: Check Website Friendliness for LLMs

Who is this for? This workflow is designed for SEO specialists, content creators, marketers, and website developers who ...

AI & Machine Learning3 downloads

Use OpenRouter in n8n versions <1.78

What it is: In version 1.78, n8n introduced a dedicated node to use the OpenRouter service, which lets you to use a lot...

AI & Machine Learning3 downloads

Task Deadline Reminders with Google Sheets, ChatGPT, and Gmail

Intro This template is for project managers, team leads, or anyone who wants to automatically remind teammates of tasks ...

Extract Embedded Images from Google Drive Documents with VLM Run Agent

Tags

Related Templates

AI SEO Readability Audit: Check Website Friendliness for LLMs

Use OpenRouter in n8n versions <1.78

Task Deadline Reminders with Google Sheets, ChatGPT, and Gmail

Workflow Visualization

Loading...

Comments (0)