Process OCR Documents from Google Drive into Searchable Knowledge Base with OpenAI & Pinecone

Name: Process OCR Documents from Google Drive into Searchable Knowledge Base with OpenAI & Pinecone
Availability: InStock
Rating: 0.4 (1 reviews)
Author: osama goda

How it works This workflow automates a full RAG ingestion pipeline. When a new OCR JSON file is added to a Google Drive folder, the workflow extracts lesson metadata, parses and cleans the Arabic text, generates semantic chunks, creates AI embeddings, and stores them in a Pinecone vector index. After processing, the file is automatically moved to an archive folder to prevent duplicates.

Set up steps Follow the sticky notes inside the workflow for detailed instructions.

Connect your Google Drive credentials. Replace the input folder ID and archive folder ID with your own. Connect your OpenAI account for embeddings. Connect your Pinecone API key and select your index.

The workflow is ready to run once credentials and folder paths are configured.

Downloads

Views

8.29

Quality Score

intermediate

Complexity

Category:AI & Machine Learning

Author:osama goda(View Original →)

Created:12/12/2025

Updated:2/17/2026

Related Templates

AI SEO Readability Audit: Check Website Friendliness for LLMs

Who is this for? This workflow is designed for SEO specialists, content creators, marketers, and website developers who ...

AI & Machine Learning5 downloads

Use OpenRouter in n8n versions <1.78

What it is: In version 1.78, n8n introduced a dedicated node to use the OpenRouter service, which lets you to use a lot...

AI & Machine Learning4 downloads

Reply to Outlook Emails with OpenAI

Who is this template for? This template is for any Microsoft Outlook user who wants a trained AI agent to reason and rep...

AI & Machine Learning4 downloads

🔒 Please log in to import templates to n8n and favorite templates

Workflow Visualization

Loading...

Preparing workflow renderer