Build a RAG System for PDF Documents with Google Drive, Unstructured, and OpenAI

This template monitors a Google Drive folder, converts PDF documents into clean text chunks with Unstructured, generates OpenAI embeddings, and upserts vectors into Pinecone. It’s a practical, production-ready starting point for Retrieval-Augmented Generation (RAG) that you can plug into a chatbot, semantic search, or internal knowledge tools.

How it works

  1. Google Drive Trigger detects new files in a selected folder and downloads them.
  2. The files are sent to Unstructured, which splits them into smaller pieces (chunks).
  3. The chunks are formatted and sent to OpenAI, which converts them into vectors (embeddings).
  4. Each embedding is recombined with its original chunk data, and the resulting payload is prepared for upsert into the Pinecone index.
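
As a rough illustration of step 4, the Python sketch below pairs each chunk with its embedding and shapes the body of a Pinecone upsert request. It is not the template's own node code. The function name, the `source#position` ID scheme, and the metadata keys are hypothetical, and the sample vectors are shortened to three values for readability. It assumes the chunks are dictionaries with a `text` field and that the embeddings arrive in the OpenAI response shape, where each item in `data` carries an `index` and an `embedding`.

```python
from typing import Any


def build_upsert_payload(
    chunks: list[dict[str, Any]],
    embedding_response: dict[str, Any],
    source_file: str,
    namespace: str = "",
) -> dict[str, Any]:
    """Pair each chunk with its embedding and shape a Pinecone upsert body."""
    # Sort by "index" so each vector lines up with the chunk it was made from.
    items = sorted(embedding_response["data"], key=lambda item: item["index"])
    if len(items) != len(chunks):
        raise ValueError("chunk count and embedding count differ")

    vectors = []
    for position, (chunk, item) in enumerate(zip(chunks, items)):
        vectors.append(
            {
                # Hypothetical ID scheme: file name plus chunk position.
                "id": f"{source_file}#{position}",
                "values": item["embedding"],
                # Keeping the text in metadata lets a retriever return it later.
                "metadata": {"text": chunk["text"], "source": source_file},
            }
        )

    payload: dict[str, Any] = {"vectors": vectors}
    if namespace:
        payload["namespace"] = namespace
    return payload


if __name__ == "__main__":
    # Toy data; real text-embedding-3-small vectors have 1536 values.
    sample_chunks = [{"text": "First chunk."}, {"text": "Second chunk."}]
    sample_response = {
        "data": [
            {"index": 0, "embedding": [0.1, 0.2, 0.3]},
            {"index": 1, "embedding": [0.4, 0.5, 0.6]},
        ]
    }
    print(build_upsert_payload(sample_chunks, sample_response, "demo.pdf"))
```

Storing the chunk text in the vector metadata is one common choice for RAG: a later query can then return readable passages without a second lookup.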

Set up steps

  1. In Pinecone, create an index with 1536 dimensions, the output size of OpenAI's text-embedding-3-small model.
  2. Copy the index host URL and paste it into the 'Pinecone Upsert' node. It should look something like this: https://{your-index-name}.pinecone.io/vectors/upsert.
  3. Add Google Drive, OpenAI and Pinecone credentials in n8n.
  4. Point the trigger to your ingest folder (you can use this article for a demo).
  5. Click the 'Open chat' button and enter the following: Which Git provider do the authors use?
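
Two details in the setup steps are easy to get wrong: the upsert URL and the vector dimension. The Python sketch below shows one way to sanity-check both before sending anything. The helper names are hypothetical and not part of n8n or the Pinecone client. It assumes the host copied from Pinecone may or may not include the scheme or the `/vectors/upsert` path.

```python
EXPECTED_DIMENSION = 1536  # index dimension chosen in step 1


def upsert_url(index_host: str) -> str:
    """Normalize a copied index host into a full upsert URL."""
    host = index_host.strip().rstrip("/")
    if "://" not in host:
        host = "https://" + host
    if host.endswith("/vectors/upsert"):
        return host
    return host + "/vectors/upsert"


def check_dimensions(vectors: list[dict], expected: int = EXPECTED_DIMENSION) -> None:
    """Raise ValueError if any vector does not match the index dimension."""
    for vector in vectors:
        actual = len(vector["values"])
        if actual != expected:
            raise ValueError(
                f"vector {vector['id']!r} has {actual} values, expected {expected}"
            )


if __name__ == "__main__":
    # Placeholder host, following the pattern shown in step 2.
    print(upsert_url("your-index-name.pinecone.io"))
    check_dimensions([{"id": "demo.pdf#0", "values": [0.0] * 1536}])
```

A dimension mismatch between the embedding model and the index causes upserts to be rejected, so a check like this can surface the problem before the request is made.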

Complexity: intermediate
Author: Tomas Lubertino
Created: 11/15/2025
Updated: 11/20/2025
