Website Content Chatbot with Pinecone, Airtable & OpenAI for RAG Applications
This n8n workflow extracts website content, indexes it in Pinecone, and combines that index with an Airtable billing tool to power a chat agent for customer Q&A.
Use cases include:
Building a knowledge base from your website.
Creating a chatbot that answers customer queries using your own site content.
Powering RAG workflows for FAQs, support docs, or product knowledge.
How it works
The workflow starts from a manual trigger or an incoming chat message.
Website content is fetched via HTTP Request.
The HTML body is extracted and converted into clean Markdown.
Text is split into chunks (about 500 characters each, with a 50-character overlap) using the Character Text Splitter.
OpenAI embeddings are generated for each chunk.
Content and embeddings are stored in Pinecone with namespace separation.
A Chat Agent (powered by OpenAI or OpenRouter) retrieves answers from Pinecone and Airtable.
A memory buffer allows multi-turn conversations.
A billing tool (Airtable) provides dynamic billing-related answers when needed.
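The chunking step above can be illustrated outside n8n. The following is a minimal sketch of fixed-size character windows with overlap, assuming a plain sliding window; the Character Text Splitter node may split on a separator first and then merge pieces, so this is not its exact algorithm. The function name split_with_overlap is hypothetical.

```python
def split_with_overlap(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into character windows; neighbours share `overlap` characters.

    Simplified illustration only, not the n8n node's implementation.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be at least 0 and smaller than chunk_size")

    step = chunk_size - overlap  # how far each new window advances
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

With the defaults, each chunk repeats the last 50 characters of the previous one, so a sentence cut at a chunk boundary still appears whole in at least one chunk when it is shorter than the overlap.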
How to use
Replace the sample website URL in the HTTP Request node with your own domain or content source.
Update the Normalize code to strip noise from your site's Markdown output.
Adjust the chunk size in the Text Splitter to suit your website's Markdown output.
In this example, the Character Text Splitter with the separator ###### (a level-6 Markdown heading) worked well.
Always check the Markdown output to fine-tune your splitting logic.
Update Pinecone namespace to match your project.
Customize the Chat Agent system prompt to fit your brand voice and response rules.
Connect to your own Airtable schema if you want live billing/payment data access.
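To make the normalize and split steps concrete, here is a rough sketch in Python. The workflow's actual Normalize code is not shown here, so normalize_markdown and its cleanup rules (dropping image tags, trimming trailing spaces, collapsing blank lines) are assumptions for illustration, and split_on_separator is a hypothetical helper that mimics splitting on the ###### separator.

```python
import re


def normalize_markdown(markdown: str) -> str:
    """Hypothetical cleanup rules; adapt them to the noise in your own output."""
    text = re.sub(r"!\[[^\]]*\]\([^)]*\)", "", markdown)  # drop image tags
    text = re.sub(r"[ \t]+\n", "\n", text)                # trim trailing spaces
    text = re.sub(r"\n{3,}", "\n\n", text)                # collapse blank-line runs
    return text.strip()


def split_on_separator(markdown: str, separator: str = "######") -> list[str]:
    """Split on the separator and discard empty sections."""
    parts = (part.strip() for part in markdown.split(separator))
    return [part for part in parts if part]
```

Running a page through normalize_markdown and then split_on_separator yields one section per level-6 heading, which is a quick way to preview how your own Markdown would be divided before changing the splitter settings.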
Requirements
OpenAI account (for embeddings + chat model).
Pinecone account (vector DB for semantic search).
Airtable account (if using the billing tool).
(Optional) OpenRouter account (alternative chat model provider).
n8n self-hosted or cloud.
Need Help?
Ask in the n8n Forum!
Happy Automating! 🚀