Crawl Websites & Answer Questions with GPT-5 nano and Google Sheets
Web Consultation & Crawling Chatbot with Google Sheets Memory
Who is this workflow for? This workflow is designed for SEO analysts, content creators, marketing agencies, and developers who need to index a website and then interact with its content as if it were a chatbot. ⚠ Note: if the site contains many pages, AI token consumption can generate high costs, especially during the initial crawling and analysis phase.
- Initial Mode (first use with a URL)
When the user enters a URL for the first time:
URL validation using AI (gpt-5-nano). Automatic sitemap discovery via robots.txt. Relevant sitemap selection (pages, posts, categories, or tags) using GPT-4o according to configured options. (Includes “OPTIONS” node to precisely choose which types of URLs to process) Crawling of all selected pages:
Downloads HTML of each page. Converts HTML to Markdown. AI analysis to extract:
Detected language.
Heading hierarchy (H1, H2, etc.).
Internal and external links.
Content summary.
Structured storage in Google Sheets:
Lang H1 and hierarchy External URLs Internal URLs Summary Content Data schema (flag to enable agent mode)
When finished, the sheet is marked with Data schema = true, signaling that the site is indexed.
- Agent Mode (subsequent queries)
If the URL has already been indexed (Data schema = true):
The chat becomes a LangChain Agent that:
Reads the database in Google Sheets. Can perform real-time HTTP requests if it needs updated information. Responds as if it were the website, using stored and live data.
This allows the user to ask questions such as:
"What’s on the contact page?" "How many external links are there on the homepage?" "Give me all the H1 headings from the services pages" "What CTA would you suggest for my page?" "How would you expand X content?"
Use cases
Build a chatbot that answers questions about a website’s content. Index and analyze full websites for future queries. SEO tool to list headings, links, and content summaries. Assistant for quick exploration of a site’s structure. Generate improvement recommendations and content strategies from site data.
Related Templates
AI Agent: Scrape, Summarize & Save Articles to Notion (Gemini, Browserless)
This n8n workflow automates the process of saving web articles or links shared in a chat conversation directly into a No...
USDT And TRC20 Wallet Tracker API Workflow for n8n
Overview This n8n workflow is specifically designed to monitor USDT TRC20 transactions within a specified wallet. It u...
Send structured logs to BetterStack from any workflow using HTTP Request
Send structured logs to BetterStack from any workflow using HTTP Request Who is this for? This workflow is perfect for...
🔒 Please log in to import templates to n8n and favorite templates
Workflow Visualization
Loading...
Preparing workflow renderer
Comments (0)
Login to post comments