Website Content Scraper & SEO Keyword Extractor with GPT-4o-mini and Airtable
This workflow scrapes a website, cleans the HTML, extracts structured information with GPT-4o-mini, and stores the results together with SEO keywords in Airtable. It is suited to building keyword lists and organizing web content for SEO research.
Setup Instructions
- Prerequisites
  - n8n Community or Cloud instance
  - Airtable account with a base and table ready
  - OpenAI API Key with access to GPT-4o-mini
- Airtable Structure
Ensure your Airtable table has the following fields:

| Field Name   | Type    | Notes                           |
| ------------ | ------- | ------------------------------- |
| Website Name | String  | Name or URL of the website      |
| Data         | String  | Cleaned website text            |
| Keyword      | String  | Extracted SEO keyword list      |
| Status       | Options | Values: Todo, In progress, Done |
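
To make the table above concrete, here is a minimal Python sketch of the field mapping one record might carry. The helper name `build_airtable_fields` and the comma-joined keyword format are assumptions for illustration; the source only states that `Keyword` is a string holding the keyword list, and the workflow itself performs this step with the Airtable node rather than with code.

```python
# Status values listed in the Airtable structure table.
ALLOWED_STATUSES = ("Todo", "In progress", "Done")


def build_airtable_fields(website_name, data, keywords, status="Todo"):
    """Return a dict keyed by the Airtable field names from the table.

    Hypothetical helper: joining keywords with ", " is an assumption,
    not something the workflow description specifies.
    """
    if status not in ALLOWED_STATUSES:
        raise ValueError(f"Status must be one of {ALLOWED_STATUSES}, got {status!r}")
    return {
        "Website Name": website_name,    # name or URL of the website
        "Data": data,                    # cleaned website text
        "Keyword": ", ".join(keywords),  # extracted SEO keyword list
        "Status": status,                # Todo, In progress, or Done
    }
```

For example, `build_airtable_fields("https://example.com", "Cleaned text", ["seo tools", "keyword research"])` produces a record whose `Status` defaults to `Todo`, while an unlisted status such as `"Archived"` raises a `ValueError`.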
- Node Setup
✅ Form Trigger: Collects website URL from the user.
✅ HTTP Request: Fetches the website content.
✅ HTML Cleaner (Code Node): Strips out styles, tags, and whitespace to get clean text.
✅ Topic Extractor (AI Agent + GPT-4o-mini): Extracts topic-wise information from the cleaned website content.
✅ Text Cleaner (Code Node): Removes unwanted symbols like ### and **.
✅ Keyword Extractor (AI Agent + GPT-4o-mini): Generates a list of 90 important SEO keywords.
✅ Airtable Upsert: Stores the cleaned data, keywords, and status in Airtable.
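
The two Code nodes in the list above can be sketched as plain functions. This is a rough standalone Python illustration of the cleaning logic, not the template's actual node code, which is not shown here (n8n Code nodes commonly use JavaScript). The function names `html_to_text` and `strip_markdown_symbols` are hypothetical, and the regex approach is an assumption that works for simple pages; a proper HTML parser would be more robust on malformed markup.

```python
import html
import re


def html_to_text(raw_html: str) -> str:
    """Sketch of the HTML Cleaner step: drop styles, scripts, tags, extra whitespace."""
    # Remove <script> and <style> blocks together with their contents.
    text = re.sub(r"(?is)<(script|style)\b[^>]*>.*?</\1\s*>", " ", raw_html)
    # Remove HTML comments.
    text = re.sub(r"(?s)<!--.*?-->", " ", text)
    # Replace each remaining tag with a space so neighbouring words stay apart.
    text = re.sub(r"<[^>]+>", " ", text)
    # Decode entities such as &amp; into plain characters.
    text = html.unescape(text)
    # Collapse all runs of whitespace into single spaces.
    return re.sub(r"\s+", " ", text).strip()


def strip_markdown_symbols(text: str) -> str:
    """Sketch of the Text Cleaner step: remove symbols like ### and **."""
    # Remove heading markers (runs of '#') and any spaces that follow them.
    text = re.sub(r"#+[ \t]*", "", text)
    # Remove bold markers.
    text = text.replace("**", "")
    # Collapse repeated spaces and tabs, keeping line breaks intact.
    return re.sub(r"[ \t]+", " ", text).strip()
```

In this sketch, `html_to_text` would run on the HTTP Request output before the Topic Extractor, and `strip_markdown_symbols` would run on the model's response before the Keyword Extractor. Note that the second function removes every `#` run, so it would also strip hashtags if the text contained any.
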
- Key Features
✅ Automatic website content scraping
✅ Clean HTML and extract plain text
✅ Use GPT-4o-mini for topic-wise information extraction
✅ Generate 90-keyword SEO lists
✅ Store and manage data in Airtable
- Use Cases
  - SEO Keyword Research
  - Competitor Website Content Analysis
  - Structured Website Data Collection
Additional Workflow Recommendations
✅ Rename Nodes for Clarity

| Current Name | Suggested Name                  |
| ------------ | ------------------------------- |
| Website Name | Website URL Input Form          |
| HTTP Request | Fetch Website Content           |
| Code         | HTML to Plain Text Cleaner      |
| Split Out1   | Clean Text Splitter             |
| AI Agent1    | Topic Extractor (GPT-4o-mini)   |
| Code1        | Text Cleanup Formatter          |
| Split Out2   | Final Text Splitter             |
| AI Agent     | Keyword Extractor (GPT-4o-mini) |
| Airtable     | Airtable Data Upsert            |
| Wait1        | Delay Before Merge              |
| Merge        | Combine Data for Airtable       |