Scrape Trustpilot Reviews with DeepSeek, Analyze Sentiment with OpenAI
Workflow Overview
This workflow automates scraping Trustpilot reviews, extracting key details, analyzing sentiment, and saving the results to Google Sheets. It uses HTML parsing to collect review links, DeepSeek to extract structured review data, and OpenAI for sentiment analysis.
How It Works
Scrape Trustpilot Reviews
**HTTP Request**:
Fetches review pages from Trustpilot (https://it.trustpilot.com/review/{{company_id}}).
Paginates through the result pages, up to the max_page limit.
**HTML Parsing**:
Extracts review URLs using CSS selectors.
Splits the URLs into individual review links.
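The scraping step can be sketched outside n8n as follows. This is a minimal Python illustration, not the workflow's actual node configuration: the `?page=N` query parameter, the `/reviews/` link prefix, and the helper names `page_urls` and `extract_review_links` are assumptions made for the example. The workflow itself selects links with CSS selectors in the HTML node.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

BASE_URL = "https://it.trustpilot.com"  # host taken from the workflow description


def page_urls(company_id: str, max_page: int) -> list[str]:
    """Build one listing URL per page, from page 1 up to max_page."""
    # Assumption: listing pages are selected with a `page` query parameter.
    return [f"{BASE_URL}/review/{company_id}?page={n}" for n in range(1, max_page + 1)]


class ReviewLinkParser(HTMLParser):
    """Collect href values of <a> tags that look like individual review links."""

    def __init__(self, link_prefix: str = "/reviews/"):
        super().__init__()
        self.link_prefix = link_prefix  # assumption: review links start with this path
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Keep only review-like links and skip duplicates.
        if href and href.startswith(self.link_prefix) and href not in self.links:
            self.links.append(href)


def extract_review_links(html: str) -> list[str]:
    """Return absolute, de-duplicated review URLs found in one listing page."""
    parser = ReviewLinkParser()
    parser.feed(html)
    return [urljoin(BASE_URL, href) for href in parser.links]
```

Fetching is left out so the sketch runs offline. In practice each URL from `page_urls` would be requested and the response body passed to `extract_review_links`.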
Extract Review Details
**Information Extractor**:
Uses DeepSeek to extract structured data from the review:
Author: Name of the reviewer.
Rating: Numeric rating (1-5).
Date: Review date in YYYY-MM-DD format.
Title: Review title.
Text: Full review text.
Total Reviews: Number of reviews written by the reviewer.
Country: Reviewer’s country (2-letter code).
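The field list above can be expressed as a typed record with basic validation. The sketch below is one possible shape, assuming the extractor returns a dictionary with snake_case keys. The key names, the `Review` class, and `parse_review` are hypothetical and not part of the workflow.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Review:
    author: str
    rating: int         # 1-5
    review_date: date   # parsed from YYYY-MM-DD
    title: str
    text: str
    total_reviews: int  # number of reviews written by the reviewer
    country: str        # 2-letter code


def parse_review(raw: dict) -> Review:
    """Validate a raw extraction result and convert it to a Review."""
    rating = int(raw["rating"])
    if not 1 <= rating <= 5:
        raise ValueError(f"rating out of range: {rating}")
    country = str(raw["country"]).strip().upper()
    if len(country) != 2 or not country.isalpha():
        raise ValueError(f"country is not a 2-letter code: {country!r}")
    return Review(
        author=str(raw["author"]).strip(),
        rating=rating,
        review_date=date.fromisoformat(raw["date"]),  # raises ValueError on a bad date
        title=str(raw["title"]).strip(),
        text=str(raw["text"]).strip(),
        total_reviews=int(raw["total_reviews"]),
        country=country,
    )
```

Rejecting malformed records at this point keeps bad ratings or dates from reaching the sentiment and spreadsheet steps.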
Sentiment Analysis
**Sentiment Analysis Node**:
Uses OpenAI to classify the review text as Positive, Neutral, or Negative.
Example output:
{
"category": "Positive",
"confidence": 0.95
}
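A downstream consumer may want to check that output before storing it. The following Python sketch parses a result shaped like the example above. It assumes `confidence` is a number between 0 and 1, which the example suggests but the description does not state, and `parse_sentiment` is a hypothetical helper name.

```python
import json

# Labels named in the workflow description.
ALLOWED_CATEGORIES = {"Positive", "Neutral", "Negative"}


def parse_sentiment(payload: str) -> tuple[str, float]:
    """Parse the JSON result and check it against the expected labels."""
    data = json.loads(payload)
    category = data["category"]
    confidence = float(data["confidence"])
    if category not in ALLOWED_CATEGORIES:
        raise ValueError(f"unexpected category: {category!r}")
    # Assumption: confidence is reported on a 0-1 scale.
    if not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence outside [0, 1]: {confidence}")
    return category, confidence
```

If the categories in the Sentiment Analysis node are renamed, `ALLOWED_CATEGORIES` would need to change with them.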
Save to Google Sheets
**Google Sheets Node**:
Appends or updates the extracted data in a Google Sheet.
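The append-or-update behavior can be illustrated with an in-memory list standing in for the sheet. This sketch assumes rows are matched on the `Id` column. The description does not say which column the node actually matches on, and `upsert_row` is a hypothetical helper, not a Google Sheets API call.

```python
def upsert_row(rows: list[dict], new_row: dict, key: str = "Id") -> str:
    """Update the row whose key matches, otherwise append. Returns the action taken."""
    for i, row in enumerate(rows):
        if row.get(key) == new_row[key]:
            # Keep existing columns and overwrite them with the new values.
            rows[i] = {**row, **new_row}
            return "updated"
    rows.append(dict(new_row))
    return "appended"
```

For example, writing a row with `Id` "r1" twice leaves a single row holding the latest values, so re-running the workflow on the same reviews does not create duplicates.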
Set Up Steps
Configure Trustpilot Scraping
**Edit Fields1 Node**:
Set company_id to the Trustpilot company name, i.e. the value that follows /review/ in the company's Trustpilot URL.
Set max_page to limit the number of pages scraped.
Configure Google Sheets
**Google Sheets Node**:
Update the documentId with your Google Sheet ID.
Ensure the sheet has the required columns (Id, Data, Nome, etc.).
Configure OpenAI
**OpenAI Chat Model Node**:
Add your OpenAI API key.
**Sentiment Analysis Node**:
Ensure the categories match your desired sentiment labels (Positive, Neutral, Negative).
Key Components
**Nodes**:
HTTP Request/HTML: Scrape and parse Trustpilot reviews.
Information Extractor: Extract structured review data using DeepSeek.
Sentiment Analysis: Classify review sentiment.
Google Sheets: Save and update review data.
**Credentials**:
OpenAI API key.
DeepSeek API key.
Google Sheets OAuth2.