Scrape any web page into structured JSON data with ScrapeNinja and AI

Disclaimer: This template only works on self-hosted for now, as it uses a community node.

Use Case Web scrapers often break due to web page layout changes. This workflow attempts to mitigate this problem by auto-generating web scraping data extractor code via LLM.

How It Works This workflow leverages ScrapeNinja n8n community node to: scrape webpage HTML, feed it into LLM (Google Gemini) and ask to write a JS extractor function code, then it executes the written JS extractor against scraped HTML to extract useful data from webpage (the code is safely executed in a sandbox)

Installation To install ScrapeNinja n8n node, in your self-hosted instance, go to Settings -> Community nodes, enter "n8n-nodes-scrapeninja", and install.

Make sure you are using at least v0.3.0.

See this in action: https://www.linkedin.com/feed/update/urn:li:activity:7289659870935490560/

0
Downloads
17135
Views
8.64
Quality Score
beginner
Complexity
Author:Anthony(View Original →)
Created:8/14/2025
Updated:8/25/2025

🔒 Please log in to import templates to n8n and favorite templates

Workflow Visualization

Loading...

Preparing workflow renderer

Comments (0)

Login to post comments