Extract Links and URLs from PDF documents using PDF.co

Name: Extract Links and URLs from PDF documents using PDF.co
Availability: InStock
Rating: 0.4 (1 reviews)
Author: Mauricio Perera

📝 Description

This workflow allows you to extract all links (URLs) contained in a PDF file by converting it to HTML via PDF.co and then extracting the URLs present in the resulting HTML.

Unlike the traditional Read PDF node, which only returns visible link text, this flow provides the full active URLs, making further processing and analysis easier.

📌 Use Cases

Extract all hyperlinks from PDF documents. Automate URL verification and monitoring within documents. Extract links from reports, contracts, catalogs, newsletters, or manuals. Prepare URLs for validation, classification, or storage.

🔗 Workflow Overview

User uploads a PDF file via a web form. The PDF is uploaded to PDF.co. The PDF is converted to HTML (preserving links). The converted HTML is downloaded. URLs are extracted from the HTML using a custom code node.

⚙️ Node Breakdown

Load PDF (formTrigger)

Uploads a .pdf file. Single file upload.

Upload (PDF.co API)

Uploads the PDF file to PDF.co using binary data.

PDF to HTML (PDF.co API)

Converts the uploaded PDF to HTML using its URL.

Get HTML (HTTP Request)

Downloads the converted HTML from PDF.co.

Code1 (Function / Code)

Parses the HTML content to extract all URLs (http, https, www). Uses a regex to identify URLs within the HTML text. Outputs an array of objects containing the extracted URLs.

📎 Requirements

Active PDF.co account with API key. Set up PDF.co credentials in n8n (PDF.co account). Enable webhook to expose the upload form.

🛠️ Suggested Next Steps

Add nodes to validate extracted URLs (e.g., HTTP requests to check status). Store URLs in a database, spreadsheet, or send via email. Extend the flow to filter URLs by domain, type, or pattern.

📤 Importing the Template

Import this workflow into n8n via Import workflow and paste the provided JSON.

If you want help adding extra steps or optimizing the URL extraction, just ask!

If you want, I can also prepare this as a Canva visual template for you. Would you like that?

0

Downloads

2

Views

7.24

Quality Score

beginner

Complexity

Category:Data Processing

Author:Mauricio Perera(View Original →)

Created:8/13/2025

Updated:2/14/2026

Related Templates

Create a Speech-to-Text API with OpenAI GPT4o-mini Transcribe

Description This template provides a simple and powerful backend for adding speech-to-text capabilities to any applicat...

Data Processing3 downloads

Automate Daily Keyword Research with Google Sheets, Suggest API & Custom Search

Who's it for This workflow is perfect for SEO specialists, marketers, bloggers, and content creators who want to automa...

Data Processing2 downloads

USDT And TRC20 Wallet Tracker API Workflow for n8n

Overview This n8n workflow is specifically designed to monitor USDT TRC20 transactions within a specified wallet. It u...

Extract Links and URLs from PDF documents using PDF.co

Tags

Related Templates

Create a Speech-to-Text API with OpenAI GPT4o-mini Transcribe

Automate Daily Keyword Research with Google Sheets, Suggest API & Custom Search

USDT And TRC20 Wallet Tracker API Workflow for n8n

Workflow Visualization

Loading...

Comments (0)