Automate LLM Testing with GPT-4 Judge & Google Sheets Tracking

Rating: 0.4 (1 review)
Author: Adam Janes

How it works: The workflow loads a list of test cases from a Google Sheet (previous results stored from an LLM). For each test case, it calls an LLM judge, with the calls running in parallel (using HTTP Request + Webhook nodes). The judge uses the Input, Output, and Reference Answer fields from the spreadsheet to mark each LLM response as Pass/Fail. The results are logged to a separate sheet in the same Sheets file.
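The judging step described above can be sketched outside n8n in a few lines of Python. This is a minimal illustration, not the template's actual implementation: the prompt wording, the "Result" column name, the function names, and the fake_judge stand-in for the real chat-model call are all assumptions. Only the Input, Output, and Reference Answer field names come from the description.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# Field names come from the description above; the prompt wording is an assumption.
JUDGE_PROMPT = (
    "You are grading an LLM response.\n"
    "Input: {input}\n"
    "Output: {output}\n"
    "Reference Answer: {reference}\n"
    "Reply with exactly one word: Pass or Fail."
)


def build_prompt(case: dict) -> str:
    """Fill the judge prompt from one spreadsheet row."""
    return JUDGE_PROMPT.format(
        input=case["Input"],
        output=case["Output"],
        reference=case["Reference Answer"],
    )


def parse_verdict(reply: str) -> str:
    """Map a free-text judge reply onto Pass/Fail; anything unclear counts as Fail."""
    return "Pass" if reply.strip().lower().startswith("pass") else "Fail"


def judge_all(
    cases: list[dict], call_judge: Callable[[str], str], workers: int = 4
) -> list[dict]:
    """Judge every case concurrently and return rows ready to log."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map keeps the replies in the same order as the input cases.
        replies = list(pool.map(lambda c: call_judge(build_prompt(c)), cases))
    # "Result" is a hypothetical column name for the logged verdict.
    return [{**c, "Result": parse_verdict(r)} for c, r in zip(cases, replies)]


def fake_judge(prompt: str) -> str:
    """Hypothetical stand-in for the real chat-model call: exact match passes."""
    fields = dict(line.split(": ", 1) for line in prompt.splitlines()[1:4])
    return "Pass" if fields["Output"] == fields["Reference Answer"] else "Fail"
```

In a real run, fake_judge would be swapped for a function that sends the prompt to a chat model and returns its reply. Treating any unclear reply as Fail is a deliberately conservative choice, so a malformed judge response never counts as a passing test.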

Set up steps: Add your credentials for Google Sheets and OpenRouter (or replace the OpenRouter node with your favourite chat model). Make a copy of the example Sheet and populate it with your own test data. Run the workflow with the Execute Workflow button next to the Manual Trigger node.

Quality Score: 8.94
Complexity: intermediate
Category: Data Processing
Created: 8/13/2025
Updated: 2/24/2026

