Evaluate RAG Response Accuracy with OpenAI: Document Groundedness Metric
This n8n template demonstrates how to calculate the evaluation metric "RAG document groundedness", which in this scenario measures the agent's ability to provide or reference only information contained in the documents retrieved from the vector store.
The scoring approach is adapted from Google's Vertex AI pointwise groundedness metric template: https://cloud.google.com/vertex-ai/generative-ai/docs/models/metrics-templates#pointwise_groundedness
How it works
This evaluation works best for an agent that retrieves documents from a vector store or similar source. For scoring, we collect the agent's response and the retrieved documents, then use an LLM to assess whether the former is based on the latter. A key factor is to watch for information in the response that is not mentioned in the documents. A high score indicates LLM adherence and alignment with the source material, whereas a low score could signal an inadequate prompt or model hallucination. A minimal sketch of this LLM-as-judge step follows below.
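Outside of n8n, the same judge step can be reproduced with a short standalone script. The following is a minimal sketch, not the template's actual node configuration: it assumes the openai npm package (v4+), an OPENAI_API_KEY environment variable, the gpt-4o-mini model name, and an illustrative 0/1 rubric modeled on the Vertex AI template linked above.

```typescript
// Minimal standalone sketch of the LLM-as-judge groundedness check.
// Assumptions: openai npm package v4+, OPENAI_API_KEY in the environment,
// and an illustrative binary rubric; the template itself runs this logic
// inside n8n nodes rather than a script.
import OpenAI from "openai";

const client = new OpenAI();

async function scoreGroundedness(
  response: string,
  documents: string[],
): Promise<{ score: number; reasoning: string }> {
  const completion = await client.chat.completions.create({
    model: "gpt-4o-mini", // any capable judge model works here
    response_format: { type: "json_object" },
    messages: [
      {
        role: "system",
        content:
          "You are an evaluator. Decide whether the RESPONSE contains only " +
          "information found in the DOCUMENTS. Return JSON: " +
          '{"score": 0 or 1, "reasoning": "..."}. ' +
          "Score 1 if every claim is supported by the documents; " +
          "score 0 if the response introduces unsupported information.",
      },
      {
        role: "user",
        content: `DOCUMENTS:\n${documents.join("\n---\n")}\n\nRESPONSE:\n${response}`,
      },
    ],
  });
  return JSON.parse(completion.choices[0].message.content ?? "{}");
}

// Example usage: a response that invents a detail should score 0.
scoreGroundedness(
  "The refund window is 30 days and includes free return shipping.",
  ["Our refund policy allows returns within 30 days of purchase."],
).then((result) => console.log(result));
```

Averaging these binary scores across a test set (such as the sample Google Sheet below) gives an overall groundedness rate for the agent.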
Requirements
n8n version 1.94+
Check out this Google Sheet for sample data: https://docs.google.com/spreadsheets/d/1YOnu2JJjlxd787AuYcg-wKbkjyjyZFgASYVV0jsij5Y/edit?usp=sharing