Evaluate AI Agent Response Relevance using OpenAI and Cosine Similarity

Author: Jimleuk

This n8n template demonstrates how to calculate the evaluation metric "Relevance", which in this scenario measures how relevant the agent's response is to the user's question.

The scoring approach is adapted from the open-source evaluations project RAGAS. The source is available at https://github.com/explodinggradients/ragas/blob/main/ragas/src/ragas/metrics/_answer_relevance.py

How it works

This evaluation works best for Q&A agents. To compute the score, we take the agent's response and ask another AI model to generate a question from it. This generated question is then compared to the original question using cosine similarity. A high score indicates that the response is relevant and that the agent successfully answered the question, whereas a low score means the agent may have added too much irrelevant information, gone off script, or hallucinated.
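The scoring steps above can be sketched in Python as follows. This is a minimal illustration and not the template's actual node configuration: `relevance_score`, `generate_question`, and `embed` are hypothetical names, and the two callables stand in for whichever chat model and embedding model you connect (for example, OpenAI models).

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Assumption: an all-zero embedding is treated as "no similarity".
        return 0.0
    return dot / (norm_a * norm_b)


def relevance_score(
    question: str,
    response: str,
    generate_question: Callable[[str], str],  # hypothetical: wraps an LLM call
    embed: Callable[[str], Vector],           # hypothetical: wraps an embedding call
) -> float:
    """Score how well `response` answers `question`."""
    # Step 1: ask a model which question the response appears to answer.
    generated = generate_question(response)
    # Step 2: embed the original and generated questions, then compare them.
    return cosine_similarity(embed(question), embed(generated))
```

Passing the models in as callables keeps the sketch independent of any particular provider, so it can be tried with simple stubs before wiring in real API calls.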

Requirements

n8n version 1.94+

Check out this Google Sheet for sample data: https://docs.google.com/spreadsheets/d/1YOnu2JJjlxd787AuYcg-wKbkjyjyZFgASYVV0jsij5Y/edit?usp=sharing

Downloads: 0

Views: 559

Quality Score: 8.94

Complexity: intermediate

Category: AI & Machine Learning

Author: Jimleuk

Created: 8/13/2025

Updated: 2/7/2026
