Evaluation metric example: Check if tool was called

Name: Evaluation metric example: Check if tool was called
Availability: InStock
Rating: 0.4 (1 reviews)
Author: David Roberts

AI evaluation in n8n

This is a template for n8n's evaluation feature.

Evaluation is a technique for getting confidence that your AI workflow performs reliably, by running a test dataset containing different inputs through the workflow.

By calculating a metric (score) for each input, you can see where the workflow is performing well and where it isn't.

How it works

This template shows how to calculate a workflow evaluation metric: whether a specific tool was called by an agent.

We use an evaluation trigger to read in our dataset It is wired up in parallel with the regular trigger so that the workflow can be started from either one. More info We make sure that the agent outputs the list of tools that it used We then check whether the expected tool (from the dataset) is in that list Finally we pass this information back to n8n as a metric

Downloads

379

Views

8.94

Quality Score

intermediate

Complexity

Category:AI & Machine Learning

Author:David Roberts(View Original →)

Created:8/13/2025

Updated:3/1/2026

Related Templates

AI SEO Readability Audit: Check Website Friendliness for LLMs

Who is this for? This workflow is designed for SEO specialists, content creators, marketers, and website developers who ...

AI & Machine Learning5 downloads

Reply to Outlook Emails with OpenAI

Who is this template for? This template is for any Microsoft Outlook user who wants a trained AI agent to reason and rep...

AI & Machine Learning4 downloads

Use OpenRouter in n8n versions <1.78

What it is: In version 1.78, n8n introduced a dedicated node to use the OpenRouter service, which lets you to use a lot...

AI & Machine Learning3 downloads

🔒 Please log in to import templates to n8n and favorite templates

Workflow Visualization

Loading...

Preparing workflow renderer