Evaluation Metric: Summarization
This n8n template demonstrates how to calculate the evaluation metric "Summarization", which in this scenario measures the LLM's accuracy and faithfulness when producing summaries of an incoming YouTube transcript.
The scoring approach is adapted from https://cloud.google.com/vertex-ai/generative-ai/docs/models/metrics-templates#pointwise_summarization_quality
How it works
This evaluation works best for AI summarization workflows. For scoring, we simply compare the generated summary against the original transcript. A key factor is to look out for information in the summary that is not mentioned in the source transcript. A high score indicates LLM adherence and alignment, whereas a low score could signal an inadequate prompt or model hallucination. A minimal sketch of this scoring step is shown below.
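To make the scoring step concrete, here is a hedged TypeScript sketch of an LLM-as-judge check in the spirit of the pointwise summarization-quality rubric linked above. The names `callLlm`, `buildJudgePrompt`, and `scoreSummary` are illustrative and not part of the template; substitute your own model call (for example, a wrapper around your chat-model node).

```typescript
// Sketch only: the rubric below is an assumption modeled on Google's
// pointwise summarization-quality template (integer score 1-5, where 5
// means fully grounded in the transcript and 1 means invented content).

type LlmCall = (prompt: string) => Promise<string>;

function buildJudgePrompt(transcript: string, summary: string): string {
  return [
    "You are evaluating the quality of a summary of a YouTube transcript.",
    "Rate the summary from 1 (poor) to 5 (excellent) for groundedness:",
    "penalize any claim in the summary not supported by the transcript.",
    "Respond with the integer score only.",
    "",
    `## Transcript\n${transcript}`,
    "",
    `## Summary\n${summary}`,
  ].join("\n");
}

async function scoreSummary(
  callLlm: LlmCall, // hypothetical: supply your own model wrapper
  transcript: string,
  summary: string,
): Promise<number> {
  const raw = await callLlm(buildJudgePrompt(transcript, summary));
  const score = parseInt(raw.trim(), 10);
  if (Number.isNaN(score) || score < 1 || score > 5) {
    throw new Error(`Judge returned an unparseable score: ${raw}`);
  }
  return score;
}
```

In a workflow you would typically run `scoreSummary` once per transcript/summary pair and write the result back to your evaluation sheet, flagging low scores for manual review.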
Requirements
n8n version 1.94+
Check out this Google Sheet for sample data: https://docs.google.com/spreadsheets/d/1YOnu2JJjlxd787AuYcg-wKbkjyjyZFgASYVV0jsij5Y/edit?usp=sharing