Evaluate AI workflows using Google Sheets, Gemini, Claude, GPT, and Perplexity

Go to Workflow
1 views
Built by Ryan Nolan Ryan Nolan
Created on June 06, 2026

Description

This template and YouTube video goes over
5 different implementations of evaluations within n8n.

Categorization
Correctness
Tools used
String similarity
Helpfulness

You’ll learn when to use each type, how to set up test datasets in Google Sheets or data tables, and how to track your results over time. I also explain best practices like only changing one variable at a time, documenting your prompts and model settings, and building proper training datasets with enough examples to confidently validate your workflow.

YouTube Video: https://www.youtube.com/watch?v=-4LXYOhQ-Z0

Thank you for downloading our free n8n Evaluations template. If you enjoyed the template + tutorial please subscribe to the YouTube channel. We are uploading weekly content on AI/n8n

Connect With Us
Check out the links down below. If you need help with this template, want 1:1 coaching, or have a n8n project you want to build, reach out at [email protected]

Free Skool AI/n8n Group: https://www.skool.com/data-and-ai
LinkedIn: https://www.linkedin.com/in/ryan-p-nolan/
Twitter/X:https://x.com/RyanMattDS
Website: https://ryanandmattdatascience.com/

Nodes Used (8)

AI Agent
@n8n/n8n-nodes-langchain.agent
Anthropic Chat Model
@n8n/n8n-nodes-langchain.lmChatAnthropic
Evaluation
n8n-nodes-base.evaluation
Gmail
n8n-nodes-base.gmail
Google Gemini Chat Model
@n8n/n8n-nodes-langchain.lmChatGoogleGemini
Information Extractor
@n8n/n8n-nodes-langchain.informationExtractor
OpenAI Chat Model
@n8n/n8n-nodes-langchain.lmChatOpenAi
Summarization Chain
@n8n/n8n-nodes-langchain.chainSummarization