What are Evaluations?

Evaluations are automated tests that assess your agent’s performance after each call. They help you measure success rates, identify issues, and maintain quality standards across all conversations.
Evaluations provide quantitative metrics to track your agent’s performance over time.

How Evaluations Work

Each evaluation is a yes/no question that the AI analyzes based on the conversation transcript. After every call, Meetzy runs all configured evaluations and stores the results.
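For illustration, the stored result of running evaluations on a call might look like the sketch below. The field names (call_id, result, and so on) are assumptions for illustration, not Meetzy’s documented schema; the key point is that each evaluation resolves to a yes/no answer per call.

```python
# Hypothetical shape of the results stored after a call.
# Field names ("call_id", "evaluations", "result") are illustrative
# assumptions, not a documented schema.
call_evaluation_results = {
    "call_id": "call_12345",
    "evaluations": [
        {
            "name": "Appointment Scheduled",
            "question": "Did the customer schedule an appointment?",
            "result": "yes",   # every evaluation resolves to yes or no
        },
        {
            "name": "Customer Satisfied",
            "question": "Did the customer express satisfaction with the service?",
            "result": "no",
        },
    ],
}
```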

Example Evaluations

| Evaluation | Question |
| --- | --- |
| Appointment Scheduled | Did the customer schedule an appointment? |
| Customer Satisfied | Did the customer express satisfaction with the service? |
| Issue Resolved | Was the customer’s issue successfully resolved? |
| Information Collected | Did the agent collect all required information? |
| Objection Handled | Did the agent successfully address customer objections? |

AI Copilot Suggestions

The AI Copilot can automatically suggest performance metrics and evaluation criteria based on your agent’s purpose and conversation patterns.

Automatic Evaluation Recommendations

The AI Copilot analyzes your agent configuration and suggests relevant evaluations:
  1. Context-Aware Suggestions: Based on your agent’s role (sales, support, appointments)
  2. Conversation Analysis: Recommendations from analyzing test conversations
  3. Industry Best Practices: Standard metrics for your use case
  4. Performance Optimization: Evaluations that identify improvement opportunities

Using Copilot Suggestions

  1. Open AI Copilot: Click the sparkles icon in the right panel to access AI Copilot.
  2. Request Evaluation Suggestions: Type something like “Suggest evaluations for my agent” or use the quick actions.
  3. Review Recommendations: The Copilot will analyze your agent and suggest specific evaluation criteria.
  4. Accept or Customize: Accept suggestions directly or modify them to fit your specific needs.

Copilot Suggestion Types

One common type is goal-focused suggestions, centered on your agent’s primary objectives:
  • “Did the agent achieve the call objective?”
  • “Was the primary goal accomplished?”
  • “Did the conversation meet success criteria?”

Applying Copilot Suggestions

When the AI Copilot suggests evaluations, they appear in a purple suggestion banner.
Review Copilot suggestions carefully before accepting them. While AI recommendations are based on best practices, they should be customized for your specific use case.
  1. Review Suggestions: Examine the suggested evaluation name and question for relevance.
  2. Customize as Needed: Modify the evaluation question to match your specific requirements.
  3. Accept or Reject: Use the “Accept All” button to add all suggestions, or reject to dismiss them.
  4. Fine-tune: Edit accepted evaluations in the main interface to perfect them.

Creating Evaluations

  1. Open Evaluations Section: Navigate to the Evaluations section in the left panel (✅ icon).
  2. Add Evaluation: Click “Add Evaluation” to create a new test.
  3. Enter Name: Give your evaluation a descriptive name.
  4. Write Question: Create a clear yes/no question that can be answered from the transcript.
  5. Save: Save the evaluation to add it to your agent.

Evaluation Properties

| Property | Description | Example |
| --- | --- | --- |
| Name | Short identifier | “Appointment Scheduled” |
| Question | Yes/no prompt for the AI | “Did the customer schedule an appointment during this call?” |
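As a minimal sketch, an evaluation can be modeled as a simple name/question pair. The class below is illustrative only, not a Meetzy SDK type.

```python
from dataclasses import dataclass

# Illustrative model of an evaluation's two properties; not a Meetzy SDK type.
@dataclass
class Evaluation:
    name: str      # short identifier shown alongside results
    question: str  # yes/no prompt the AI answers from the transcript

appointment_scheduled = Evaluation(
    name="Appointment Scheduled",
    question="Did the customer schedule an appointment during this call?",
)
```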

Writing Effective Questions

The clearer your question, the more accurate the evaluation results will be.

Good Questions

  • ✅ “Did the customer agree to schedule a follow-up call?”
  • ✅ “Did the agent verify the customer’s contact information?”
  • ✅ “Was the customer’s primary concern addressed during the call?”
  • ✅ “Did the agent mention the current promotion?”

Poor Questions

  • ❌ “Was the call good?” (too vague)
  • ❌ “How satisfied was the customer?” (not yes/no)
  • ❌ “Rate the agent’s performance” (not answerable from transcript)

Copilot-Enhanced Question Writing

The AI Copilot can help improve your evaluation questions:
  1. Clarity Enhancement: Makes vague questions more specific
  2. Binary Conversion: Converts complex questions into yes/no format
  3. Context Addition: Adds relevant context for better evaluation accuracy
  4. Best Practice Alignment: Ensures questions follow evaluation best practices

Using Templates

Meetzy provides pre-built evaluation templates for common use cases:
  • “Did the customer express interest in the product?”
  • “Was pricing discussed during the call?”
  • “Did the agent attempt to close the sale?”
  • “Were all customer objections addressed?”

Applying Templates

  1. Click Templates: Click the “Templates” button in the Evaluations section.
  2. Browse Categories: Select a category relevant to your use case.
  3. Select Evaluations: Choose which evaluations to add.
  4. Customize: Modify the questions to match your specific needs.

Managing Evaluations

Editing Evaluations

  1. Click the edit icon (✏️) next to any evaluation
  2. Modify the name or question
  3. Save changes

Deleting Evaluations

  1. Click the delete icon (🗑️) next to the evaluation
  2. Confirm deletion
Deleting an evaluation removes it from future calls. Historical evaluation data is preserved.

Reordering Evaluations

Drag and drop evaluations to change their display order.

Viewing Results

Evaluation results are available in multiple places:
| Location | What You See |
| --- | --- |
| Call Details | Individual evaluation results for each call |
| Analytics Dashboard | Aggregated success rates over time |
| Webhook Output | Raw evaluation data sent to your systems |
| API Response | Programmatic access to results |
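If you consume results through the webhook output or API, a small summarizer like the sketch below can turn per-call answers into a pass/fail overview. The payload shape is an assumption carried over from the earlier example, not Meetzy’s documented webhook schema.

```python
# Hedged sketch: summarize one call's evaluation results from a webhook payload.
# The payload keys ("call_id", "evaluations", "result") are assumptions.
def summarize_evaluations(payload: dict) -> dict:
    results = payload.get("evaluations", [])
    passed = [e["name"] for e in results if e.get("result") == "yes"]
    failed = [e["name"] for e in results if e.get("result") == "no"]
    return {
        "call_id": payload.get("call_id"),
        "passed": passed,
        "failed": failed,
        "success_rate": len(passed) / len(results) if results else None,
    }
```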

Best Practices

Start with Copilot Suggestions

Begin by asking the AI Copilot to suggest evaluations. This provides a solid foundation based on your agent’s configuration and industry best practices.

Evaluation Quality Guidelines

  1. Be Specific: The more specific your question, the more reliable the evaluation
  2. Avoid Overlap: Don’t create multiple evaluations that measure the same thing
  3. Start Simple: Begin with 3-5 core evaluations that measure your most important KPIs
  4. Review Regularly: Periodically review evaluation results to calibrate questions
  5. Use for Training: Low-scoring evaluations highlight areas for agent improvement

Copilot Integration Tips

  • Iterate with AI: Use the Copilot to refine and improve existing evaluations
  • Context Matters: Provide context about your business when asking for suggestions
  • Test Suggestions: Use the Playground to validate Copilot-suggested evaluations
  • Combine Sources: Mix Copilot suggestions with templates and custom evaluations

Common Evaluation Patterns

Goal Achievement

Name: Primary Goal Met
Question: Did the call achieve its primary objective (sale, appointment, resolution, etc.)?

Compliance

Name: Disclosure Made
Question: Did the agent provide the required legal disclosure at the beginning of the call?

Quality

Name: Professional Tone
Question: Did the agent maintain a professional and courteous tone throughout the call?

Data Collection

Name: Contact Info Verified
Question: Did the agent verify or collect the customer's email and phone number?
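If you maintain several agents, it can help to keep these common patterns as plain data so they are easy to review and reuse. The snippet below is only a sketch of that idea; it prints the patterns rather than calling any real Meetzy API.

```python
# Illustrative sketch: keep common evaluation patterns as plain data so they
# can be reviewed, versioned, and added to each agent through the UI.
# No real Meetzy API calls are made here.
COMMON_EVALUATIONS = [
    {"name": "Primary Goal Met",
     "question": "Did the call achieve its primary objective (sale, appointment, resolution, etc.)?"},
    {"name": "Disclosure Made",
     "question": "Did the agent provide the required legal disclosure at the beginning of the call?"},
    {"name": "Professional Tone",
     "question": "Did the agent maintain a professional and courteous tone throughout the call?"},
    {"name": "Contact Info Verified",
     "question": "Did the agent verify or collect the customer's email and phone number?"},
]

for evaluation in COMMON_EVALUATIONS:
    print(f"{evaluation['name']}: {evaluation['question']}")
```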

Next Steps