Conversation Quality is a binary evaluation metric that assesses whether a chatbot interaction left the user feeling satisfied and positive, or frustrated and dissatisfied, based on tone, engagement, and overall experience.
Create the agent efficiency metric
This metric needs to be manually created, using a prompt defined by Galileo.1
Create a new LLM-as-a-judge metric
Create a new LLM-as-a-judge metric by following the instructions in our LLM-as-a-judge concept guide.Use the following settings:
Setting | Value |
---|---|
Name | Conversation quality |
LLM Model | Select your preferred model |
Apply to | Session |
Advanced Settings | Configure these as required for your needs |
2
Set the prompt
Set the prompt to the following:
Prompt
3
Save the metric
Save the metric, then turn it on for your Log stream.