Measure semantic equivalence between model outputs and reference answers using Galileo’s Guardrail Metrics to ensure alignment with expected responses.
output column of your experiment’s dataset.
Model Request
Prompt Engineering
Multiple Evaluations
Result Analysis
Score Calculation
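The step names above suggest that several independent evaluations are collected and then reduced to a single score. The sketch below illustrates one plausible way to do that final reduction. It rests on assumptions not stated in this page: that each evaluation yields a yes/no equivalence judgment, and that the score is the fraction of "yes" judgments. The function name `adherence_score` is hypothetical and is not part of any Galileo API.

```python
from typing import Sequence


def adherence_score(judgments: Sequence[bool]) -> float:
    """Return the fraction of evaluations that judged the model output
    semantically equivalent to the reference answer.

    Assumption: each evaluation is a boolean (True = equivalent).
    This is an illustrative reduction, not Galileo's documented formula.
    """
    if not judgments:
        raise ValueError("at least one judgment is required")
    # True counts as 1 and False as 0, so the sum is the number of "yes" votes.
    return sum(judgments) / len(judgments)


# Hypothetical example: four evaluations, three of which say "equivalent".
votes = [True, True, False, True]
print(adherence_score(votes))  # 0.75
```

Under these assumptions, a score near 1.0 means most evaluations agreed that the output matches the reference, and a score near 0.0 means most did not.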