Explore Galileo’s comprehensive out-of-the-box metrics for evaluating and improving AI system performance across multiple dimensions
Galileo provides a comprehensive suite of pre-built metrics designed to evaluate various aspects of AI system performance without requiring custom implementation.
These metrics span across five categories including:
Each metric addresses specific evaluation needs, from measuring factual correctness to detecting potential biases or tracking tool usage effectiveness.
These metrics apply to different node types (such as session, trace, or different span types), depending on the metric.
Use the sortable, filterable table below to explore all available native metrics and find the right measurements for your AI applications.