Learn how to create custom LLM-as-a-Judge metrics to evaluate domain-specific applications within Galileo
Create a new project from the Galileo Console using the `New Project button`
Get your Galileo API Keys
Clone the project in your IDE of choice.
sdk-examples/python/agent/startup-simulator-3000
folder in the cloned repo.Set up a virtual environment and install dependencies.
Configure your .env file.
.example.env
file, copy it, renaming it to .env
and add in your own variables. Be sure the variables are added to your .gitignore
file.When complete, it should look something like this:Start the Flask app and test out the application
Add metrics to your log stream in Galileo
Add custom metrics
Create your own LLM-as-a-Judge prompt
Test your metric
Add your Custom Metric