Learn best practices for prompt engineering with custom LLM-as-a-judge metrics
boolean
, you must explain what constitutes true
and what constitutes false
. If it is categorical, you must define every category.input
and output
. For example, in your prompt you might have something like “Validate that the provided output is relevant based on the provided input”.