DOC: Defining evaluation "success" #654

Open
angadn opened this issue Jan 28, 2025 · 0 comments
angadn commented Jan 28, 2025

This quickstart now shows that evaluator callables can return a dict, optionally naming the metric with the `key` field. That's great, and it saves me the weird `fn.__name__` override I was doing previously.
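For concreteness, here's a minimal sketch of the dict-returning shape I mean (the evaluator name `exact_match` and the `(outputs, reference_outputs)` signature are just my own illustration, not necessarily the exact API):

```python
# Sketch: an evaluator callable that returns a dict instead of a bare score.
# The "key" field names the metric, replacing the old fn.__name__ hack.
def exact_match(outputs: dict, reference_outputs: dict) -> dict:
    score = outputs.get("answer") == reference_outputs.get("answer")
    return {"key": "exact_match", "score": score}
```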

However, I notice the UI shows a success badge, and I'm not sure what it means. Does it indicate that no exceptions were thrown, or can I somehow define a threshold per metric that decides whether an evaluation counts as a success?

Is this somehow possible with the dict too? I'd expect `evaluate`/`aevaluate` to take some kind of lambda that sets the success field, but I can't find anything to that effect.
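In the meantime, the workaround I'm considering is baking the threshold into the evaluator itself, so the metric's `score` is already a pass/fail boolean (this is purely my own sketch, not a documented pattern; `make_threshold_evaluator` is a hypothetical helper):

```python
# Hypothetical workaround: wrap a numeric metric in a threshold so the
# returned score is a boolean, sidestepping the missing "success lambda".
def make_threshold_evaluator(key: str, metric_fn, threshold: float):
    def evaluator(outputs: dict, reference_outputs: dict) -> dict:
        value = metric_fn(outputs, reference_outputs)
        # Score is True only when the raw metric clears the threshold.
        return {"key": key, "score": value >= threshold}
    return evaluator

# Example: treat answers of at least 10 characters as passing.
length_ok = make_threshold_evaluator(
    "length_ok",
    lambda outputs, reference_outputs: len(outputs.get("answer", "")),
    threshold=10,
)
```

This at least makes the per-metric pass/fail visible in the results, even if it doesn't drive the UI's success badge directly.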
