You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This quickstart now shows that evaluator callables can return a dict, optionally naming the metric with the key field. That's great, and saves me the weird fn.__name__ override I was doing previously
However, I notice the UI shows a success badge, and I'm not sure what it means - does this mean no exceptions were thrown, or am I able to somehow define thresholds for each metric to consider an evaluation as a success?
Is this somehow possible with the dict too? I'd expect evaluate/aevaluate to take some kind of lambda that sets the success field, but can't find something to this effect
The text was updated successfully, but these errors were encountered:
This quickstart now shows that evaluator callables can return a
dict
, optionally naming the metric with thekey
field. That's great, and saves me the weirdfn.__name__
override I was doing previouslyHowever, I notice the UI shows a success badge, and I'm not sure what it means - does this mean no exceptions were thrown, or am I able to somehow define thresholds for each metric to consider an evaluation as a success?
Is this somehow possible with the dict too? I'd expect
evaluate
/aevaluate
to take some kind of lambda that sets thesuccess
field, but can't find something to this effectThe text was updated successfully, but these errors were encountered: