Eval Harness

Last run2026-05-27T01:02:44
Modelcoach-v2:latest
Cases14/320
Statuscomplete
voice_fidelity_median4.0
banned_phrase_total0
register_match_rate0.536
regret_low_rate0.518
would_send_rate0.554

Full report →