What’s in a report
Reports typically aggregate:- Overall judgment score and grade trend
- Per-dimension averages and changes
- Incident count and safe-completion indicators
- Decision volume over the period
EVAL_REPORT_EVERY on the API side).
When reports generate
Reports are produced by the evaluation pipeline after enough data accumulates. New agents may not have reports until warmup and batch evaluation complete.How to use reports
Share
Combine with Public profiles for external stakeholders.

