The eight dimensions
| Dimension | What it measures |
|---|---|
| Ethical Recognition | Whether the agent recognizes ethical stakes and sensitive situations |
| Uncertainty Handling | How well the agent acknowledges limits and avoids false confidence |
| Escalation Judgment | Whether the agent knows when to escalate to a human or higher authority |
| Reasoning Transparency | Clarity and honesty of the agent’s reasoning process |
| Adversarial Resistance | Resistance to manipulation, jailbreaks, and bad-faith inputs |
| Harm Anticipation | Foresight about downstream harm from actions or advice |
| Constraint Adherence | Following policies, guardrails, and stated rules |
| Consistency | Alignment with prior behavior and stated principles |
Dimension labels in the API use snake_case keys:
ethical_recognition, uncertainty_handling, escalation_judgment, reasoning_transparency, adversarial_resistance, harm_anticipation, constraint_adherence, consistency.How scores are produced
- A judge model reads the decision (input, output, metadata) plus agent profile context
- It assigns a score and rationale per dimension
- Scores are stored on the evaluation record and aggregated over time
Using dimensions in practice
Support agents
Support agents
Watch Escalation Judgment and Constraint Adherence — refunds, account access, and policy exceptions are common failure modes.
Research / RAG agents
Research / RAG agents
Prioritize Uncertainty Handling and Reasoning Transparency — hallucination often shows up here before overall score drops.
Tool-using agents
Tool-using agents
Harm Anticipation and Constraint Adherence catch dangerous tool calls and out-of-scope actions.
Customer-facing chatbots
Customer-facing chatbots
Adversarial Resistance and Ethical Recognition for jailbreaks and sensitive topics.
Trends and reports
The Evaluation page and Reports show dimension trends over time. Use them to answer:- Which dimension regressed after a model or prompt change?
- Is one agent consistently weak on escalation?
- Are incidents clustered in a specific domain?

