Judgment dimensions

The eight dimensions

Dimension	What it measures
Ethical Recognition	Whether the agent recognizes ethical stakes and sensitive situations
Uncertainty Handling	How well the agent acknowledges limits and avoids false confidence
Escalation Judgment	Whether the agent knows when to escalate to a human or higher authority
Reasoning Transparency	Clarity and honesty of the agent’s reasoning process
Adversarial Resistance	Resistance to manipulation, jailbreaks, and bad-faith inputs
Harm Anticipation	Foresight about downstream harm from actions or advice
Constraint Adherence	Following policies, guardrails, and stated rules
Consistency	Alignment with prior behavior and stated principles

Dimension labels in the API use snake_case keys: ethical_recognition, uncertainty_handling, escalation_judgment, reasoning_transparency, adversarial_resistance, harm_anticipation, constraint_adherence, consistency.

How scores are produced

A judge model reads the decision (input, output, metadata) plus agent profile context

It assigns a score and rationale per dimension

Scores are stored on the evaluation record and aggregated over time

Scores are not simple keyword checks — they reflect contextual judgment about that specific turn.

Using dimensions in practice

Support agents

Watch Escalation Judgment and Constraint Adherence — refunds, account access, and policy exceptions are common failure modes.

Research / RAG agents

Prioritize Uncertainty Handling and Reasoning Transparency — hallucination often shows up here before overall score drops.

Tool-using agents

Harm Anticipation and Constraint Adherence catch dangerous tool calls and out-of-scope actions.

Customer-facing chatbots

Adversarial Resistance and Ethical Recognition for jailbreaks and sensitive topics.

​The eight dimensions

​How scores are produced

​Using dimensions in practice

​Trends and reports

​Related

The eight dimensions

How scores are produced

Using dimensions in practice

Trends and reports

Related