The loop
1. Connect
You integrate once:- Create a workspace (automatic on signup)
- Define an agent profile — name, role, and context
- Generate an API key linked to that profile
- Add the SDK or call the REST API after each agent turn
agent_id, session_id, domain).
2. Evaluate
The evaluation engine scores each decision on eight judgment dimensions — ethical recognition, uncertainty handling, escalation judgment, and more. Scores are evidence-backed, not binary pass/fail. Evaluation runs asynchronously after ingest. Agents warm up after a minimum number of decisions; then batch evaluation and periodic reports kick in.3. Improve
Use the app to act on results:- Command center — live metrics and agent health
- Monitoring — decision stream and filters
- Evaluation — dimension breakdowns and trends
- Reports — periodic judgment summaries
- Inbox — incidents and items needing review
- Public profiles — shareable agent judgment pages
Design principles
Zero impact on agent latency
Zero impact on agent latency
SDK sends are async and fire-and-forget. If Histeeria is unreachable, your agent continues normally — failures are silent by design.
Workspace-scoped security
Workspace-scoped security
API keys belong to your workspace. All decisions and evaluations are isolated to your organization.
Profile-aware evaluation
Profile-aware evaluation
Agent profiles give the judge model context about what this agent is supposed to do, improving score relevance.
Evidence over vibes
Evidence over vibes
Each dimension score is tied to reasoning about the specific decision — useful for audits and debugging.
Architecture at a glance
| Component | Role |
|---|---|
| app.histeeria.com | Web UI — dashboards, profiles, settings |
| api.histeeria.com | REST API — ingest, auth, evaluation data |
| SDK | Thin client — queue, batch, retry |
| Evaluation pipeline | LLM judge + aggregation + incident detection |
Next
- Decisions and evaluation — data model deep dive
- Judgment dimensions — what each score means
- Quickstart — integrate in minutes

