EvalGuard™
  • Features
  • Solutions
  • Pricing
  • Compare
  • Alternatives
  • Trust
  • Docs
Book a DemoSign InGet Started

Documentation

OverviewGetting StartedConceptsAPI ReferenceCLI ReferenceTypeScript SDKPython SDKGo SDKScorersAttack PluginsProvidersIntegrationsSelf-HostingVPC DeploymentCompliance

Concepts

Mental models for EvalGuard

The API reference catalogues every endpoint, but those pages assume you already know the difference between a scorer and a firewall, what the regeneration loop does, and when an evaluation mode is "basic" vs "deep". Read these seven pages first if you're new to the platform — they're the smallest set of concepts that make the rest of the docs parse.

  • Evaluation modes — basic vs deep

    Cheap ML scorers vs LLM-as-judge rubrics. Cost/latency tradeoffs and when to pick each.

  • Scoring thresholds

    0–1 score scale, the 0.8 default, MIN-of-dims gate semantics, calibrating thresholds for your domain.

  • The regeneration loop

    Evaluate → if-failing-then-regenerate → re-evaluate. Stop conditions, cost-budget gate, audit row shape.

  • Policy engine

    Declarative rules that map score thresholds and dim verdicts to actions (block, regenerate, redact, log).

  • Agent checkpoints

    Input injection scan → tool-call gate → tool-result scan. Three places to insert safety in an agent loop.

  • Red teaming

    Plugins (what to test) × strategies (how to obfuscate). 249 × 42 surface; choosing the right subset.

  • Firewall vs scorer

    Sub-3ms inline gate vs LLM-judged eval. When each fires, how they compose, why you want both.

EvalGuard™

AI evals, firewall, and compliance — one platform.

Subscribe to updates

Product

  • Features
  • Pricing
  • Changelog
  • Docs
  • Blog

Company

  • About
  • Contact Sales
  • Support
  • Privacy
  • Terms
  • DPA
  • MSA

Resources

  • Documentation
  • Quickstart
  • Status
  • Trust Center
  • Security
  • Engineering
  • SLA
  • Incident Response

Compare

  • vs Promptfoo
  • vs DeepEval
  • vs Langfuse
  • vs Giskard
  • vs Maxim AI
  • vs Arize AI
  • Alternatives

Migrate

  • From Promptfoo
  • From Humanloop
  • From Helicone

Audited & mapped against

SOC 2EVIDENCE
SOC 2
ISO 42001AI MGMT
ISO 42001
ISO 27001INFOSEC
ISO 27001
EU AI ACTANNEX IV
EU AI Act
GDPREU READY
GDPR
HIPAAALIGNED
HIPAA
NIST RMFv1.0
NIST AI RMF
OWASPLLM TOP 10
OWASP LLM

© 2026EvalGuard™ Inc. All rights reserved. EvalGuard is a registered trademark.

View system status

Built with care by EvalGuard Inc.