GET /api/v1/evals/compare

Compare two eval runs

Compares two eval runs side by side, matching cases by input text. The response reports regressions, improvements, and per-case diffs.
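To make "matching cases by input text" concrete, here is a minimal local sketch of that idea. It is not the service's implementation: the `Case` type, its `input` and `score` fields, the "higher score is better" rule, and the shape of the returned dictionary are all assumptions chosen for illustration, since the response schema is not documented here.

```python
from dataclasses import dataclass


@dataclass
class Case:
    input: str    # hypothetical field: the case's input text, used as the match key
    score: float  # hypothetical field: assumed "higher is better"


def compare_runs(run_a, run_b):
    """Match cases by input text and classify score changes from run A to run B."""
    # Index run A by input text. If an input repeats, the last case wins.
    a_by_input = {case.input: case for case in run_a}
    result = {"regressions": [], "improvements": [], "unchanged": [], "unmatched": []}
    matched = set()

    for case_b in run_b:
        case_a = a_by_input.get(case_b.input)
        if case_a is None:
            # Present only in run B, so there is nothing to diff against.
            result["unmatched"].append(case_b.input)
            continue
        matched.add(case_b.input)
        delta = case_b.score - case_a.score
        diff = {"input": case_b.input, "a": case_a.score, "b": case_b.score, "delta": delta}
        if delta < 0:
            result["regressions"].append(diff)
        elif delta > 0:
            result["improvements"].append(diff)
        else:
            result["unchanged"].append(diff)

    # Cases present only in run A are also reported as unmatched.
    for case_a in run_a:
        if case_a.input not in matched:
            result["unmatched"].append(case_a.input)
    return result
```

Keying on the exact input text means any edit to a prompt between runs turns that case into an unmatched one rather than a diff, which is worth keeping in mind when reading comparison results.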

Authentication

Send an Authorization: Bearer YOUR_API_KEY header with every request. Generate API keys at /dashboard/api-keys.

Parameters

runA (string, in query, required)

ID of the first eval run.

runB (string, in query, required)

ID of the second eval run.
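Both parameters travel in the query string, alongside the Authorization header described above. The sketch below builds such a request with only the Python standard library. The helper name `build_compare_request` and the run IDs in the usage note are hypothetical; only the URL, the parameter names, and the header format come from this page.

```python
from urllib.parse import urlencode
from urllib.request import Request

COMPARE_URL = "https://evalguard.ai/api/v1/evals/compare"


def build_compare_request(run_a, run_b, api_key):
    """Build (but do not send) a GET request comparing two eval runs."""
    if not run_a or not run_b:
        # Both query parameters are marked as required.
        raise ValueError("runA and runB are both required")
    # urlencode percent-escapes the IDs so they are safe in a query string.
    query = urlencode({"runA": run_a, "runB": run_b})
    return Request(
        f"{COMPARE_URL}?{query}",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )
```

A request built this way, for example `build_compare_request("run_123", "run_456", os.environ["EVALGUARD_API_KEY"])` with placeholder IDs, can then be sent with `urllib.request.urlopen`.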

Response

200 Comparison result with regressions and improvements

Code samples

cURL

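# Replace RUN_A_ID and RUN_B_ID with the IDs of the two runs to compare.
curl -X GET \
  "https://evalguard.ai/api/v1/evals/compare?runA=RUN_A_ID&runB=RUN_B_ID" \
  -H "Authorization: Bearer $EVALGUARD_API_KEY"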

TypeScript

import { EvalGuard } from "@evalguard/sdk";

const client = new EvalGuard({ apiKey: process.env.EVALGUARD_API_KEY });

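// Both query parameters are required; replace the placeholder run IDs.
const response = await client.request({
  method: "GET",
  path: "/api/v1/evals/compare?runA=RUN_A_ID&runB=RUN_B_ID",
});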
console.log(response);

Python

from evalguard import EvalGuard
import os

client = EvalGuard(api_key=os.environ["EVALGUARD_API_KEY"])

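# Both query parameters are required; replace the placeholder run IDs.
response = client.request(
    method="GET",
    path="/api/v1/evals/compare?runA=RUN_A_ID&runB=RUN_B_ID",
)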
print(response)

Go

package main

import (
	"context"
	"fmt"
	"os"

	"github.com/evalguard/evalguard-go"
)

func main() {
	client := evalguard.NewClient(os.Getenv("EVALGUARD_API_KEY"))
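	// Both query parameters are required; replace the placeholder run IDs.
	path := "/api/v1/evals/compare?runA=RUN_A_ID&runB=RUN_B_ID"
	resp, err := client.Request(context.Background(), "GET", path, nil)
	if err != nil {
		panic(err)
	}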
	fmt.Println(resp)
}

Other Evals endpoints