Built-in Evaluators
Contributed Evaluators
| Evaluator | Name | Description |
|---|---|---|
| Luna-2 | galileo.luna2 | AI-powered detection via Galileo |
Regex Evaluator
Pattern matching using Google RE2 (safe from ReDoS attacks). Evaluator name:regex
| Option | Type | Required | Description |
|---|---|---|---|
pattern | string | Yes | Regular expression pattern (RE2 syntax) |
flags | list | No | Optional: ["IGNORECASE"] |
Examples
Examples
List Evaluator
Flexible value matching with multiple modes and logic options. Evaluator name:list
| Option | Type | Default | Description |
|---|---|---|---|
values | list | required | Values to match against |
logic | string | "any" | "any" = match any value, "all" = match all |
match_on | string | "match" | "match" = trigger when found, "no_match" = trigger when NOT found |
match_mode | string | "exact" | "exact" = full string match, "contains" = word-boundary match |
case_sensitive | bool | false | Case sensitivity |
match_mode="contains" uses word-boundary matching, not generic substring matching. For example, "admin" will match "admin user" but will NOT match "sysadministrator".Examples
Examples
Luna-2 Evaluator
AI-powered detection using Galileo’s Luna-2 small language models. Provides real-time, low-latency evaluation for complex patterns that can’t be caught with regex or lists. Evaluator name:galileo.luna2
Installation:
GALILEO_API_KEY environment variable where evaluations run.
| Option | Type | Default | Description |
|---|---|---|---|
metric | string | — | Metric to evaluate (required if stage_type="local") |
operator | string | — | "gt", "lt", "gte", "lte", "eq" |
target_value | number | — | Threshold value (0.0–1.0) |
stage_type | string | "local" | "local" or "central" |
galileo_project | string | — | Project name for logging |
on_error | string | "allow" | "allow" (fail open) or "deny" (fail closed) |
timeout_ms | int | 10000 | Request timeout (1000–60000 ms) |
| Metric | Description |
|---|---|
input_toxicity | Toxic/harmful content in user input |
output_toxicity | Toxic/harmful content in agent response |
input_sexism | Sexist content in user input |
output_sexism | Sexist content in agent response |
prompt_injection | Prompt manipulation attempts |
pii_detection | Personally identifiable information |
hallucination | Potentially false or fabricated statements |
tone | Communication tone analysis |
Examples
Examples