prompt-injection, bola, sql-injection, and so on. Each is a YAML file: the attacker LLM reads it to craft prompts, and the judge uses its pass/fail criteria to score the response.
A suite is a named bundle of evaluators. Pick one suite for a broad scan, or list individual evaluator IDs for a focused one.
Standard vs curated suites
- Standard suites (
owasp-llm-top10,owasp-mcp-top10,owasp-agentic-ai, …) are auto-derived from each evaluator’sstandards:tags. Tag an evaluator and it joins the matching suite automatically — no drift. - Curated suites (
harmful-content,pre-deploy-critical,quick-smoke, …) are hand-authored bundles for a specific purpose.
Two catalogs: agent vs MCP
Opfor maintains two independent evaluator catalogs — one for agent / chatbot red-teaming, one for MCP server red-teaming. The target type selects which catalog the engine reads.Choosing what to run
- A suite
- Specific evaluators
- MCP server tool
opfor setup) and the browser extension both let you pick a suite or individual evaluators interactively.
Full reference
Every evaluator and suite with OWASP mappings.
Author an evaluator
Add your own — no TypeScript needed.
