Inspect Petri is an open-source alignment auditing agent that lets researchers rapidly test concrete safety hypotheses against target models using realistic, multi-turn scenarios. Instead of building bespoke evals, Inspect Petri automatically generates audit environments from seed “special instructions,” orchestrates an auditor model to probe a target model, and simulates tool use and rollbacks to surface risky behaviors. Each interaction transcript is then scored by a judge model using a consistent rubric so results are comparable across runs and models. The system supports major model APIs and comes with starter seeds and judge dimensions, enabling minutes-to-insight workflows for questions like reward hacking, self-preservation, or eval awareness. Petri is designed for parallel exploration: it spins many audits in flight, aggregates findings, and highlights transcripts that deserve human review.

Features

  • Scenario generator that turns seed instructions into realistic audit setups
  • Multi-turn auditor orchestration with simulated tool use and rollbacks
  • Judge model that scores transcripts via a consistent safety rubric
  • Parallel execution to explore many hypotheses and surface the riskiest traces first
  • Built-in starters for seeds and judge dimensions plus guidance for customization
  • API support for popular model providers with reproducible runs and reports

Project Samples

Project Activity

See All Activity >

Categories

AI Agents

License

MIT License

Follow Inspect Petri

Inspect Petri Web Site

Other Useful Business Software
Stop Storing Third-Party Tokens in Your Database Icon
Stop Storing Third-Party Tokens in Your Database

Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
Try Auth0 for Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Inspect Petri!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Agents

Registered

2025-10-08