Petri

Petri is an open-source alignment auditing agent that lets researchers rapidly test concrete safety hypotheses against target models using realistic, multi-turn scenarios. Instead of building bespoke evals, Petri automatically generates audit environments from seed “special instructions,” orchestrates an auditor model to probe a target model, and simulates tool use and rollbacks to surface risky behaviors. Each interaction transcript is then scored by a judge model using a consistent rubric so results are comparable across runs and models. The system supports major model APIs and comes with starter seeds and judge dimensions, enabling minutes-to-insight workflows for questions like reward hacking, self-preservation, or eval awareness. Petri is designed for parallel exploration: it spins many audits in flight, aggregates findings, and highlights transcripts that deserve human review.

Features

Scenario generator that turns seed instructions into realistic audit setups
Multi-turn auditor orchestration with simulated tool use and rollbacks
Judge model that scores transcripts via a consistent safety rubric
Parallel execution to explore many hypotheses and surface the riskiest traces first
Built-in starters for seeds and judge dimensions plus guidance for customization
API support for popular model providers with reproducible runs and reports

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Petri

Petri Web Site

Other Useful Business Software

MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free

Rate This Project

User Reviews

Be the first to post a review of Petri!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Agents

Registered

2 days ago

Similar Business Software

Vertex AI

Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery...

See Software
MAIHEM

MAIHEM creates AI agents that continuously test your AI applications. We enable you to automate your AI quality assurance, ensuring AI performance and safety from development all the way to deployment. Avoid hours of manual testing and randomly probing for AI model weaknesses. MAIHEM automates...

See Software
DemoGPT

DemoGPT is an open source platform that simplifies the creation of LLM (Large Language Model) agents by providing an all-in-one toolkit. It offers tools, frameworks, prompts, and models for rapid agent development. The platform automatically generates LangChain code, which can be used for...

See Software
StackAI

StackAI is an enterprise AI automation platform to build end-to-end internal tools and processes with AI agents in a fully compliant and secure way. Designed for large organizations, it enables teams to automate complex workflows across operations, compliance, finance, IT, and support without...

See Software
LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
potpie

Potpie is an open source platform that enables developers to create AI agents tailored to their codebases, automating tasks such as debugging, testing, system design, onboarding, code review, and documentation. By transforming your codebase into a comprehensive knowledge graph, Potpie's agents...

See Software

Report inappropriate content

Petri

An alignment auditing agent capable of exploring alignment hypothesis

Get an email when there's a new version of Petri

Features

Project Samples

Project Activity

Categories

License

Follow Petri

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered