In-The-Wild Jailbreak Prompts on LLMs is an open-source research repository providing datasets and analysis tools for studying jailbreak prompts, i.e., prompts crafted to bypass the safety restrictions of large language models. The project is part of a research effort to understand how users attempt to circumvent the alignment and safety mechanisms built into modern AI systems. The repository includes a large collection of prompts gathered from real-world platforms such as Reddit, Discord, prompt-sharing communities, and other public sources. Researchers analyze these prompts to identify the patterns, attack strategies, and techniques commonly used to coax language models into producing restricted or harmful outputs. With thousands of prompts collected across multiple platforms, the dataset is one of the largest collections of in-the-wild jailbreak attempts available for research.

Features

  • Large dataset of real-world jailbreak prompts collected from multiple platforms
  • Framework for analyzing adversarial prompt strategies against LLMs
  • Measurement study of jailbreak attacks in the wild
  • Tools for evaluating model responses to adversarial prompts
  • Dataset containing thousands of prompts and jailbreak attempts
  • Research resource for improving LLM safety and alignment methods
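A typical first step with a prompt collection like this is to group the entries by their source platform before deeper analysis. The sketch below shows one minimal way to do that in Python; the record layout and the `platform` field name are illustrative assumptions, not the repository's actual schema.

```python
# Minimal sketch: tallying collected prompts by source platform.
# NOTE: the records below are illustrative placeholders, not real
# dataset entries, and the "platform" key is an assumed field name.
from collections import Counter

prompts = [
    {"platform": "reddit", "text": "..."},
    {"platform": "discord", "text": "..."},
    {"platform": "reddit", "text": "..."},
]

def count_by_platform(records):
    """Count how many prompts came from each source platform."""
    return Counter(r["platform"] for r in records)

counts = count_by_platform(prompts)
print(counts.most_common())  # e.g. [('reddit', 2), ('discord', 1)]
```

The same pattern extends naturally to other categorical fields (e.g., an attack-strategy label) once the dataset's real column names are known.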

License

MIT License


Additional Project Details

Programming Language: Python
Related Categories: Python Large Language Models (LLM)
Registered: 2026-03-05