OBLITERATUS

OBLITERATUS is an advanced open-source toolkit designed to analyze and modify the internal behavior of large language models by identifying and removing mechanisms responsible for refusal or restricted responses. It implements a set of techniques collectively referred to as “abliteration,” which target specific internal representations within neural networks to alter how models respond to certain prompts. Unlike traditional fine-tuning approaches, OBLITERATUS operates directly on model activations, enabling behavioral changes without retraining the model. The toolkit provides a full pipeline for probing, analyzing, and modifying model behavior, including visualization tools that help researchers understand where and how refusal mechanisms are encoded. It supports multiple analytical methods such as PCA and SVD to locate these behavioral directions within model layers.

Features

Identification and removal of refusal behaviors in language models
Techniques such as PCA and SVD for analyzing model activations
Modification of model behavior without retraining
Visualization tools for understanding internal model representations
Python API for advanced experimentation and integration
Optional telemetry for contributing to collaborative research

Project Samples

Project Activity

See All Activity >

License

Affero GNU Public License

Follow OBLITERATUS

OBLITERATUS Web Site

Other Useful Business Software

MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free

Rate This Project

User Reviews

Be the first to post a review of OBLITERATUS!

Additional Project Details

Programming Language

Python

Related Categories

Python Artificial Intelligence Software

Registered

14 hours ago

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
StackAI

StackAI is an enterprise AI automation platform to build end-to-end internal tools and processes with AI agents in a fully compliant and secure way. Designed for large organizations, it enables teams to automate complex workflows across operations, compliance, finance, IT, and support without...

See Software
Criminal IP

Criminal IP equips security teams with the actionable Threat Intelligence needed to proactively identify, analyze, and respond to emerging threats. Powered by AI and OSINT, it delivers threat scoring, reputation data, and real-time detection of a wide array of malicious indicators, ranging from...

See Software
Vertex AI

Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery...

See Software
Podium

Podium is a comprehensive AI-driven lead management and communication platform utilized by over 100,000 businesses seeking to enhance customer acquisition and retention. At the heart of Podium’s innovation is its AI employee, which ensures businesses respond to incoming leads instantly around...

See Software
LTX

Control every aspect of your video using AI, from ideation to final edits, on one holistic platform. We’re pioneering the integration of AI and video production, enabling the transformation of a single idea into a cohesive, AI-generated video. LTX empowers individuals to share their visions,...

See Software

Report inappropriate content

OBLITERATUS

OBLITERATE THE CHAINS THAT BIND YOU

Get an email when there's a new version of OBLITERATUS

Features

Project Samples

Project Activity

Categories

License

Follow OBLITERATUS

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered