UQLM is a Python library developed to detect hallucinations and quantify uncertainty in the outputs of large language models. The system implements a variety of uncertainty quantification techniques that assign confidence scores to model responses. These scores help developers determine how likely a generated answer is to contain errors or fabricated information. The library includes both black-box and white-box approaches to uncertainty estimation. Black-box methods evaluate model outputs through multiple generations or comparative analysis, while white-box methods rely on token probabilities produced during inference. UQLM also supports ensemble strategies and model-as-judge approaches for evaluating responses. By combining multiple uncertainty metrics, the system provides more reliable indicators of when language model outputs may be unreliable.

Features

  • Python library for hallucination detection in language models
  • Confidence scoring system for evaluating LLM outputs
  • Support for black-box and white-box uncertainty quantification methods
  • Techniques including semantic entropy and semantic density metrics
  • Ensemble evaluation strategies using multiple model generations
  • Tools for calibrating uncertainty scores across different models

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow uqlm

uqlm Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of uqlm!

Additional Project Details

Programming Language

Python

Related Categories

Python Large Language Models (LLM)

Registered

3 days ago