Autolabel is a Python library to label, clean and enrich datasets with Large Language Models (LLMs). Autolabel data for NLP tasks such as classification, question-answering and named entity recognition, entity matching and more. Seamlessly use commercial and open-source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more.
Features
- Autolabel data for NLP tasks such as classification, question-answering and named entity-recognition, entity matching and more
- Seamlessly use commercial and open source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more
- Leverage research-proven LLM techniques to boost label quality, such as few-shot learning and chain-of-thought prompting
- Confidence estimation and explanations out of the box for every single output label
- Caching and state management to minimize costs and experimentation time
- You can get started with Autolabel by simpling bringing the dataset you want to label, picking your favorite LLM and writing a few lines of code
Categories
Large Language Models (LLM)License
MIT LicenseFollow Autolabel
Other Useful Business Software
MongoDB Atlas runs apps anywhere
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Autolabel!