An easy-to-use LLMs quantization package with user-friendly apis
Evaluation code for various unsupervised automated metrics
Underthesea - Vietnamese NLP Toolkit
Easy-to-use and high-performance NLP and LLM framework
Unified embedding model
Persian NLP Toolkit
WikiChat is an improved RAG
The no-nonsense RAG chunking library
ReFT: Representation Finetuning for Language Models
Dealing with all unstructured data, such as reverse image search
Stanford NLP Python library for many human languages
Code repo for "WebArena to build Autonomous Agents
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
Fast and customizable framework for automatic ML model creation
Semantic search and workflows for medical/scientific papers
Efficient Retrieval Augmentation and Generation Framework
A full spaCy pipeline and models for scientific/biomedical documents
The library to build & auto-optimize LLM applications
Data processing for and with foundation models
Extract schema, statistics and entities from datasets
A coding-free framework built on PyTorch
Industrial-strength Natural Language Processing (NLP)
Efficient few-shot learning with Sentence Transformers
A Unified Library for Parameter-Efficient Learning