Extract schema, statistics and entities from datasets
Stanford NLP Python library for many human languages
Easy-to-use and high-performance NLP and LLM framework
Unified embedding model
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
Trained models & code to predict toxic comments
Obsei is a low code AI powered automation tool
Persian NLP Toolkit
Fast and customizable framework for automatic ML model creation
WikiChat is an improved RAG
A coding-free framework built on PyTorch
Efficient Retrieval Augmentation and Generation Framework
A full spaCy pipeline and models for scientific/biomedical documents
The no-nonsense RAG chunking library
The library to build & auto-optimize LLM applications
Data processing for and with foundation models
An easy-to-use LLMs quantization package with user-friendly apis
An LLM-powered knowledge curation system that researches topics
ReFT: Representation Finetuning for Language Models
Efficient few-shot learning with Sentence Transformers
A Unified Library for Parameter-Efficient Learning
Industrial-strength Natural Language Processing (NLP)
Recognition and resolution of numbers, units, date/time, etc.
Transformers4Rec is a flexible and efficient library