A natural language interface for computers
The Classical Language Toolkit
A full spaCy pipeline and models for scientific/biomedical documents
Industrial-strength Natural Language Processing (NLP)
Build AI-powered semantic search applications
Hub of ready-to-use datasets for ML models
A repo for Document AI
Han Language Processing
Chinese XLNet pre-trained model
The no-nonsense RAG chunking library
The library to build & auto-optimize LLM applications
An LLM-powered knowledge curation system that researches topics
The most accurate natural language detection library for Python
Training data (data labeling, annotation, workflow) for all data types
Toolkit for conversational AI
Stanford NLP Python library for many human languages
Trained models & code to predict toxic comments
Persian NLP Toolkit
WikiChat is an improved RAG
ReFT: Representation Finetuning for Language Models
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.
Fast and customizable framework for automatic ML model creation