The library to build & auto-optimize LLM applications
Data processing for and with foundation models
Efficient few-shot learning with Sentence Transformers
A Repo For Document AI
Extract schema, statistics and entities from datasets
Pretrained model hub for Keras 3
Toolkit for conversational AI
Superlinked is a Python framework for AI Engineers
Semantic search and workflows for medical/scientific papers
A full spaCy pipeline and models for scientific/biomedical documents
Industrial-strength Natural Language Processing (NLP)
Data loaders and abstractions for text and NLP
The Classical Language Toolkit
Code repo for "WebArena to build Autonomous Agents
Persian NLP Toolkit
WikiChat is an improved RAG
An LLM-powered knowledge curation system that researches topics
ReFT: Representation Finetuning for Language Models
Evaluation code for various unsupervised automated metrics
Underthesea - Vietnamese NLP Toolkit
The most accurate natural language detection library for Python
Stanford NLP Python library for many human languages
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models