A Repo For Document AI
Data loaders and abstractions for text and NLP
Industrial-strength Natural Language Processing (NLP)
Hub of ready-to-use datasets for ML models
Sparsity-aware deep learning inference runtime for CPUs
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
Stanford NLP Python library for many human languages
The no-nonsense RAG chunking library
Toolkit for conversational AI
Training data (data labeling, annotation, workflow) for all data types
Easy-to-use and high-performance NLP and LLM framework
An LLM-powered knowledge curation system that researches topics
A natural language interface for computers
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Large Language Model Text Generation Inference
Recognition and resolution of numbers, units, date/time, etc.
Superlinked is a Python framework for AI Engineers
The Classical Language Toolkit
Underthesea - Vietnamese NLP Toolkit
The most accurate natural language detection library for Python
Extract schema, statistics and entities from datasets
ExtractThinker is a Document Intelligence library for LLMs
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
A tool for learning vector representations of words and entities
Trained models & code to predict toxic comments