Data loaders and abstractions for text and NLP
Industrial-strength Natural Language Processing (NLP)
Toolkit for conversational AI
Han Language Processing
Handling unstructured-data tasks, such as reverse image search
A Repo For Document AI
Training data (data labeling, annotation, workflow) for all data types
The Classical Language Toolkit
Underthesea - Vietnamese NLP Toolkit
The most accurate natural language detection library for Python
Trained models & code to predict toxic comments
Obsei is a low-code, AI-powered automation tool
Semantic search and workflows for medical/scientific papers
C++ implementation of ChatGLM-6B, ChatGLM2-6B, ChatGLM3, and GLM4(V)
Data processing for and with foundation models
Recognition and resolution of numbers, units, date/time, etc.
Hub of ready-to-use datasets for ML models
Superlinked is a Python framework for AI Engineers
Model explainability that works seamlessly with Hugging Face
Evaluation code for various unsupervised automated metrics
Extract schema, statistics and entities from datasets
Stanford NLP Python library for many human languages
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Easy-to-use and high-performance NLP and LLM framework
ExtractThinker is a Document Intelligence library for LLMs