ExtractThinker is a Document Intelligence library for LLMs
Extract schema, statistics and entities from datasets
Public opinion analysis system
The Classical Language Toolkit
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
Toolkit for conversational AI
A natural language interface for computers
Stanford NLP Python library for many human languages
A Repo For Document AI
Obsei is a low-code, AI-powered automation tool
Resources, corpora, and tools for Chinese natural language processing
fastNLP: A Modularized and Extensible NLP Framework
InferSent sentence embeddings
AiLearning: data analysis plus machine learning practice
We describe a simple XML format to share text documents and annotations
Lexicon and rule-based sentiment analysis tool
TextBlob is a Python library for processing textual data