ExtractThinker is a Document Intelligence library for LLMs
Extract schema, statistics and entities from datasets
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
Public opinion analysis system
Toolkit for conversational AI
A natural language interface for computers
The Classical Language Toolkit
Stanford NLP Python library for many human languages
A Repo For Document AI
Efficient few-shot learning with Sentence Transformers
Obsei is a low code AI powered automation tool
AI-powered semantic indexing: automating the creation of book indexes
Resources, corpora, and tools for Chinese natural language processing
fastNLP: A Modularized and Extensible NLP Framework
InferSent sentence embeddings
AiLearning, data analysis plus machine learning practice
We describe a simple XML format to share text documents and annotation
Lexicon and rule-based sentiment analysis tool
TextBlob is a Python library for processing textual data