DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
ExtractThinker is a Document Intelligence library for LLMs
Extract schema, statistics and entities from datasets
A natural language interface for computers
Kener is a modern self-hosted status page, batteries included
Obsei is a low-code, AI-powered automation tool
Modular Suite of NLP Tools
Common Resource Grep
An open-source NLP research library, built on PyTorch
Transforms PDFs, documents and images into enriched structured data
A model library for exploring state-of-the-art deep learning
A Chinese information extraction tool
Dataset generation for AI chatbots and NLP tasks
Library to scrape and clean web pages to create massive datasets
Lexicon and rule-based sentiment analysis tool
NLP tool for statistical analysis of words, sentences and documents
A user-friendly grammar tool for natural language processing tasks