ExtractThinker is a Document Intelligence library for LLMs
Superlinked is a Python framework for AI Engineers
The no-nonsense RAG chunking library
Recognition and resolution of numbers, units, date/time, etc.
A Repo For Document AI
Weaviate is a cloud-native, modular, real-time vector search engine
Assist in organizing your piles of documents
Build AI-powered semantic search applications
Modular Suite of NLP Tools
Dev tools to reliably understand text and automate conversations
Python package for Korean natural language processing
Language, engine, and tooling for testing composable language rules
Chinese synonyms, chat robot, intelligent question and answer toolkit
An NLP library for building bots
Tools to download and cleanup Common Crawl data
A Deep Neural Text Understanding Framework
Deep learning based natural language and speech processing platform
A Chinese information extraction tool
Named-entity recognition using neural networks
TextRank implementation for Python 3
Natural Language Processing (NLP) for the Masses
Collection of NLP Tools developed at DBMI at University of Pittsburgh
Ansj word segmentation
Statistical phrase-based machine translation system
JSON based text search Java Project