ExtractThinker is a Document Intelligence library for LLMs
The no-nonsense RAG chunking library
Superlinked is a Python framework for AI Engineers
Recognition and resolution of numbers, units, date/time, etc.
Build AI-powered semantic search applications
Weaviate is a cloud-native, modular, real-time vector search engine
A Repo For Document AI
Assist in organizing your piles of documents
Modular Suite of NLP Tools
Aseryla2 code repositories
Python package for Korean natural language processing
Aseryla code repositories
Language, engine, and tooling for testing composable language rules
Chinese synonyms, chat robot, intelligent question and answer toolkit
An NLP library for building bots
Tools to download and cleanup Common Crawl data
A Deep Neural Text Understanding Framework
Deep learning based natural language and speech processing platform
A Chinese information extraction tool
A smart search engine for medical documents
Named-entity recognition using neural networks
TextRank implementation for Python 3
Natural Language Processing (NLP) for the Masses
Turku Event Extraction System
Collection of NLP Tools developed at DBMI at University of Pittsburgh