A Repo For Document AI
ExtractThinker is a Document Intelligence library for LLMs
Semantic search and workflows for medical/scientific papers
A Heterogeneous Benchmark for Information Retrieval
Haystack is an open source NLP framework to interact with your data
State-of-the-art Multilingual Question Answering research
Chinese synonyms, chat robot, intelligent question and answer toolkit
TextRank implementation for Python 3
AiLearning, data analysis plus machine learning practice