A Repo For Document AI
Text mining using tidy tools
ExtractThinker is a Document Intelligence library for LLMs
Semantic search and workflows for medical/scientific papers
Common Resource Grep
Converting text to a structured representation
AiLearning, data analysis plus machine learning practice
NLP tool for statistical analysis of words, sentences, documents
JSON-based text search Java project