Document (PDF, Word, PPTX, ...) extraction and parsing API
Python tool for converting files and office documents to Markdown
Discourse Network Analyzer (DNA)
Han Language Processing
Text mining using tidy tools
Open source annotation tool for machine learning practitioners
Open-source Python 3 tool for recognizing layouts, tables, and math
Metaprogramming library to analyze and transform Java source code
Open source semantic search and text analytics for large document sets
General natural language facilities for Node.js
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Toolkit for conversational AI
https://github.com/The-Osint-Toolbox/Telegram-OSINT
Deep Research framework, combining language models with tools
Export disassemblies into Protocol Buffers
Parser generator to read, process, or translate structured text
Open source healthcare AI
Public opinion analysis system
Stanford NLP Python library for many human languages
Contexts Optical Compression
Persian NLP Toolkit
System Analysis Software
Underthesea - Vietnamese NLP Toolkit
pprof is a tool for visualization and analysis of profiling data
A Repo For Document AI