Python binding to the Apache Tika™ REST services
File Parser optimised for LLM Ingestion with no loss
Tools like web browser, computer access and code runner for LLMs
Industrial-strength Natural Language Processing (NLP)
Standalone, small, language-neutral
Parse files for optimal RAG
Qwen3-Coder is the code version of Qwen3
LLM powered fuzzing via OSS-Fuzz
A Unified Toolkit for Deep Learning Based Document Image Analysis
A C library for parsing/normalizing street addresses around the world
High-accuracy NLP parser with models for 11 languages
Bangla text to speech synthesis in python
A simple resume parser used for extracting information from resumes
Tool to parse the command line and configuration files.
MSTParser is a non-projective dependency parser that searches for maxi