OCR model for complex documents with layout-aware structured outputs
Document (PDF, Word, PPTX ...) extraction and parse API
Generate audiobooks from EPUBs, PDFs and text with captions
Enhances Tesseract OCR output using LLMs (local or API)
Open source healthcare AI
A Repo For Document AI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
A multimedia transcoded treasure chest / a FFmpeg case
PDF to Markdown with vision models
A professional video compression tool accessible to all
New way to create web server and NoSQL data model
Text mining using tidy tools
Convert any video/image into a tiny size. 100% free & open-source
Python ETL framework for stream processing, real-time analytics, LLM
Stable Diffusion web UI
OCR software, free and offline
Comprehensive Gradio WebUI for audio processing
Visual Causal Flow
Stanford CoreNLP, a Java suite of core NLP tools
Persian NLP Toolkit
Parser generator to read, process, or translate structured text
The official Go library for the OpenAI API
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Automatic subtitle synchronization tool
Contexts Optical Compression