FFmpeg Batch AV Converter
Document (PDF, Word, PPTX ...) extraction and parsing API
OCR model for complex documents with layout-aware structured outputs
Enhances Tesseract OCR output using LLMs (local or API)
Generate audiobooks from EPUBs, PDFs and text with captions
Open source healthcare AI
A Repo For Document AI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
PDF to Markdown with vision models
A multimedia transcoding treasure chest / an FFmpeg case
OCR software, free and offline
Stable Diffusion web UI
A professional video compression tool accessible to all
Python ETL framework for stream processing, real-time analytics, LLM
A new way to create web servers and NoSQL data models
Text mining using tidy tools
Comprehensive Gradio WebUI for audio processing
Convert any video/image into a tiny size. 100% free & open-source
Faster Whisper transcription with CTranslate2
The official Go library for the OpenAI API
Visual Causal Flow
A full spaCy pipeline and models for scientific/biomedical documents
Persian NLP Toolkit
Automatic subtitle synchronization tool
Stanford CoreNLP, a Java suite of core NLP tools