FFmpeg Batch AV Converter
OCR model for complex documents with layout-aware structured outputs
Document (PDF, Word, PPTX ...) extraction and parse API
Generate audiobooks from EPUBs, PDFs and text with captions
Enhances Tesseract OCR output using LLMs (local or API)
VapourSynth-based graphical tool for batch video compression and processing
Open source healthcare AI
A Repo For Document AI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Video encoding GUI for Windows
PDF to Markdown with vision models
A professional video compression tool accessible to all
A multimedia transcoding toolbox / an FFmpeg shell
Stable Diffusion web UI
Convert any video/image into a tiny size. 100% free & open-source
A new way to create a web server and NoSQL data model
OCR software, free and offline
Python ETL framework for stream processing, real-time analytics, LLM
Text mining using tidy tools
Misc; latest version of waifu2x; 2D video to stereo 3D video
Open speech-to-speech models and pipelines by Hugging Face
3FUI is a lightweight, professional interactive shell for FFmpeg on Windows
Faster Whisper transcription with CTranslate2
Visual Causal Flow
Comprehensive Gradio WebUI for audio processing