Python binding to the Apache Tika™ REST services
Real-time face swap and one-click video deepfake
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
3D reconstruction software
A lightweight audio-to-MIDI converter with pitch bend detection
Telegram Drive
Ready-to-use OCR with 80+ supported languages
Distribute and run LLMs with a single file
Comprehensive Gradio WebUI for audio processing
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models
Download media files from a Telegram conversation/chat/channel
Image polygonal annotation with Python
Aider is AI pair programming in your terminal
Agent Zero AI framework
Speech recognition module for Python
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
A Python library for audio
A reactive notebook for Python
An MCP server that provides fast file searching capabilities
Control Any Computer Using LLMs
A solution to build and deploy MCP agents and applications
Create HTML profiling reports from pandas DataFrame objects
Haystack is an open-source NLP framework to interact with your data
README file generator, powered by AI