Concatenate a directory full of files into a single prompt
GUI for a Vocal Remover that uses Deep Neural Networks
OCRmyPDF adds an OCR text layer to scanned PDF files
Python tool for converting files and office documents to Markdown
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Voice Recognition to Text Tool
Use Microsoft Edge's online text-to-speech service from Python
The most powerful and modular diffusion model GUI, api and backend
Automatically translates the text of a video based on a subtitle file
EPUB to audiobook converter, optimized for Audiobookshelf
Offline Text To Speech synthesis for python
An AI-powered security review GitHub Action using Claude
Framework for Telegram Bot API written in Python 3.7 with asyncio
A nearly-live implementation of OpenAI's Whisper
A Python package for segmenting geospatial data with the SAM
TTS with kokoro and onnx runtime
Deterministic LLMs Outputs for AI Applications and AI Agents
Python library and CLI tool to interface with Google Translate
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A Binary Ninja plugin, MCP server
Private chat with local GPT with document, images, video, etc.
Fast and accurate AI powered file content types detection
Models for object and human mesh reconstruction
A community-supported supercharged version of paperless
A Python library for audio