A natural language interface for computers
An Open Source implementation of Notebook LM with more flexibility
Concatenate a directory full of files into a single prompt
Pre-trained Deep Learning models and demos
GUI for a Vocal Remover that uses Deep Neural Networks
Python tool for converting files and office documents to Markdown
OCRmyPDF adds an OCR text layer to scanned PDF files
Unified web UI for training and running open models locally
An AI-powered file management tool that ensures privacy
TTS with Kokoro and ONNX Runtime
The most powerful and modular diffusion model GUI, API, and backend
Deterministic LLM Outputs for AI Applications and AI Agents
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Use Microsoft Edge's online text-to-speech service from Python
AI tool that removes hardcoded subtitles and text from videos locally
Voice Recognition to Text Tool
Data manipulation and transformation for audio signal processing
Offline Text To Speech synthesis for Python
A Web UI for easy subtitle generation using the Whisper model
Framework for Telegram Bot API written in Python 3.7 with asyncio
Portable AI agent orchestration with mechanical protocol enforcement
EPUB to audiobook converter, optimized for Audiobookshelf
A community-supported supercharged version of paperless
Visual intelligence for your home
A nearly-live implementation of OpenAI's Whisper