An Open Source implementation of Notebook LM with more flexibility
A natural language interface for computers
Concatenate a directory full of files into a single prompt
Pre-trained Deep Learning models and demos
GUI for a Vocal Remover that uses Deep Neural Networks
OCRmyPDF adds an OCR text layer to scanned PDF files
Python tool for converting files and office documents to Markdown
The most powerful and modular diffusion model GUI, API and backend
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
An AI-powered file management tool that ensures privacy
Use Microsoft Edge's online text-to-speech service from Python
Offline Text To Speech synthesis for Python
Framework for Telegram Bot API written in Python 3.7 with asyncio
EPUB to audiobook converter, optimized for Audiobookshelf
A nearly-live implementation of OpenAI's Whisper
TTS with Kokoro and ONNX Runtime
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
A Web UI for easy subtitle generation using the Whisper model
AI tool that removes hardcoded subtitles and text from videos locally
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A community-supported supercharged version of paperless
Deterministic LLM Outputs for AI Applications and AI Agents
Unified web UI for training and running open models locally
Voice Recognition to Text Tool
Custom Home Assistant configuration with automations and scripts setup