GUI for a Vocal Remover that uses Deep Neural Networks
Generate audiobooks from EPUBs, PDFs and text with captions
Use Microsoft Edge's online text-to-speech service from Python
Comprehensive Gradio WebUI for audio processing
Open speech-to-speech models and pipelines by Hugging Face
Unofficial Python API and agentic skill for Google NotebookLM
GenAI Processors is a lightweight Python library
Translate a video from one language to another and embed dubbing
AI video generator optimized for low-VRAM and older GPUs
One-click deployment (including offline integration package)
Qwen3-ASR is an open-source series of ASR models
Python library and CLI tool to interface with Google Translate
Official repository for LTX-Video
Unlimited, private and free Speech-To-Text program
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Audio metadata editor with MusicBrainz integration
Official Repository for Pot-O MusiQT
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Download TIDAL music on Windows/Linux/macOS (Python/C#)
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Multiplatform, small and handy audio/video player with network remote
A cross-platform front-end GUI for the popular youtube-dl downloader
Qt-based graphical interface wrapper for FFmpeg