GUI for a Vocal Remover that uses Deep Neural Networks
Generate audiobooks from e-books
Generate audiobooks from e-books, voice cloning & 1107+ languages
A single Gradio + React WebUI with extensions for ACE-Step
The most powerful and modular diffusion model GUI, api and backend
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Speech recognition for your site
StreamSpeech is a seamless model for offline speech recognition
Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV
Batch file to install and run NAM (neural-amp-modeler) easily.
Tool that can record speech synthesis
Based on the Disco Diffusion, version of the AI art creation software
Windows-GUI
Easy-OCR solution and Tesseract trainer for GNU/Linux
Provides speech to text gui to sphinx4