GUI for a Vocal Remover that uses Deep Neural Networks
Generate audiobooks from e-books
Generate audiobooks from e-books, voice cloning & 1107+ languages
A single Gradio + React WebUI with extensions for ACE-Step
The most powerful and modular diffusion model GUI, API and backend
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Speech recognition for your site
StreamSpeech is a seamless model for offline speech recognition
Unlimited, private and free Speech-To-Text program
Batch file to install and run NAM (neural-amp-modeler) easily
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
Tool that can record speech synthesis
A version of the AI art creation software based on Disco Diffusion
Windows-GUI
Provides a GUI for Tesseract OCR
Provides a speech-to-text GUI for Sphinx4