Build cross-modal and multimodal applications on the cloud
Official MiniMax Model Context Protocol (MCP) server
Meta-Datenbank-Anwendung fรผr die Audio- und TV-Sendungen des CC2.TV
Audiocraft is a library for audio processing and generation
An AI for Music Generation
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Private chat with local GPT with document, images, video, etc.
A Conversational Speech Generation Model
SPPAS - the automatic annotation and analyses of speech
Two Integrated Text To Speech Engines uses MMS & Silero
โจ:AI-Powered Piano Audio to MIDI Converter ๐ถ
Free AI Audio Enhancer & Noise Reduction tool for Windows.
A subtitle generator for Japanese Adult Videos.
DiโชโชRhythm: Blazingly Fast & Simple End-to-End Song Generation
SoundTranscriber can be used to generate automatic transcription / aut
Software that uses AI to perform real-time voice conversion
Unlimited, private and free Speech-To-Text program
Eva is an A.I. assistant that helps users multi-task.
Toolkit for audio, music, and speech generation
Ainee - AI Notetaking and Learning Companion
Towards Human-Level Text-to-Speech through Style Diffusion
High-quality multi-lingual text-to-speech library by MyShell.ai
An extremely simple tool for separating vocals and background music
AI powered speech denoising and enhancement
A deep learning toolkit for Text-to-Speech, battle-tested in research