Generate blog articles from video or audio
Open source AI model for generating full songs from lyrics prompts
Helps scientists define testable, modular, self-documenting dataflow
DoWhy is a Python library for causal inference
The music player of today
MARS5 speech model (TTS) from CAMB.AI
A general fine-tuning kit geared toward image/video/audio diffusion
Interface for OuteTTS models
AI tool converting video/audio into structured documents instantly
An SSH/Telnet/Serial client in your browser
Data manipulation and transformation for audio signal processing
Trying to be a robust, user-friendly and hackable music player
Label Studio is a multi-type data labeling and annotation tool
An Open Source implementation of Notebook LM with more flexibility
Use Microsoft Edge's online text-to-speech service from Python
Scalable data pre processing and curation toolkit for LLMs
An extremely simple tool for separating vocals and background music
Qwen3-ASR is an open-source series of ASR models
A TTS model capable of generating ultra-realistic dialogue
Open Source Speech Language Model
VMZ: Model Zoo for Video Modeling
Streamlines and simplifies prompt design for both developers
Unified web UI for training and running open models locally
Towards Human-Sounding Speech
Multi-user UI for managing and running Stable Diffusion workflows tool