ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Open source AI model for generating full songs from lyrics prompts
The official Allegro 5 git repository. Pull requests welcome
Robust Speech Recognition via Large-Scale Weak Supervision
Mixxx is Free DJ software that gives you everything you need
Workflow and speech recognition app
Offline Text To Speech synthesis for Python
Interface for OuteTTS models
AI tool that turns Hacker News posts into daily podcast updates
Framework for building real-time voice and multimodal AI agents
Looks and smells like Sonarr but made for music
Lightweight, efficient Tags input component in Vanilla JS
Go implementation of the MediaDevices API
Converts text to speech in real time
Cutting Edge WebRTC Video Conferencing
Automatic Speech Recognition with Word-level Timestamps
WhatsApp library for NodeJS that connects through the browser app
A general fine-tuning kit geared toward image/video/audio diffusion
A single Gradio + React WebUI with extensions for ACE-Step
JavaScript player library / DASH & HLS client / MSE-EME player
Unofficial Python API and agentic skill for Google NotebookLM
Local-first AI Notepad for Private Meetings
Video player for improving quality of hand-drawn images
The official Node.js / TypeScript library for the Groq API
Make videos programmatically with React