SOTA discrete acoustic codec models with 40/75 tokens per second
A nearly-live implementation of OpenAI's Whisper
Butterchurn is a WebGL implementation of the Milkdrop Visualizer
Streaming Real-time Audio-Driven Avatar Generation
A JavaScript NES emulator
AI video generator optimized for low VRAM and older GPUs use
Stable diffusion for real-time music generation (web app)
Flash + AIR sound effects generator. Based on Sfxr.
Download videos from almost any website
Download videos from websites like YouTube and many others
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Media Player .NET Library for WinUI 3/ WPF/WinForms
The Serenity Operating System
Self-hosted AI audio transcription
Fast multimodal LLM for real-time voice interaction and AI apps
Robust Speech Recognition via Large-Scale Weak Supervision
Implementation of AudioLM audio generation model in Pytorch
Cross platform GUI tool for downloading videos from Bilibili sites
OpenCore bootloader
The missing YouTube Music macOS app
Give Claude the ability to watch and understand videos
Plug-in, customized, ad-free free music player
Speakr is a personal, self-hosted web application
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI