AirPlay audio player
A text-to-speech, speech-to-text and speech-to-speech library
Secure, open-source platform for file storage, sharing, collaboration
Chat & pretrained large audio language model proposed by Alibaba Cloud
Repo of Qwen2-Audio chat & pretrained large audio language model
Audio foundation model excelling in audio understanding
Open-source framework for intelligent speech interaction
Taming Stable Diffusion for Lip Sync
Anki is a smart spaced repetition flashcard program
LLM-based Reinforcement Learning audio edit model
Multi-modal large language model designed for audio understanding
Oobabooga - The definitive Web UI for local AI, with powerful features
A music software developed based on React native
Wire for iOS (iPhone and iPad)
Automatically translates the text of a video based on a subtitle file
The subtitle editor
A safe home for all your data
Synchronized Translation for Videos
Audiocraft is a library for audio processing and generation
Speech-to-text, text-to-speech, and speaker recognition
LilyPond sheet music text editor
QOwnNotes is a plain-text file notepad and todo-list manager
Tokenizer-Free TTS for Multilingual Speech Generation
Qwen3-omni is a natively end-to-end, omni-modal LLM