Large Audio Language Model built for natural interactions
Comprehensive Gradio WebUI for audio processing
Streaming Real-time Audio-Driven Avatar Generation
A Systematic Framework for Interactive World Modeling
State-of-the-art diffusion models for image and audio generation
Software that uses AI to perform real-time voice conversion
Desktop piano playable with a PC keyboard, mouse, or MIDI device.
LLM Large Model of Selling Anchor
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Foundational model for human-like, expressive TTS
An Ubuntu Linux-based OS that aims to end user.
A webui for different audio related Neural Networks
A DLNA-compliant UPnP Media Server