Open-source abilities for OpenHome agents
"VideoRAG: Chat with Your Videos
An AI for Music Generation
Build cross-modal and multimodal applications on the cloud
✨:AI-Powered Piano Audio to MIDI Converter 🎶
Software that uses AI to perform real-time voice conversion
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A Conversational Speech Generation Model
Ainee - AI Notetaking and Learning Companion
A deep learning toolkit for Text-to-Speech, battle-tested in research
Implementation of MusicLM music generation model in Pytorch
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Audio generation using diffusion models, in PyTorch
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Audio generation using diffusion models
A walk along memory lane
Implementation of NÜWA, attention network for text to video synthesis
Real-time music generation using stable diffusion techniques AI
Based on the Disco Diffusion, version of the AI art creation software
Implementation of NWT, audio-to-video generation, in Pytorch
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Dia-1.6B generates lifelike English dialogue and vocal expressions