SoftVC VITS Singing Voice Conversion
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A webui for different audio related Neural Networks
Implementation of MusicLM music generation model in Pytorch
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
User-friendly library to find similar objects
Audio generation using diffusion models, in PyTorch
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
No-code tool for creating a neural search solution in minutes
A walk along memory lane
Implementation of NÜWA, attention network for text to video synthesis
Audio generation using diffusion models
Real-time music generation using stable diffusion techniques AI
WaveRNN Vocoder + TTS
Data augmentation for NLP
Based on the Disco Diffusion, version of the AI art creation software
Implementation of NWT, audio-to-video generation, in Pytorch
Task of transcribing piano recordings into MIDI files
Separate audio recordings into individual sources
General Speech Restoration
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Open source embedded speech-to-text engine
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Generative Adversarial Networks for Efficient and High Fidelity Speech
Easy-OCR solution and Tesseract trainer for GNU/Linux