Framework that is dedicated to making neural data processing
Generate 3D objects conditioned on text or images
Unofficial Parallel WaveGAN
The first Chinese LLaMA2 model in the open source community
Clarity in the current fast-paced mess of Open Source innovation
SoftVC VITS Singing Voice Conversion
Let us control diffusion models
Implementation of MusicLM music generation model in PyTorch
Resources, corpora, and tools for Chinese natural language processing
Explore large language models in 512MB of RAM
Chinese text-to-speech engine
Python package for easily interfacing with chat apps
Use GPT or other prompt-based models to get structured output
A web UI for different audio-related neural networks
Repo for external large-scale work
Official PyTorch Implementation of "Scalable Diffusion Models"
An unnecessarily tiny implementation of GPT-2 in NumPy
Deep learning tool that converts portrait photos into line art
Real-time music generation using Stable Diffusion techniques
Point cloud diffusion for 3D model synthesis
A latent text-to-image diffusion model
Chinese-language edition of Dive into Deep Learning
Singing Voice Synthesis via Shallow Diffusion Mechanism
Repository of notes, code and notebooks in Python
WaveRNN Vocoder + TTS