Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Python SDK/API for reverse engineered Google Bard
A webui for different audio related Neural Networks
A large open dataset + tools to speed up MRI scans using ML
800,000 step-level correctness labels on LLM solutions to MATH problem
High-Resolution 3D Human Digitization from A Single Image
Code for the paper Fine-Tuning Language Models from Human Preferences
Point cloud diffusion for 3D model synthesis
Discord bot and Interface for Stable Diffusion
A latent text-to-image diffusion model
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Human Activity Recognition example using TensorFlow on smartphone
Large-scale pretraining for dialogue
A minimal implementation of diffusion models for text generation
Codebase for Diffusion Models Beat GANS on Image Synthesis
Learning to Act by Watching Unlabeled Online Videos
WaveRNN Vocoder + TTS
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
RWKV for Chinese novel generation
Separate audio recordings into individual sources
PyTorch implementation of MoCo v3
PyTorch implementation of YOLOv4
Clone a voice in 5 seconds to generate arbitrary speech in real-time