Generate Any 3D Scene in Seconds
Memory-efficient and performant finetuning of Mistral's models
Block Diffusion for Ultra-Fast Speculative Decoding
Multi-modal large language model designed for audio understanding
Open-source industrial-grade ASR models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Towards Real-World Vision-Language Understanding
High-Resolution Image Synthesis with Latent Diffusion Models
AI-powered tool to quickly remove watermarks from images flawlessly
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
GLIDE: a diffusion-based text-conditional image synthesis model
Dual LSTM Encoder for Dialog Response Generation