Release for Improved Denoising Diffusion Probabilistic Models
Tiny vision language model
Open Source Speech Language Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A PyTorch library for implementing flow matching algorithms
LTX-Video Support for ComfyUI
High-Fidelity and Controllable Generation of Textured 3D Assets
The most powerful local music generation model
Inference script for Oasis 500M
Lets make video diffusion practical
Official inference repo for FLUX.2 models
Open-source framework for intelligent speech interaction
Official inference repo for FLUX.1 models
Audio foundation model excelling in audio understanding
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Sharp Monocular Metric Depth in Less Than a Second
4M: Massively Multimodal Masked Modeling
Official implementation of DreamCraft3D
Controllable & emotion-expressive zero-shot TTS
Dataset of GPT-2 outputs for research in detection, biases, and more
Official repo for consistency models
Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of model parallel autoregressive transformers on GPUs