Renderer for the harmony response format to be used with gpt-oss
Official implementation of DreamCraft3D
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Language modeling in a sentence representation space
A Conversational Speech Generation Model
High-Resolution Image Synthesis with Latent Diffusion Models
Let us control diffusion models
Repo for external large-scale work
Official PyTorch Implementation of "Scalable Diffusion Models"
Code for "Image Generation from Scene Graphs", Johnson et al., CVPR 201
Dia-1.6B generates lifelike English dialogue and vocal expressions
CTC-based forced aligner for audio-text in 158 languages
Vision-language-action model for robot control via images and text