Reference PyTorch implementation and models for DINOv3
Visual Causal Flow
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by the Qwen team
A Systematic Framework for Interactive World Modeling
Official repository for LTX-Video
Let's make video diffusion practical
Text and image to video generation: CogVideoX and CogVideo
Stable Diffusion with Core ML on Apple Silicon
A Family of Open Sourced Music Foundation Models
Qwen3-TTS is an open-source series of TTS models
Models for object and human mesh reconstruction
Open-source multi-speaker long-form text-to-speech model
An experimental version of the DeepSeek model
Z80-μLM is a 2-bit quantized language model
HY-Motion model for 3D character animation generation
Generating Immersive, Explorable, and Interactive 3D Worlds
RGBD video generation model conditioned on camera input
Accurate × Fast × Comprehensive
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
AlphaFold 3 inference pipeline
Recovering the Visual Space from Any Views
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Uncommon Objects in 3D dataset