Release for Improved Denoising Diffusion Probabilistic Models
A Powerful Native Multimodal Model for Image Generation
The Clay Foundation Model - An open source AI model and interface
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Code for running inference and finetuning with SAM 3 model
Hackable and optimized Transformers building blocks
Qwen3-Coder is the code version of Qwen3
Ling is a MoE LLM provided and open-sourced by InclusionAI
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Lets make video diffusion practical
CogView4, CogView3-Plus and CogView3(ECCV 2024)
A Customizable Image-to-Video Model based on HunyuanVideo
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Reference PyTorch implementation and models for DINOv3
An experimental version of DeepSeek model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
GPT4V-level open-source multi-modal model based on Llama3-8B
Tool for exploring and debugging transformer model behaviors
Example Discord bot written in Python that uses the completions API
AlphaFold 3 inference pipeline
Programmatic access to the AlphaGenome model
ICLR2024 Spotlight: curation/training code, metadata, distribution