Let's make video diffusion practical
Qwen3 is the large language model series developed by the Qwen team
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Generate Any 3D Scene in Seconds
Visual Causal Flow
Models for object and human mesh reconstruction
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Code for running inference with the SAM 3D Body model (3DB)
Uncommon Objects in 3D dataset
Reference PyTorch implementation and models for DINOv3
Open-Source Financial Large Language Models
Repo for SeedVR2 & SeedVR
Large Multimodal Models for Video Understanding and Editing
An AI-powered security review GitHub Action using Claude
Research code artifacts for Code World Model (CWM)
AlphaFold 3 inference pipeline
Programmatic access to the AlphaGenome model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Inference code for scalable emulation of protein equilibrium ensembles
Open-source multi-speaker long-form text-to-speech model
An experimental version of the DeepSeek model
Hackable and optimized Transformers building blocks