Lets make video diffusion practical
Official inference repo for FLUX.1 models
LTX-Video Support for ComfyUI
A Systematic Framework for Interactive World Modeling
Open-source multi-speaker long-form text-to-speech model
Contexts Optical Compression
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
DeepSeek Coder: Let the Code Write Itself
Renderer for the harmony response format to be used with gpt-oss
Python bindings for llama.cpp
Industrial-level controllable zero-shot text-to-speech system
Generating Immersive, Explorable, and Interactive 3D Worlds
Python SDK for Claude Agent
Diffusion Transformer with Fine-Grained Chinese Understanding
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Official repository for LTX-Video
HY-Motion model for 3D character animation generation
Code for running inference with the SAM 3D Body Model 3DB
CodeGeeX2: A More Powerful Multilingual Code Generation Model
State-of-the-art (SoTA) text-to-video pre-trained model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Programmatic access to the AlphaGenome model
OCR expert VLM powered by Hunyuan's native multimodal architecture
VMZ: Model Zoo for Video Modeling
Video understanding codebase from FAIR for reproducing video models