GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Industrial-level controllable zero-shot text-to-speech system
MOSS‑TTS Family: open‑source speech and sound generation models
Open-source, high-performance AI model with advanced reasoning
DeepSeek Coder: Let the Code Write Itself
Wan2.2: Open and Advanced Large-Scale Video Generative Model
HY-Motion model for 3D character animation generation
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Inference code for scalable emulation of protein equilibrium ensembles
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Tool for exploring and debugging transformer model behaviors
Powerful AI language model (MoE) optimized for efficiency/performance
Let's make video diffusion practical
Official implementation of DreamCraft3D
Open Source Speech Language Model
Advanced language and coding AI model
Official inference repo for FLUX.2 models
A Family of Open-Source Music Foundation Models
Code for running inference and finetuning with SAM 3 model
Video understanding codebase from FAIR for reproducing video models
A theoretical reconstruction of the Claude Mythos architecture
General-purpose image editing model that delivers high-fidelity
A SOTA open-source image editing model
Open image model at the forefront of design
Audio foundation model excelling in audio understanding