A Family of Open Sourced Music Foundation Models
Code for running inference and finetuning with SAM 3 model
Lets make video diffusion practical
Official inference repo for FLUX.2 models
The official repo of Qwen chat & pretrained large language model
Industrial-level controllable zero-shot text-to-speech system
Repo for SeedVR2 & SeedVR
A Powerful Native Multimodal Model for Image Generation
HY-Motion model for 3D character animation generation
DeepSeek Coder: Let the Code Write Itself
An AI-powered security review GitHub Action using Claude
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Block Diffusion for Ultra-Fast Speculative Decoding
Easy Docker setup for Stable Diffusion with user-friendly UI
Hunyuan Translation Model Version 1.5
CLIP, Predict the most relevant text snippet given an image
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Generate Any 3D Scene in Seconds
Inference code for scalable emulation of protein equilibrium ensembles
Chat & pretrained large audio language model proposed by Alibaba Cloud
A Pragmatic VLA Foundation Model
Personalize Any Characters with a Scalable Diffusion Transformer
FAIR Sequence Modeling Toolkit 2
Official DeiT repository
Memory-efficient and performant finetuning of Mistral's models