Text and image to video generation: CogVideoX and CogVideo
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
FAIR Sequence Modeling Toolkit 2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Renderer for the harmony response format to be used with gpt-oss
DeepMind model for tracking arbitrary points across videos & robotics
VMZ: Model Zoo for Video Modeling
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
code for Mesh R-CNN, ICCV 2019
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Powerful AI language model (MoE) optimized for efficiency/performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Open-source, high-performance AI model with advanced reasoning
The most powerful local music generation model
Official Python inference and LoRA trainer package
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
Advanced language and coding AI model
Official inference repo for FLUX.2 models
Awesome multilingual OCR toolkits based on PaddlePaddle
Python inference and LoRA trainer package for the LTX-2 audio–video
Qwen3-TTS is an open-source series of TTS models