Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.2 models
A Production-ready Reinforcement Learning AI Agent Library
Text and image to video generation: CogVideoX and CogVideo
Official repository for LTX-Video
code for Mesh R-CNN, ICCV 2019
Multimodal embedding and reranking models built on Qwen3-VL
Accurate × Fast × Comprehensive
Robust Speech Recognition Across Languages, Dialects
Qwen-Image is a powerful image generation foundation model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
RGBD video generation model conditioned on camera input
Advancing Open-source World Models
Collection of Gemma 3 variants that are trained for performance
CogView4, CogView3-Plus and CogView3(ECCV 2024)
VMZ: Model Zoo for Video Modeling
Open-weight, large-scale hybrid-attention reasoning model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Controllable & emotion-expressive zero-shot TTS
Contexts Optical Compression
Provides convenient access to the Anthropic REST API from any Python 3
Pretrained time-series foundation model developed by Google Research
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Video Object and Interaction Deletion
HY-Motion model for 3D character animation generation