CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
RGBD video generation model conditioned on camera input
GLM-4 series: Open Multilingual Multimodal Chat LMs
Python SDK for Claude Agent
Bidirectional token-classification model for identifiable info
General-purpose image editing model that delivers high-fidelity
Easy Docker setup for Stable Diffusion with user-friendly UI
A Production-ready Reinforcement Learning AI Agent Library
PyTorch code and models for the DINOv2 self-supervised learning
Open-source large language model family from Tencent Hunyuan
Code for running inference with the SAM 3D Body Model (3DB)
Sharp Monocular Metric Depth in Less Than a Second
Towards self-verifiable mathematical reasoning
An Efficient Agentic Model for Computer Use
Robust Speech Recognition Across Languages, Dialects
Repo for SeedVR2 & SeedVR
A 0.1B Omni model trained from scratch
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Official repository for LTX-Video
Official implementation of Watermark Anything with Localized Messages
Multimodal Diffusion with Representation Alignment
A Customizable Image-to-Video Model based on HunyuanVideo
Global weather forecasting model using graph neural networks and JAX
Uncommon Objects in 3D dataset