GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Programmatic access to the AlphaGenome model
Block Diffusion for Ultra-Fast Speculative Decoding
Python SDK for Claude Agent
Tool for exploring and debugging transformer model behaviors
A Unified Framework for Text-to-3D and Image-to-3D Generation
Tencent Hunyuan Multimodal Diffusion Transformer (MM-DiT) model
Sharp Monocular Metric Depth in Less Than a Second
GPT-4V-level open-source multimodal model based on Llama3-8B
Chat and pretrained large vision-language model
Release for Improved Denoising Diffusion Probabilistic Models
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
LTX-Video Support for ComfyUI
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Official inference repo for FLUX.1 models
Pushing the Limits of Mathematical Reasoning in Open Language Models
Open-source large language model family from Tencent Hunyuan
Open-source multi-speaker long-form text-to-speech model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Chinese and English multimodal conversational language model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
HY-Motion model for 3D character animation generation