Powerful AI language model (MoE) optimized for efficiency/performance
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
From Vibe Coding to Agentic Engineering
Qwen3.5 is the large language model series developed by the Qwen team
Models for object and human mesh reconstruction
Official repository for LTX-Video
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Official inference repo for FLUX.2 models
FlashMLA: Efficient Multi-head Latent Attention Kernels
Open-source multi-speaker long-form text-to-speech model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FAIR Sequence Modeling Toolkit 2
MOSS‑TTS Family: open‑source speech and sound generation models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Qwen2.5-VL is the multimodal large language model series
Long-form streaming TTS system for multi-speaker dialogue generation
Diffusion Transformer with Fine-Grained Chinese Understanding
MiniMax-M2, a model built for Max coding & agentic workflows
Claude Code mirror, a one-stop open-source relay service
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
New family of code large language models (LLMs)
Multimodal-Driven Architecture for Customized Video Generation