Official inference repo for FLUX.2 models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Multimodal Diffusion with Representation Alignment
gpt-oss-120b and gpt-oss-20b are two open-weight language models
A theoretical reconstruction of the Claude Mythos architecture
An easy 1-click way to create beautiful artwork on your PC using AI
Renderer for the harmony response format to be used with gpt-oss
Long-form streaming TTS system for multi-speaker dialogue generation
Qwen3-TTS is an open-source series of TTS models
Qwen3-Coder is the code version of Qwen3
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Controllable & emotion-expressive zero-shot TTS
An experimental version of DeepSeek model
Industrial-level controllable zero-shot text-to-speech system
Contexts Optical Compression
Advancing Open-source World Models
Easy Docker setup for Stable Diffusion with user-friendly UI
Qwen3-ASR is an open-source series of ASR models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Provides convenient access to the Anthropic REST API from any Python 3
A 0.1B Omni model trained from scratch
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Hunyuan Translation Model Version 1.5
Towards self-verifiable mathematical reasoning
HY-Motion model for 3D character animation generation