Official inference repo for FLUX.2 models
Qwen3-omni is a natively end-to-end, omni-modal LLM
gpt-oss-120b and gpt-oss-20b are two open-weight language models
A theoretical reconstruction of the Claude Mythos architecture
An easy 1-click way to create beautiful artwork on your PC using AI
Multimodal Diffusion with Representation Alignment
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Renderer for the harmony response format to be used with gpt-oss
Long-form streaming TTS system for multi-speaker dialogue generation
Qwen3-TTS is an open-source series of TTS models
Qwen3-Coder is the code version of Qwen3
Controllable & emotion-expressive zero-shot TTS
Industrial-level controllable zero-shot text-to-speech system
An experimental version of DeepSeek model
Advancing Open-source World Models
Contexts Optical Compression
Easy Docker setup for Stable Diffusion with user-friendly UI
A 0.1B Omni model trained from scratch
Qwen3-ASR is an open-source series of ASR models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Provides convenient access to the Anthropic REST API from any Python 3
HY-Motion model for 3D character animation generation
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
code for Mesh R-CNN, ICCV 2019
Hunyuan Translation Model Version 1.5