Python inference and LoRA trainer package for the LTX-2 audio–video
Image generation model with single-stream diffusion transformer
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.2 models
Qwen3-Coder is the code version of Qwen3
Text and image to video generation: CogVideoX and CogVideo
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
A Family of Open Sourced Music Foundation Models
Open-source multi-speaker long-form text-to-speech model
A theoretical reconstruction of the Claude Mythos architecture
A multimodal model for brain response prediction
Models for object and human mesh reconstruction
Reference PyTorch implementation and models for DINOv3
Qwen3.5 is the large language model series developed by Qwen team
ChatGPT interface with better UI
Qwen3 is the large language model series developed by Qwen team
Python bindings for llama.cpp
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
LTX-Video Support for ComfyUI
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
High-Resolution Image Synthesis with Latent Diffusion Models
Let's make video diffusion practical
State-of-the-art TTS model under 25MB
Foundation model for image generation