Visual Causal Flow
Contexts Optical Compression
Qwen3-ASR is an open-source series of ASR models
General-purpose image editing model that delivers high-fidelity
Open Source Speech Language Model
Audio foundation model excelling in audio understanding
OCR expert VLM powered by Hunyuan's native multimodal architecture
Official implementation of DreamCraft3D
A Conversational Speech Generation Model
Code for the paper Hybrid Spectrogram and Waveform Source Separation