Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
State-of-the-art TTS model under 25MB
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Open-source multi-speaker long-form text-to-speech model
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Video understanding codebase from FAIR for reproducing video models
Locally run an Instruction-Tuned Chat-Style LLM
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)