Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
State-of-the-art TTS model under 25MB
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source multi-speaker long-form text-to-speech model
Qwen3.5 is the large language model series developed by Qwen team
Visual Causal Flow
Industrial-level controllable zero-shot text-to-speech system
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
From Images to High-Fidelity 3D Assets
A multimodal model for brain response prediction
RGBD video generation model conditioned on camera input
Python SDK for Claude Agent
Show usage stats for OpenAI Codex and Claude Code
Open Source Speech Language Model
State of the art LLM and coding model
Claude Code image, a one-stop open source transit service
Contexts Optical Compression
Video understanding codebase from FAIR for reproducing video models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Qwen3-ASR is an open-source series of ASR models
General-purpose image editing model that delivers high-fidelity
FAIR Sequence Modeling Toolkit 2
Pushing the Limits of Mathematical Reasoning in Open Language Models