Python SDK for Claude Agent
Visual Causal Flow
From Images to High-Fidelity 3D Assets
Video Object and Interaction Deletion
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
Qwen3.5 is the large language model series developed by the Qwen team
Open Source Speech Language Model
RGBD video generation model conditioned on camera input
Bidirectional token-classification model for identifiable information
Contexts Optical Compression
Qwen3-ASR is an open-source series of ASR models
Long-form streaming TTS system for multi-speaker dialogue generation
Project Lyra: Open Generative 3D World Models
Audio foundation model excelling in audio understanding
Controllable & emotion-expressive zero-shot TTS
Foundational Models for State-of-the-Art Speech and Text Translation
Analyze computation-communication overlap in V3/R1
Pushing the Limits of Mathematical Reasoning in Open Language Models
Let us control diffusion models
LLM providing reasoning and conversational capabilities
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Compact 8B multimodal instruct model optimized for edge deployment
Efficient 14B multimodal instruct model with edge deployment and FP8
Compact 3B-parameter multimodal model for efficient on-device reasoning
Powerful 14B LLM with strong instruction and long-text handling