Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
State-of-the-art TTS model under 25MB
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source multi-speaker long-form text-to-speech model
Qwen3.5 is the large language model series developed by Qwen team
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Visual Causal Flow
AlphaFold 3 inference pipeline
From Images to High-Fidelity 3D Assets
Industrial-level controllable zero-shot text-to-speech system
A multimodal model for brain response prediction
RGBD video generation model conditioned on camera input
Show usage stats for OpenAI Codex and Claude Code
Open Source Speech Language Model
Contexts Optical Compression
Python SDK for Claude Agent
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Video understanding codebase from FAIR for reproducing video models
State of the art LLM and coding model
FAIR Sequence Modeling Toolkit 2
Pushing the Limits of Mathematical Reasoning in Open Language Models
Long-form streaming TTS system for multi-speaker dialogue generation
Audio foundation model excelling in audio understanding