PaddlePaddle End-to-End Development Toolkit
MII makes low-latency and high-throughput inference possible
A set of Docker images for training and serving models in TensorFlow
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful local music generation model
Advanced language and coding AI model
Open-source, high-performance AI model with advanced reasoning
A high-throughput and memory-efficient inference and serving engine
Learning agent trained in a diffusion world model
Document Image Parsing via Heterogeneous Anchor Prompting”
Framework for building neural networks
Generate Any 3D Scene in Seconds
Refer and Ground Anything Anywhere at Any Granularity
PyTorch code and models for V-JEPA self-supervised learning from video
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Machine Learning Pipelines for Kubeflow
Geometric deep learning extension library for PyTorch
Powerful AI language model (MoE) optimized for efficiency/performance
A Lightweight Face Recognition and Facial Attribute Analysis
OCR software, free and offline
Agentic, Reasoning, and Coding (ARC) foundation models
Qwen3 is the large language model series developed by Qwen team
1 min voice data can also be used to train a good TTS model
DeepVariant is an analysis pipeline that uses a deep neural networks
Python inference and LoRA trainer package for the LTX-2 audio–video