Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Guiding Instruction-based Image Editing via Multimodal Large Language
A Production-ready Reinforcement Learning AI Agent Library
Official DeiT repository
[CVPR 2025 Best Paper Award] VGGT
A collection of reference Jupyter notebooks and demo AI/ML application
Making large AI models cheaper, faster and more accessible
PyTorch code and models for V-JEPA self-supervised learning from video
Democratizing Reinforcement Learning for LLMs
SOTA discrete acoustic codec models with 40/75 tokens per second
Sample code and notebooks for Generative AI on Google Cloud
Volcano Engine Reinforcement Learning for LLMs
Language modeling in a sentence representation space
airda (Air Data Agent)
A Python library for audio data augmentation
Train machine learning models within Docker containers
Data Lake for Deep Learning. Build, manage, and query datasets
Bailing is a voice dialogue robot similar to GPT-4o
Interface for OuteTTS models
Plug-and-play library to enable agents to call MCP and UTCP tools
MII makes low-latency and high-throughput inference possible
A fast, powerful, and simple hierarchical vision transformer
Self-Modifying Framework from the Future
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
UI-TARS-desktop version that can operate on your local personal device