Open-source platform for building enterprise-grade agents
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
MobileLLM: Optimizing Sub-billion Parameter Language Models
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A Production-ready Reinforcement Learning AI Agent Library
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
Hackable and optimized Transformers building blocks
[CVPR 2025 Best Paper Award] VGGT
Code to accompany "A Method for Animating Children's Drawings"
Memory-efficient and performant finetuning of Mistral's models
Official implementation of DreamCraft3D
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Diffusion Transformer with Fine-Grained Chinese Understanding
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
State-of-the-art TTS model under 25MB
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The Memory layer for AI Agents
Structured outputs for LLMs
Low-latency REST API for serving text-embeddings
Python client for Telegram's tdlib