SkyPilot: Run AI and batch jobs on any infra
Document content and metadata extraction microservice
Improve human sleep through scientifically
AI Agent designed to interact with ROS1- and ROS2-based robotics systems
Open-source framework for intelligent speech interaction
Code for Mesh R-CNN, ICCV 2019
Deploy and share agents with open infrastructure
No-code multi-agent framework to build LLM Agents, workflows
Open platform connecting AI agents to tools via unified MCP server
Accessible large language models via k-bit quantization for PyTorch
Open-sourced unified customization model
RGBD video generation model conditioned on camera input
Benchmarking synthetic data generation methods
Spatiotemporal Signal Processing with Neural Machine Learning Models
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
Controllable & emotion-expressive zero-shot TTS
Inference Llama 2 in one file of pure C
UI-TARS-desktop version that can operate on your local personal device
Autonomous LLM agent for end-to-end data science workflows
A Next-Generation Training Engine Built for Ultra-Large MoE Models
PyTorch code and models for the DINOv2 self-supervised learning
Fault-tolerant, highly scalable GPU orchestration
All course materials for the Zero to Mastery Machine Learning
Director, Screenwriter, Producer, and Video Generator All-in-One
Block Diffusion for Ultra-Fast Speculative Decoding