MII makes low-latency and high-throughput inference possible
AI agents running research on single-GPU nanochat training
TensorRT LLM provides users with an easy-to-use Python API
NeurIPS2025 Spotlight] Quantized Attention
Official inference framework for 1-bit LLMs
An open sourced end-to-end VLM-based GUI Agent
Numerical differential equation solvers in JAX
Core ML tools contain supporting tools for Core ML model conversion
Unified Model Serving Framework
Chat & pretrained large vision language model
Code for Cicero, an AI agent that plays the game of Diplomacy
Pytorch domain library for recommendation systems
A PyTorch-based Speech Toolkit
An MCP server for interacting with Google Colab
Traditional Mandarin LLMs for Taiwan
A library for accelerating Transformer models on NVIDIA GPUs
Models and examples built with TensorFlow
Open deep learning compiler stack for cpu, gpu, etc.
DeepMind model for tracking arbitrary points across videos & robotics
Deep learning library
Software that uses AI to perform real-time voice conversion
Deep learning optimization library: makes distributed training easy
A Customizable Image-to-Video Model based on HunyuanVideo
AI Suite for upscaling, interpolating & restoring images/videos
Open-source, high-performance Mixture-of-Experts large language model