A tool to snap pixels to a perfect grid
Train multi-step agents for real-world tasks using GRPO
Tiny vision language model
Modular Deep Reinforcement Learning framework in PyTorch
Offline inference engine for art, real-time voice conversations
New set of lightweight state-of-the-art, open foundation models
AI-assisted storyboard and video generation tool
Foundation model for image generation
MemU is an open-source memory framework for AI companions
Open-sourced unified customization model
Reference PyTorch implementation and models for DINOv3
Code for openai.fm, a demo for the OpenAI Speech API
Fast State-of-the-Art Static Embeddings
Scaling Reinforcement Learning with LLMs
AI suite powered by state-of-the-art models and providing advanced AI
Industrial-level controllable zero-shot text-to-speech system
Ultralytics YOLO
PyTorch implementation of JiT
Official inference repo for FLUX.2 models
MiniMax M2.1, a SOTA model for real-world dev & agents.
A collection of awesome-lists for AI, creativity and art. AI
Multilingual Document Layout Parsing in a Single Vision-Language Model
Shared repository for open-sourced projects from the Google AI Lang
Pluggable SOTA multi-object tracking modules for segmentation
This repository contains code released by Google Research