Synthetic data curation for post-training and data extraction
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
Easy token price estimates for 400+ LLMs. TokenOps
From nobody to big model (LLM) hero
Deploy your agentic worfklows to production
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
Modular AI runtime for robots
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention
An open-source, modern-design AI training tracking and visualization
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
The first AI agent that builds permissionless integrations
A python module to repair invalid JSON from LLMs
Open-source model for program synthesis
Cybersecurity AI (CAI), the framework for AI Security
Using AI models to automatically provide commentary and edit videos
Llama Chinese community, real-time aggregation
One-stop solution for creating your digital avatar from chat history
Memory Management Kit for Agents
RAG Search API
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG
This repository contains code released by Google Research