Library for OCR-related tasks powered by Deep Learning
A Modular Simulation Framework and Benchmark for Robot Learning
MemU is an open-source memory framework for AI companions
Automate browser-based workflows with LLMs and Computer Vision
Operating LLMs in production
Agent S: an open agentic framework that uses computers like a human
OCR expert VLM powered by Hunyuan's native multimodal architecture
Efficient Retrieval Augmentation and Generation Framework
Industrial-level controllable zero-shot text-to-speech system
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Qwen3-Coder is the code version of Qwen3
A Model Context Protocol (MCP) server
TFDS is a collection of datasets ready to use with TensorFlow,
Automate native Android apps with AI using accessibility APIs
Multilingual Automatic Speech Recognition with word-level timestamps
Deep learning optimization library making distributed training easy
A Python library for audio
Advancing Open-source World Models
⚡ Building applications with LLMs through composability ⚡
This repository contains code released by Google Research
Flower: A Friendly Federated Learning Framework
Inference script for Oasis 500M
Text-space optimizer that trains reusable natural-language skills
Gemma open-weight LLM library, from Google DeepMind
Portia Labs Python SDK for building agentic workflows