LLM-based Reinforcement Learning audio edit model
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
Inference code for scalable emulation of protein equilibrium ensembles
Programmatic access to the AlphaGenome model
Safety reasoning models built upon gpt-oss
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Instructions on how to use the Realtime API on Microcontrollers
Tool for exploring and debugging transformer model behaviors
Genome modeling and design across all domains of life
Achieving a 3x+ generation speedup on reasoning tasks
Ultra-Efficient LLMs on End Devices
Pretrained time-series foundation model developed by Google Research
Continuous Autonomy for the AI SDK
Fast and Universal 3D reconstruction model for versatile tasks
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
New set of lightweight, state-of-the-art open foundation models
Foundation Models for Time Series