The official PyTorch implementation of Google's Gemma models
Pokee Deep Research Model Open Source Repo
Long-form streaming TTS system for multi-speaker dialogue generation
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Multimodal embedding and reranking models built on Qwen3-VL
Inference script for Oasis 500M
A trainable PyTorch reproduction of AlphaFold 3
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
This repository contains the official implementation of FastVLM
Memory-efficient and performant finetuning of Mistral's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
An Efficient Agentic Model for Computer Use
New family of code large language models (LLMs)
Unified Multimodal Understanding and Generation Models
A series of math-specific large language models of our Qwen2 series
Tiny vision language model
Repo of Qwen2-Audio chat & pretrained large audio language model
Official implementation of Watermark Anything with Localized Messages
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
CLIP, Predict the most relevant text snippet given an image
Pretrained time-series foundation model developed by Google Research
4M: Massively Multimodal Masked Modeling