gemma_pytorch provides the official PyTorch reference implementation for running and fine-tuning Google’s Gemma family of open models. It includes model definitions, configuration files, and checkpoint-loading utilities for multiple model sizes, enabling quick evaluation and downstream adaptation. The repository demonstrates text-generation pipelines, tokenizer setup, quantization paths, and adapters for low-rank and other parameter-efficient fine-tuning. Example notebooks walk through instruction tuning and evaluation so teams can benchmark and iterate rapidly. The code is organized to be legible and hackable, exposing attention blocks, positional encodings, and head configurations, and because it builds on standard PyTorch abstractions it integrates easily into existing training loops, loggers, and evaluation harnesses.
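The text-generation pipeline mentioned above follows the usual autoregressive pattern. Below is a minimal, self-contained sketch of greedy decoding with a toy PyTorch language model; the `TinyLM` class and its dimensions are illustrative placeholders, not the repository's actual model or generation API:

```python
import torch
import torch.nn as nn

class TinyLM(nn.Module):
    """Toy autoregressive LM standing in for a real Gemma checkpoint."""
    def __init__(self, vocab_size=32, dim=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.proj = nn.Linear(dim, vocab_size)

    def forward(self, ids):
        # ids: (batch, seq) -> logits: (batch, seq, vocab)
        return self.proj(self.embed(ids))

@torch.no_grad()
def greedy_generate(model, prompt_ids, max_new_tokens=8):
    """Append the argmax token at each step (greedy decoding)."""
    ids = prompt_ids
    for _ in range(max_new_tokens):
        logits = model(ids)                    # (1, seq, vocab)
        next_id = logits[:, -1, :].argmax(-1)  # most likely next token
        ids = torch.cat([ids, next_id[:, None]], dim=1)
    return ids

torch.manual_seed(0)
model = TinyLM()
prompt = torch.tensor([[1, 2, 3]])
out = greedy_generate(model, prompt, max_new_tokens=5)
print(out.shape)  # 3 prompt tokens + 5 generated -> (1, 8)
```

The real repository swaps the toy model for a loaded Gemma checkpoint and adds sampling strategies, but the decode loop has this same shape.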
Features
- PyTorch implementations and configs for Gemma model variants
- Ready-to-use generation, tokenization, and checkpoint loading
- Drop-in modules compatible with common PyTorch stacks
- Example notebooks for tuning and evaluation
- Quantization and inference optimization paths
- Parameter-efficient fine-tuning adapters and examples
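To illustrate the parameter-efficient fine-tuning path in the last bullet, here is a minimal LoRA-style adapter that freezes a base `nn.Linear` and trains only a low-rank correction. The `LoRALinear` name, rank, and scaling are assumptions for exposition, not the repository's adapter API:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base Linear plus a trainable low-rank update (W + scale * B @ A)."""
    def __init__(self, base: nn.Linear, rank=4, alpha=8.0):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)      # freeze pretrained weight
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)    # freeze bias too
        # Low-rank factors: A projects down to `rank`, B projects back up.
        # B starts at zero so the adapter initially leaves the output unchanged.
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        # Base output plus scaled low-rank correction.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

torch.manual_seed(0)
layer = LoRALinear(nn.Linear(16, 16), rank=4)
x = torch.randn(2, 16)
y = layer(x)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(y.shape, trainable)  # (2, 16); 4*16 + 16*4 = 128 trainable params
```

Only the 128 low-rank parameters receive gradients, versus 272 in the frozen base layer; at Gemma scale this gap is what makes adapter tuning fit on modest hardware.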