Gemma: Google’s family of compact AI models

Google Gemma is a collection of smaller models built by Google DeepMind and other Google teams, drawing on the architecture and techniques used in the broader Gemini lineup. Although Gemma models are lightweight enough to run on a developer’s laptop or desktop, they retain the core innovations found in larger Gemini offerings, enabling strong performance in many tasks while keeping resource needs low.

Built-in safety and alignment practices

To reduce risks and improve behavior consistency, Gemma’s training pipeline includes automated removal of sensitive content from datasets and alignment steps using reinforcement learning from human feedback. The development process also incorporates both human-led red-team exercises and automated adversarial evaluations to surface weaknesses before release.

Available developer-facing safety resources include:

  • A toolkit for classifying and mitigating unsafe outputs, plus utilities for troubleshooting model behavior.
  • Guidance and best-practice documentation based on Google’s internal safety work.

Support for development, deployment, and hardware

Gemma is optimized to work across common ML frameworks and runtimes, including native support for Keras 3, JAX, PyTorch, and TensorFlow, making fine-tuning and integration straightforward. The models are designed to run on a range of devices — from laptops and mobile phones to IoT devices and cloud instances — facilitating experimentation in diverse environments.

Key platform and deployment features:

  • GPU optimizations developed in partnership with NVIDIA to improve throughput on compatible hardware.
  • One-click deployment and MLOps tooling available through Vertex AI for Google Cloud customers.
  • Free community access via Kaggle, plus promotional credits for new Google Cloud users and grant opportunities for researchers seeking cloud compute.

Availability and limitations

Gemma provides broad access to powerful, smaller-scale models, but it is not an open-source release: Google has not published the source code or full training datasets required to independently retrain the models from scratch. Developers and researchers can still leverage the provided binaries, APIs, and managed services to build and evaluate applications.

Takeaway

Gemma offers a practical path to experiment with high-quality, resource-efficient models that emphasize safety and portability. Its combination of alignment measures, tooling, and multi-platform support makes it useful for rapid prototyping and research, while the absence of full open-source artifacts means complete reproducibility of training is not available.

Technical

Title
Google Gemma
Requirements
  • Windows
Language
No language has been specified.
Available languages
License
  • Free
Latest update
2024-02-22
Author
Google LLC
Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This App
Login To Rate This App

User Reviews

Be the first to post a review of Google Gemma!