Gemma 4 is Google DeepMind’s flagship dense open-weight multimodal model, designed for high-end reasoning, coding, agentic workflows, and multimodal understanding. The model contains approximately 30.7B parameters and supports text and image inputs with text generation output, while also processing video as image-frame sequences. Built as the most capable model in the Gemma 4 family, it combines strong reasoning performance with a large 256K-token context window and configurable thinking modes. Gemma 4 31B supports native function calling, structured outputs, and more than 140 languages, making it suitable for enterprise assistants, coding agents, document analysis, and multilingual applications. Google positions it as a frontier-level model that can run on consumer GPUs and workstations while achieving leading results across reasoning, mathematics, coding, and multimodal benchmarks.
Features
- 30.7B-parameter dense transformer architecture
- Multimodal support for text, images, and video frames
- 256K-token context window for long-document reasoning
- Configurable thinking modes for deeper reasoning workflows
- Native function calling and structured output support
- Supports over 140 languages for multilingual applications
- Strong performance in coding, mathematics, and reasoning tasks
- Optimized for consumer GPUs, workstations, and local deployment