SuperGemma4

SuperGemma4 is a locally deployable large language model built on the Gemma 4 26B A4B instruction base, optimized for speed, flexibility, and less restricted conversational behavior. It is designed to provide a more open and natural chat experience compared to standard censored models, while still maintaining practical usability across general text, coding, and multilingual tasks, especially Korean. Unlike raw base models, it inherits improvements from the SuperGemma Fast line, resulting in better performance in logic, coding, and real-world text workflows. The model is packaged in GGUF format for efficient use with llama.cpp and has been specifically tested on Apple Silicon hardware, delivering high token speeds and smooth local inference. A neutral chat template is embedded to prevent prompt misrouting issues, ensuring consistent responses without unintended shifts into coding or tool-use modes.

Features

Uncensored conversational behavior for more flexible outputs
Optimized GGUF format for fast local inference
High token generation speed on Apple Silicon devices
Neutral chat template to prevent prompt misrouting
Improved performance over base Gemma in coding and logic
Supports multilingual tasks including Korean
Balanced general chat and coding capabilities
Compatible with llama.cpp for local deployment

Project Samples

Project Activity

See All Activity >

Follow SuperGemma4

SuperGemma4 Web Site

Other Useful Business Software

Ship Agents Faster

Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free

Rate This Project

User Reviews

Be the first to post a review of SuperGemma4!

Additional Project Details

Registered

2026-04-14

Similar Business Software

LM-Kit.NET

LM-Kit.NET is a complete local AI runtime for .NET that lets engineering teams ship AI-powered features without cloud dependencies, per-token costs, or data leaving the network. Most .NET AI integrations stop at inference. LM-Kit.NET covers the full range of capabilities production...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3.5. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
Gemini Enterprise Agent Platform

Gemini Enterprise Agent Platform is a comprehensive solution from Google Cloud designed to help organizations build, scale, govern, and optimize AI agents. It represents the evolution of Vertex AI, combining advanced model development with new capabilities for agent orchestration and...

See Software
Sarvam 30B

Sarvam-30B is an open source, next-generation large language model designed as a unified system for both real-time conversational AI and deep reasoning workloads, built with a strong focus on multilingual intelligence and practical deployment. The 30B model is optimized for speed and efficiency,...

See Software
DiffusionGemma

DiffusionGemma is an experimental open model that explores text diffusion, an exceptionally fast approach to text generation. Released under an Apache 2.0 license, this 26B Mixture of Experts (MoE) model moves beyond the sequential token-by-token processing of typical autoregressive Large...

See Software
GPT-3.5

GPT-3.5 is the next evolution of GPT 3 large language model from OpenAI. GPT-3.5 models can understand and generate natural language. We offer four main models with different levels of power suitable for different tasks. The main GPT-3.5 models are meant to be used with the text completion...

See Software