Model Variants
Four purpose-built variants spanning edge devices to workstation-class hardware, all released under the Apache 2.0 license.
Gemma 4 E2B
Ultra-lightweight model optimized for on-device and edge deployments. Delivers strong performance in a compact footprint suitable for mobile and IoT applications.
Gemma 4 E4B
Balanced model offering excellent quality-to-size ratio. Ideal for laptop and desktop deployments where resources are limited but high-quality output is required.
Gemma 4 26B A4B
Sparse Mixture-of-Experts architecture with 128 experts, activating only about 4B of its 26B parameters per token. Targets large-model quality at roughly the per-token compute cost of a 4B model, although all 26B parameters must still fit in memory.
Gemma 4 31B
Flagship dense model and the highest-quality variant in the family. Best choice when maximum quality and reasoning depth are the priority.
Model Comparison
| | E2B | E4B | 26B MoE | 31B Dense |
|---|---|---|---|---|
| Parameters | 2B | 4B | 26B (A4B) | 31B |
| Architecture | Dense | Dense | MoE (128 experts) | Dense |
| Context Length | 128K | 128K | 256K | 256K |
| Modalities | Text, Image, Audio | Text, Image, Audio | Text, Image, Video | Text, Image, Video, Audio |
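The difference between total and active parameters in the 26B MoE column can be made concrete with a rough calculation. The sketch below assumes the common rule of thumb that a forward pass costs about 2 FLOPs per active parameter per token. That rule, the helper names `active_fraction` and `approx_forward_gflops_per_token`, and the treatment of the nominal sizes in the table as exact parameter counts are all illustrative assumptions, not published Gemma 4 figures.

```python
# Nominal sizes from the comparison table, in billions of parameters.
# "active_b" is the count used per token; for dense models it equals "total_b".
VARIANTS = {
    "E2B": {"total_b": 2, "active_b": 2},
    "E4B": {"total_b": 4, "active_b": 4},
    "26B MoE": {"total_b": 26, "active_b": 4},
    "31B Dense": {"total_b": 31, "active_b": 31},
}


def active_fraction(name):
    """Share of a variant's weights that participate in each token."""
    v = VARIANTS[name]
    return v["active_b"] / v["total_b"]


def approx_forward_gflops_per_token(active_billions):
    """Rule-of-thumb estimate (assumption): ~2 FLOPs per active parameter."""
    return 2 * active_billions  # billions of parameters * 2 = GFLOPs


for name in VARIANTS:
    v = VARIANTS[name]
    print(
        f"{name}: {active_fraction(name):.0%} of weights active, "
        f"~{approx_forward_gflops_per_token(v['active_b']):.0f} GFLOPs per token"
    )
```

Under these assumptions the MoE variant does roughly the per-token work of E4B while carrying most of the weight count of the 31B dense model, which is why its memory needs sit close to the dense flagship rather than to E4B.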
Hardware Recommendations
Find the right hardware configuration for your Gemma 4 deployment based on model variant and use case.
| Hardware | Recommended model |
|---|---|
| Smartphone / Edge Device | Gemma 4 E2B |
| Laptop / Desktop | Gemma 4 E4B |
| Desktop GPU | Gemma 4 26B MoE |
| Workstation / Server | Gemma 4 31B Dense |
VRAM Requirements
| Model | BF16 | INT8 | INT4 |
|---|---|---|---|
| Gemma 4 E2B | 4 GB | 2.5 GB | 1.5 GB |
| Gemma 4 E4B | 8 GB | 5 GB | 3 GB |
| Gemma 4 26B (MoE) | 52 GB | 28 GB | 16 GB |
| Gemma 4 31B (Dense) | 62 GB | 33 GB | 18 GB |
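The BF16 column above matches a simple estimate of 2 bytes per parameter, while the INT8 and INT4 figures sit somewhat above the raw weight size, presumably because of quantization metadata or runtime overhead (an assumption, since the table does not say). The following minimal sketch reproduces that arithmetic and picks the largest variant that fits a given VRAM budget using the table values. The names `raw_weight_gb` and `largest_fit` are hypothetical helpers, and the sketch assumes 1 GB = 10^9 bytes.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"BF16": 2.0, "INT8": 1.0, "INT4": 0.5}

# Nominal parameter counts in billions (MoE memory uses the full 26B).
PARAMS_B = {
    "Gemma 4 E2B": 2,
    "Gemma 4 E4B": 4,
    "Gemma 4 26B (MoE)": 26,
    "Gemma 4 31B (Dense)": 31,
}

# VRAM figures in GB copied from the table, ordered smallest to largest.
VRAM_GB = {
    "Gemma 4 E2B": {"BF16": 4, "INT8": 2.5, "INT4": 1.5},
    "Gemma 4 E4B": {"BF16": 8, "INT8": 5, "INT4": 3},
    "Gemma 4 26B (MoE)": {"BF16": 52, "INT8": 28, "INT4": 16},
    "Gemma 4 31B (Dense)": {"BF16": 62, "INT8": 33, "INT4": 18},
}


def raw_weight_gb(params_billions, precision):
    """Weights-only size: billions of parameters times bytes per parameter."""
    return params_billions * BYTES_PER_PARAM[precision]


def largest_fit(budget_gb, precision):
    """Largest variant whose listed VRAM figure fits the budget, else None."""
    fitting = [m for m, row in VRAM_GB.items() if row[precision] <= budget_gb]
    return fitting[-1] if fitting else None  # dict order is smallest to largest


for model, params in PARAMS_B.items():
    print(model, raw_weight_gb(params, "BF16"), "GB raw at BF16")

print(largest_fit(24, "INT4"))
print(largest_fit(24, "BF16"))
```

The table does not state whether its figures include the KV cache or activations, so it is safer to leave headroom beyond the listed number, particularly at long context lengths.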