Google launches Gemma 4 for servers and Gemini Nano 4 for mobile devices
06.04.26
Google continues to expand the capabilities of its large language models. Following the success of Gemini 3 Pro, the company has introduced the new Gemma 4 series, bringing high-performance LLMs to both server systems and mobile devices.
Different versions for different needs
Gemma 4 is available in four versions:
- For lightweight devices — 2B and 4B “Effective”;
- For powerful servers — 26B “Mixture of Experts” and 31B “Dense”.
The number of parameters is a rough indicator of how well a model handles complex tasks: more parameters usually mean better results, but also higher memory and compute requirements.
In Arena AI tests, the 31B Dense and 26B MoE models ranked highly, outperforming models 20 times their size. All versions support video and image processing, while the smaller models also accept audio input and can perform speech recognition.
Offline coding and Apache 2.0 license
Gemma 4 models can generate code offline and have been trained on more than 140 languages. They are released under the Apache 2.0 license, which gives developers the freedom to adapt and deploy them without the restrictions imposed by the licenses of previous Gemma versions.
The Effective models are optimized for smartphones, Raspberry Pi, and Jetson Nano, where they run with low power consumption and low latency. Server versions are designed for NVIDIA H100 GPUs with 80 GB of VRAM.
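To make the link between parameter count and hardware more concrete, here is a rough back-of-the-envelope sketch in Python. It estimates weight-only memory from the nominal sizes named above and standard bytes-per-parameter figures. It is an illustration under stated assumptions, not Google's sizing guidance: the function name and the choice of precisions are ours, and real usage is higher once the KV cache, activations, and runtime overhead are included.

```python
# Bytes needed to store one parameter at common numeric precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate weight-only memory in GB (1 GB = 1e9 bytes).

    Ignores KV cache, activations, and framework overhead.
    """
    return params_billions * 1e9 * BYTES_PER_PARAM[precision] / 1e9


# Nominal sizes from the article; the true parameter counts may differ.
for size in (2, 4, 26, 31):
    row = ", ".join(
        f"{p}: {weight_memory_gb(size, p):.1f} GB" for p in BYTES_PER_PARAM
    )
    print(f"{size}B -> {row}")
```

By this estimate, 31 billion parameters stored as 16-bit values take about 62 GB for the weights alone, which is consistent with targeting a single 80 GB accelerator.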
Performance and scalability
The 26B MoE model activates only a subset of its parameters for each token during inference, which speeds up token generation, while the 31B Dense model focuses on output quality and can be fine-tuned for specific tasks.
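The idea behind a Mixture of Experts layer can be shown with a toy sketch. The code below is a generic top-k routing example, not Gemma 4's actual architecture: the number of experts, the value of `top_k`, and the "experts" themselves (simple scaling functions) are hypothetical placeholders chosen for readability.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(scale):
    # Hypothetical stand-in for an expert feed-forward block.
    return lambda x: [scale * v for v in x]


def moe_forward(x, router_weights, experts, top_k=2):
    """Route one token vector through only the top_k highest-scoring experts."""
    # Router: one score per expert (dot product of the token with a router row).
    logits = [sum(w * v for w, v in zip(row, x)) for row in router_weights]
    # Select the top_k experts; all other experts are never evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Normalize the selected scores into mixing weights.
    gates = softmax([logits[i] for i in chosen])
    out = [0.0] * len(x)
    for gate, idx in zip(gates, chosen):
        y = experts[idx](x)
        out = [o + gate * v for o, v in zip(out, y)]
    return out, chosen


random.seed(0)
dim, n_experts = 3, 4  # toy sizes, chosen only for this example
experts = [make_expert(s) for s in (1.0, 2.0, 3.0, 4.0)]
router = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(n_experts)]
token = [0.5, -1.0, 2.0]
output, used = moe_forward(token, router, experts)
print("experts used for this token:", used)
```

Only two of the four toy experts run for each token, so the compute per token is lower than the total parameter count suggests. That is the trade-off the MoE variant relies on.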
Gemma 4 supports agent-based workflows, function calling, structured JSON output, and context windows of up to 256K tokens on the larger models; cloud-hosted Gemini models extend this to 1 million tokens.
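In practice, function calling with structured JSON output usually means the model returns a JSON object naming a tool and its arguments, and the application runs the matching function. The sketch below illustrates that pattern on the application side. The JSON shape (`name` and `arguments`), the `get_weather` tool, and the `dispatch` helper are hypothetical and are not Gemma 4's documented format.

```python
import json


def get_weather(city: str) -> str:
    # Hypothetical local tool; a real one would query a weather service.
    return f"Sunny in {city}"


# Registry of tools the application is willing to run.
TOOLS = {"get_weather": get_weather}


def dispatch(model_output: str) -> str:
    """Parse a JSON tool call emitted by a model and run the matching tool."""
    call = json.loads(model_output)  # raises ValueError on malformed JSON
    name = call["name"]
    args = call.get("arguments", {})
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name}")
    return TOOLS[name](**args)


# Assumed example of what a model might emit in structured-output mode.
reply = '{"name": "get_weather", "arguments": {"city": "Kyiv"}}'
print(dispatch(reply))
```

Checking the tool name against an explicit registry keeps the model from triggering arbitrary code, which matters more as agent-style workflows chain several calls together.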
Gemini Nano 4 for mobile devices
Google also unveiled Gemini Nano 4 for mobile devices. The Nano 4 Fast version, based on Gemma 4 E2B, is optimized for speed and low latency, while Nano 4 Full, based on Gemma 4 E4B, prioritizes quality over speed.
Nano 4 operates up to four times faster than previous models and reduces battery consumption by up to 60%. Code written for Gemma 4 is fully compatible with Nano 4.
In the near future, Google will add support for tool invocation, structured output, system prompts, and a “thinking” mode in the Prompt API for developers.