Zac Zuo

Gemma 3n - Run powerful multimodal AI right on your phone

Gemma 3n is Google's new open model, optimized for on-device multimodal AI. Its novel MatFormer architecture enables powerful yet efficient models (such as the 2B and 4B effective-parameter variants) that run locally on phones and laptops. It supports image, audio, and video input.

Zac Zuo
Hunter

Hi everyone!

Google is back with a major release for their open model family: Gemma 3n. It’s a big step forward for powerful, on-device multimodal AI.

It uses the new MatFormer architecture, which works like a Matryoshka doll: a single larger model contains smaller, fully functional models inside. This gives developers a lot of flexibility. You can deploy a small model with 2B effective parameters for speed, a more powerful 4B version, or even create custom sizes in between.
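To make the Matryoshka idea concrete, here is a toy sketch in plain Python. It is not Gemma 3n's actual code: the function names (`make_ffn`, `ffn_forward`, `slice_ffn`), the sizes, and the random weights are all made up for illustration. It only shows the nesting property, where the first `width` hidden units of a larger feed-forward block can be cut out and run as a smaller standalone block.

```python
import random


def make_ffn(d_model, d_hidden, seed=0):
    """Build random weights for a toy feed-forward block (hypothetical helper)."""
    rng = random.Random(seed)
    # w_in has one row per hidden unit; w_out has one row per output dimension.
    w_in = [[rng.uniform(-1, 1) for _ in range(d_model)] for _ in range(d_hidden)]
    w_out = [[rng.uniform(-1, 1) for _ in range(d_hidden)] for _ in range(d_model)]
    return w_in, w_out


def ffn_forward(x, w_in, w_out, width):
    """Run the block using only the first `width` hidden units (ReLU activation)."""
    hidden = [
        max(0.0, sum(w * xi for w, xi in zip(w_in[j], x)))
        for j in range(width)
    ]
    return [
        sum(w_out[i][j] * hidden[j] for j in range(width))
        for i in range(len(w_out))
    ]


def slice_ffn(w_in, w_out, width):
    """Extract the nested smaller block as standalone weights."""
    return w_in[:width], [row[:width] for row in w_out]


# One "large" block with 8 hidden units, and a nested "small" one with 4.
w_in, w_out = make_ffn(d_model=4, d_hidden=8)
x = [0.5, -1.0, 0.25, 2.0]

small_in, small_out = slice_ffn(w_in, w_out, width=4)

# The extracted small block matches the large block restricted to 4 hidden units.
same = ffn_forward(x, w_in, w_out, 4) == ffn_forward(x, small_in, small_out, 4)
print("nested sub-model matches:", same)
```

In a real MatFormer-style model the nested widths are trained jointly so that each smaller slice performs well on its own; the sketch above only demonstrates the weight sharing, not the training.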

And it’s built to be very efficient. Techniques like Per-Layer Embeddings mean only part of the model needs to live in accelerator memory (VRAM), which makes it viable for phones and laptops. It’s also fully multimodal, handling image, audio, and video inputs with strong performance.

Zaheer Khan

For those of us involved in the Google ecosystem — from Firebase to Android to Cloud — this brings some much-needed structure. Excited to see how it evolves.

Rohan Gayen

How can I use these models in Next.js apps? Are there any APIs? I can't find info online.

Sami El Kouroka

Google has cooked this year. Good job, Gemini team!

Ilya Vorobiev

Gemma family keeps getting better 🔥 This is huge! Local LLMs are definitely the future - no more worrying about internet connection or privacy. Massive progress from the team, thank you! Can't wait to build with this 🚀