Tag

Gemma 4

All articles tagged with #gemma 4

Gemma 4 triples on-device AI speed with speculative token drafting
technology21 days ago

Gemma 4 triples on-device AI speed with speculative token drafting

Google's Gemma 4 open models gain a 3x speed boost on local hardware through Multi-Token Prediction (MTP) drafters that guess future tokens and verify them in parallel with the main model, using shared memory and sparse decoding to accelerate generation without sacrificing quality. The approach works across devices from Pixel phones to Apple M4, with varying gains by hardware and under Apache 2.0 licensing.

Gemma 4 Goes Local: Google's Offline, Multimodal AI Hits Your Device
technology1 month ago

Gemma 4 Goes Local: Google's Offline, Multimodal AI Hits Your Device

Google’s Gemma 4 is an open-source, locally installable multimodal AI that runs on smartphones and laptops without needing cloud processing, prioritizing privacy and cost efficiency. It comes in dense (31B parameters) and sparse (26B parameters) architectures using a mixture-of-experts approach to balance performance and efficiency. Capable of text, image, and audio processing, it targets coding, creative writing, UI design, healthcare and education, with deployment support through LM Studio, Ollama, Llama CPP and Supabase. Four device-optimized versions offer offline functionality and reduced reliance on cloud services, pushing accessible, private AI for areas with limited connectivity.