Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Technologist Rahul Patil Named CTO of Anthropic, Maker of Claude AI

OpenAI Doubles Down on Chip Diversity With AMD, Nvidia Deals

When Your Primary Customer Folds Overnight

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
VentureBeat AI

Google’s open source AI Gemma 3 270M can run on smartphones

By Advanced AI EditorAugust 14, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now

Google’s DeepMind AI research team has unveiled a new open source AI model today, Gemma 3 270M.

As its name would suggest, this is a 270-million-parameter model — far smaller than the 70 billion or more parameters of many frontier LLMs (parameters being the number of internal settings governing the model’s behavior).

While more parameters generally translates to a larger and more powerful model, Google’s focus with this is nearly the opposite: high-efficiency, giving developers a model small enough to run directly on smartphones and locally, without an internet connection, as shown in internal tests on a Pixel 9 Pro SoC.

Yet, the model is still capable of handling complex, domain-specific tasks and can be quickly fine-tuned in mere minutes to fit an enterprise or indie developer’s needs.

AI Scaling Hits Its Limits

Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:

Turning energy into a strategic advantage

Architecting efficient inference for real throughput gains

Unlocking competitive ROI with sustainable AI systems

Secure your spot to stay ahead: https://bit.ly/4mwGngO

On the social network X, Google DeepMind Staff AI Developer Relations Engineer Omar Sanseviero added that it Gemma 3 270M can also run directly in a user’s web browser, on a Raspberry Pi, and “in your toaster,” underscoring its ability to operate on very lightweight hardware.

Gemma 3 270M combines 170 million embedding parameters — thanks to a large 256k vocabulary capable of handling rare and specific tokens — with 100 million transformer block parameters.

According to Google, the architecture supports strong performance on instruction-following tasks right out of the box while staying small enough for rapid fine-tuning and deployment on devices with limited resources, including mobile hardware.

Gemma 3 270M inherits the architecture and pretraining of the larger Gemma 3 models, ensuring compatibility across the Gemma ecosystem. With documentation, fine-tuning recipes, and deployment guides available for tools like Hugging Face, UnSloth, and JAX, developers can move from experimentation to deployment quickly.

High scores on benchmarks for its size, and high hefficiency

On the IFEval benchmark, which measures a model’s ability to follow instructions, the instruction-tuned Gemma 3 270M scored 51.2%.

The score places it well above similarly small models like SmolLM2 135M Instruct and Qwen 2.5 0.5B Instruct, and closer to the performance range of some billion-parameter models, according to Google’s published comparison.

However, as researchers and leaders at rival AI startup Liquid AI pointed out in replies on X, Google left off Liquid’s own LFM2-350M model released back in July of this year, which scored a whopping 65.12% with just a few more parameters (similar sized language model, however).

One of the model’s defining strengths is its energy efficiency. In internal tests using the INT4-quantized model on a Pixel 9 Pro SoC, 25 conversations consumed just 0.75% of the device’s battery.

This makes Gemma 3 270M a practical choice for on-device AI, particularly in cases where privacy and offline functionality are important.

The release includes both a pretrained and an instruction-tuned model, giving developers immediate utility for general instruction-following tasks.

Quantization-Aware Trained (QAT) checkpoints are also available, enabling INT4 precision with minimal performance loss and making the model production-ready for resource-constrained environments.

A small, fine-tuned version of Gemma 3 270M can perform many functions of larger LLMs

Google frames Gemma 3 270M as part of a broader philosophy of choosing the right tool for the job rather than relying on raw model size.

For functions like sentiment analysis, entity extraction, query routing, structured text generation, compliance checks, and creative writing, the company says a fine-tuned small model can deliver faster, more cost-effective results than a large general-purpose one.

The benefits of specialization are evident in past work, such as Adaptive ML’s collaboration with SK Telecom.

By fine-tuning a Gemma 3 4B model for multilingual content moderation, the team outperformed much larger proprietary systems.

Gemma 3 270M is designed to enable similar success at an even smaller scale, supporting fleets of specialized models tailored to individual tasks.

Demo Bedtime Story Generator app shows off the potential of Gemma 3 270M

Beyond enterprise use, the model also fits creative scenarios. In a demo video posted on YouTube, Google shows off a Bedtime Story Generator app built with Gemma 3 270M and Transformers.js that runs entirely offline in a web browser, showing the versatility of the model in lightweight, accessible applications.

The video highlights the model’s ability to synthesize multiple inputs by allowing selections for a main character (e.g., “a magical cat”), a setting (“in an enchanted forest”), a plot twist (“uncovers a secret door”), a theme (“Adventurous”), and a desired length (“Short”).

Once the parameters are set, the Gemma 3 270M model generates a coherent and imaginative story. The application proceeds to weave a short, adventurous tale based on the user’s choices, demonstrating the model’s capacity for creative, context-aware text generation.

This video serves as a powerful example of how the lightweight yet capable Gemma 3 270M can power fast, engaging, and interactive applications without relying on the cloud, opening up new possibilities for on-device AI experiences.

Open-sourced under a Gemma custom license

Gemma 3 270M is released under the Gemma Terms of Use, which allow use, reproduction, modification, and distribution of the model and derivatives, provided certain conditions are met.

These include carrying forward use restrictions outlined in Google’s Prohibited Use Policy, supplying the Terms of Use to downstream recipients, and clearly indicating any modifications made. Distribution can be direct or through hosted services such as APIs or web apps.

For enterprise teams and commercial developers, this means the model can be embedded in products, deployed as part of cloud services, or fine-tuned into specialized derivatives, so long as licensing terms are respected. Outputs generated by the model are not claimed by Google, giving businesses full rights over the content they create.

However, developers are responsible for ensuring compliance with applicable laws and for avoiding prohibited uses, such as generating harmful content or violating privacy rules.

The license is not open-source in the traditional sense, but it does enable broad commercial use without a separate paid license.

For companies building commercial AI applications, the main operational considerations are ensuring end users are bound by equivalent restrictions, documenting model modifications, and implementing safety measures aligned with the prohibited uses policy.

With the Gemmaverse surpassing 200 million downloads and the Gemma lineup spanning cloud, desktop, and mobile-optimized variants, Google AI Developers are positioning Gemma 3 270M as a foundation for building fast, cost-effective, and privacy-focused AI solutions, and already, it seems off to a great start.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleBuzzy AI startup Multiverse creates two of the smallest high-performing models ever
Next Article IBM signs LoI to drive Maharashtra’s quantum initiatives
Advanced AI Editor
  • Website

Related Posts

Huawei's new open source technique shrinks LLMs to make them run on less powerful, less expensive hardware

October 6, 2025

OpenAI announces Apps SDK allowing ChatGPT to launch and run third party apps like Zillow, Canva, Spotify

October 6, 2025

Beyond Von Neumann: Toward a unified deterministic architecture

October 6, 2025

Comments are closed.

Latest Posts

Tomb of Amenhotep III Reopens After Two-Decade Renovation    

Morning Links for October 6, 2025

Sotheby’s to Sell René Magritte Held in Same Collection for 100 years

Former ARTnews Publisher Dies at 97

Latest Posts

Technologist Rahul Patil Named CTO of Anthropic, Maker of Claude AI

October 6, 2025

OpenAI Doubles Down on Chip Diversity With AMD, Nvidia Deals

October 6, 2025

When Your Primary Customer Folds Overnight

October 6, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Technologist Rahul Patil Named CTO of Anthropic, Maker of Claude AI
  • OpenAI Doubles Down on Chip Diversity With AMD, Nvidia Deals
  • When Your Primary Customer Folds Overnight
  • Huawei's new open source technique shrinks LLMs to make them run on less powerful, less expensive hardware
  • Taylor Swift fans accuse singer of using AI in her Google scavenger hunt videos

Recent Comments

  1. twistyneonpangolin5Nalay on Jony Ive is building a futuristic AI device and OpenAI may acquire it
  2. Shakita Eadie on Meta Platforms (NasdaqGS:META) Collaborates With Booz Allen To Pioneer AI-Powered Space Tech
  3. Gigachelrin3Nalay on Sam & Jony introduce io
  4. twistyneonpangolin5Nalay on Sam & Jony introduce io
  5. Magistrniker4Nalay on Trump’s Tech Sanctions To Empower China, Betray America

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.