Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

AI could have ‘human-level’ intelligence in next few years, Google DeepMind CEO says

Google’s Gemma 3 270M is a compact yet powerful AI model that can run on your toaster

Tesla upgrades EV voice assistant system with AI from DeepSeek and ByteDance

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Google Gemma

Google’s Gemma 3 270M is a compact yet powerful AI model that can run on your toaster

By Advanced AI EditorAugust 24, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Google LLC’s DeepMind artificial intelligence lab has released one of its smallest models yet in the shape of Gemma 3 270M, with just 270 million parameters.

That means it’s much smaller than many of the most powerful frontier large language models, which generally have billions of parameters, or internal settings that govern their behavior.

The number of parameters in a model generally describes how powerful it is, but with Gemma 3 270M, Google has opted to create something that’s much more streamlined, with the intention being that it can run directly on low power devices such as smartphones, without an internet connection. Despite this, Google says Gemma 3 270M is still more than capable of handling a narrow range of complex, domain-specific tasks, because developers can quickly fine-tune it to meet their needs.

Google DeepMind Staff AI Developer Relations Engineer Omar Sanseviero said in a post on X that Gemma 3 270M is open-source and small enough to run “in your toaster,” or alternatively on a device such as the palm-sized Raspberry Pi computer.

This can run in your toaster or directly in your browser

Try it in https://t.co/KAfiH3hUnf

— Omar Sanseviero (@osanseviero) Aug. 14, 2025

— Omar Sanseviero (@osanseviero) August 14, 2025

In a blog post announcing Gemma 3 270M, Google’s DeepMind team explained that the model combines 170 million “embedding parameters” with 100 million “transformer block parameters.” It’s able to handle very specific and rare tokens too, making it a “strong base model” that can be fine-tuned on specific tasks and languages.

The company added that Gemma 3 270M’s architecture is suitable for “strong performance” in instruction-following tasks, yet small enough to be fine-tuned rapidly and deployed on devices with limited power. Its architecture is based on the larger Gemma 3 models, which are designed to run on a single graphics processing unit, and comes with various fine-tuning recipes, documentation and deployment guides for developer tools including Hugging Face, JAX and UnSlot to help users start building applications for the model quickly.

Strong performance in instruction following

Gemma 3 270M’s benchmark results look fairly impressive. On the IFEval benchmark, which aims to measure AI models’ ability to follow instructions properly, an instruction-tuned version of the model achieved a 51.2% score, according to results shared on X. That surpasses the score of similarly sized small models such as Qwen 2.5 0.5B Instruct and SmolLM2 135M Instruct by a large margin. It’s also not far behind some of the smaller billion-parameter models, Google noted.

That said, Gemma 3 270M may not be the best in its class. One of Google’s rivals, a startup called Liquid AI Inc., posted in response that the company neglected to include its LFM2-350M model, which was launched last month and achieved a 65.12% score on the same benchmark, despite only having a few more parameters.

great release, tho you forgot to include the SoTA in the chart: LFM2-350M @LiquidAI_ pic.twitter.com/n0SQWPmyWV

— Ramin Hasani (@ramin_m_h) August 14, 2025

Nonetheless, Google stressed that Gemma 3 270M is all about energy efficiency, pointing to internal tests using the INT4-quantized version of the model on a Pixel 9 Pro smartphone. It said that in 25 conversations, the model only used up 0.75% of the Pixel’s battery power.

As such, Google says Gemma 3 270M is an excellent option for developers looking to deploy on-device AI, which is often preferable for applications where privacy and offline functionality are necessary.

Accelerating offline and on-device AI

Google stressed that AI developers need to choose the right tool for the job, rather than simply focusing on model size to increase the performance of their AI applications. For workloads such as creative writing, compliance checks, entity extraction, query routing, sentiment analysis and structured text generation, it believes that Gemma 3 270M can be fine-tuned to do an effective job with much greater cost efficiency than a multibillion-parameter large language model.

In a demo video posted on YouTube, Google showed how one developer built a Bedtime Story Generator app powered by Gemma 3 270M. It’s capable of running offline in a web browser and creating original stories for kids based on the parent’s prompts:

The video demonstrates Gemma 3 270M’s ability to synthesize multiple inputs at once, so the user could specify a main character, such as a magic cat, a setting, like an enchanted forest, a theme for the story, a plot twist, such as the character finds a mysterious box with something inside, and also the length of the story. Once the user sets these parameters, Gemma 3 270M quickly generates a coherent, original story based on the user’s inputs.

It’s a great example of how quickly on-device AI is progressing, creating possibilities for new kinds of applications that don’t even need an internet connection.

Google said Gemma 3 270M can be found on Hugging Face, Docker, Kaggle, Ollama and LM Studio, with both pretrained and instruction-tuned versions available to download.

Image: Google

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.





Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleTesla upgrades EV voice assistant system with AI from DeepSeek and ByteDance
Next Article AI could have ‘human-level’ intelligence in next few years, Google DeepMind CEO says
Advanced AI Editor
  • Website

Related Posts

How Google’s Pixel 10 Pro Will Change Smartphones Forever

August 22, 2025

A Compact AI Model That Can Run on Your Phone

August 20, 2025

Fine-tuning Google, building its own

August 18, 2025

Comments are closed.

Latest Posts

Mütter Museum in Philadelphia Announces New Policy for Human Remains

Inigo Philbrick, Art Dealer Convicted of Fraud, Appears in BBC Film

Links for August 22, 2025

White House Targets Specific Artworks at Smithsonian Museums

Latest Posts

AI could have ‘human-level’ intelligence in next few years, Google DeepMind CEO says

August 24, 2025

Google’s Gemma 3 270M is a compact yet powerful AI model that can run on your toaster

August 24, 2025

Tesla upgrades EV voice assistant system with AI from DeepSeek and ByteDance

August 24, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • AI could have ‘human-level’ intelligence in next few years, Google DeepMind CEO says
  • Google’s Gemma 3 270M is a compact yet powerful AI model that can run on your toaster
  • Tesla upgrades EV voice assistant system with AI from DeepSeek and ByteDance
  • OpenAI deal could bring ChatGPT Plus to an entire country
  • OpenAI CEO Sam Altman Believes We’re in an AI Bubble

Recent Comments

  1. mpomm on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  2. KennethZet on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  3. press release. page on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. KennethZet on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. Ralphnaita on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.