Advanced AI News

Humans beat AI at international math contest despite gold-level AI scores

By Advanced AI Editor, July 22, 2025


Image: mathematics. Credit: CC0 Public Domain

Humans beat generative AI models made by Google and OpenAI at a top international mathematics competition, despite the programs reaching gold-level scores for the first time.

Neither model scored full marks—unlike five young people at the International Mathematical Olympiad (IMO), a prestigious annual competition where participants must be under 20 years old.

Google said Monday that an advanced version of its Gemini chatbot had solved five out of the six math problems set at the IMO, held in Australia’s Queensland this month.

“We can confirm that Google DeepMind has reached the much-desired milestone, earning 35 out of a possible 42 points—a gold medal score,” the US tech giant cited IMO president Gregor Dolinar as saying.

“Their solutions were astonishing in many respects. IMO graders found them to be clear, precise and most of them easy to follow.”

Around 10% of human contestants won gold-level medals, and five received perfect scores of 42 points.

US ChatGPT maker OpenAI said that its experimental reasoning model had scored a gold-level 35 points on the test.

The result “achieved a longstanding grand challenge in AI” at “the world’s most prestigious math competition,” OpenAI researcher Alexander Wei wrote on social media.

“We evaluated our models on the 2025 IMO problems under the same rules as human contestants,” he said.

“For each problem, three former IMO medalists independently graded the model’s submitted proof.”

Google achieved a silver-medal score at last year’s IMO in the British city of Bath, solving four of the six problems.

That took two to three days of computation—far longer than this year, when its Gemini model solved the problems within the 4.5-hour time limit, it said.

The IMO said tech companies had “privately tested closed-source AI models on this year’s problems,” the same ones faced by 641 competing students from 112 countries.

“It is very exciting to see progress in the mathematical capabilities of AI models,” said IMO president Dolinar.

Contest organizers could not verify how much computing power had been used by the AI models or whether there had been human involvement, he cautioned.

© 2025 AFP

Citation: Humans beat AI at international math contest despite gold-level AI scores (2025, July 22), retrieved 22 July 2025 from https://phys.org/news/2025-07-humans-ai-international-math-contest.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.


