Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Where C3.ai Stands With Analysts – C3.ai (NYSE:AI)

Paper page – LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

How PerformLine uses prompt engineering on Amazon Bedrock to detect compliance violations 

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
OpenAI

DeepMind and OpenAI models solve maths problems at level of top students

By Advanced AI EditorJuly 24, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


A participant holds up a gold medal won at at the 63rd International Mathematical Olympiad.

Models from OpenAI and DeepMind achieved gold medal scores in the International Mathematical Olympiad.Credit: MoiraM/Alamy

Google DeepMind announced on 21 July that its software had cracked a set of maths problems at the level of the world’s top high-school students, achieving a gold-medal score on questions from the International Mathematical Olympiad. At first sight, this marked only a marginal improvement over the prevous year’s performance. The company’s system had performed in the upper range of silver medal standard at the 2024 Olympiad, while this year it was evaluated in the lower range for a human gold medallist.

DeepMind AI crushes tough maths problems on par with top human solvers

But the grades this year hide a “big paradigm shift,” says Thang Luong, a computer scientist at DeepMind in Mountain View, California. The company achieved its previous feats using two artificial intelligence (AI) tools specifically designed to carry out rigorous logical steps in mathematical proofscalculations, called AlphaGeometry and AlphaProof. The process required human experts to first translate the problems’ statements into something similar to a programming language, and then to translate the AI’s solutions back into English.

“This year, everything is natural language, end to end,” says Luong. The team employed a large language model (LLM) called DeepThink, which is based on its Gemini system but with some additional developments that made it better and faster at producing mathematical arguments, such as handling multiple chains of thought in parallel. “For a long time, I didn’t think we could go that far with LLMs,” Luong adds.

DeepThink scored 35 out of 42 points on the 6 problems that had been given to participants in this year’s Olympiad. Under an agreement with the organizers, the computer’s solutions were marked by the same judges who evaluated the human participants.

Separately, ChatGPT creator OpenAI, based in San Francisco, California, had its own LLM solve the same Mathematical Olympiad problems at gold medal level, but had its solutions evaluated independently.

Impressive performance

For years, many AI researchers have fallen in one of two camps. Until 2012, the leading approach for was to code the rules of logical thinking into the machine by hand. Since then, neural networks — which train automatically by learning from vast troves of data — have made a series of sensational breakthroughs, and tools such as OpenAI’s ChatGPT have now entered mainstream use.

DeepMind AI solves geometry problems at star-student level

Gary Marcus, a neuroscientist at New York University (NYU) in New York City, called the results by DeepMind and OpenAI “Awfully impressive.” Marcus is an advocate of the ‘coding logic by hand’ approach — also known as neurosymbolic AI — and a frequent critic of what he sees as hype surrounding LLMs. Still, writing on Substack with NYU computer scientist Ernest Davis, he commented that “to be able to solve math problems at the level of the top 67 high school students in the world is to have really good math problem solving chops”.

It remains to be seen whether LLM superiority on IMO problems is here to stay, or if neurosymbolic AI will claw its way back to the top. “At this point the two camps still keep developing,” says Luong, who works on both approaches. “They could converge together.”



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleTrump’s order to make chatbots anti-woke is unconstitutional, senator says
Next Article Nvidia AI chips worth $1B smuggled to China after Trump export controls
Advanced AI Editor
  • Website

Related Posts

OpenAI’s most capable AI model, GPT-5, may be coming in August

July 25, 2025

Samsung has its eye on Perplexity and OpenAI as it plans to expand beyond Gemini

July 25, 2025

Japan’s Legal AI Startup Scores $50 Million Round Led By Goldman Sachs, Partners With OpenAI

July 25, 2025

Comments are closed.

Latest Posts

Auction House Will Sell Egyptian Artifact Despite Concern From Experts

Anish Kapoor Lists New York Apartment for $17.75 M.

Artist Loses Final Appeal in Case of Apologising for ‘Fishrot Scandal’

US Appeals Court Overturns $8.8 M. Trademark Judgement For Yuga Labs

Latest Posts

Where C3.ai Stands With Analysts – C3.ai (NYSE:AI)

July 25, 2025

Paper page – LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

July 25, 2025

How PerformLine uses prompt engineering on Amazon Bedrock to detect compliance violations 

July 25, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Where C3.ai Stands With Analysts – C3.ai (NYSE:AI)
  • Paper page – LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
  • How PerformLine uses prompt engineering on Amazon Bedrock to detect compliance violations 
  • OpenAI’s most capable AI model, GPT-5, may be coming in August
  • Stanford HAI says generative AI model transparency is improving, but there’s a long way to go

Recent Comments

  1. 打开Binance账户 on Tanka CEO Kisson Lin to talk AI-native startups at Sessions: AI
  2. Sign up to get 100 USDT on The Do LaB On Capturing Lightning In A Bottle
  3. binance Anmeldebonus on David Patterson: Computer Architecture and Data Storage | Lex Fridman Podcast #104
  4. nude on Brain-to-voice neuroprosthesis restores naturalistic speech
  5. Dennisemupt on Local gov’t reps say they look forward to working with Thomas

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.