Advanced AI News
TechRepublic

Google’s Gemini 2.5 Pro is Better at Coding, Math & Science Than Your Favourite AI Model

By Advanced AI Bot, April 1, 2025



Google has unveiled Gemini 2.5 Pro, the first in its Gemini 2.5 family. This multimodal reasoning model outperforms competitors from OpenAI, Anthropic, and DeepSeek in key benchmarks related to coding, mathematics, and science.

What are reasoning AI models?

Reasoning AIs are designed to “think before they speak.” They evaluate context, process details methodically, and fact-check responses to ensure logical accuracy — though these capabilities demand more computing power and higher operational costs.

OpenAI launched the first reasoning model, o1, in September 2024. It was a notable departure from the GPT series, which was largely focused on language generation. Since then, the major players in the AI race have responded: DeepSeek with R1, Anthropic with Claude 3.7 Sonnet, and xAI with Grok 3.

Evolving beyond ‘flash thinking’

Google launched its first reasoning AI model, Gemini 2.0 Flash Thinking, in December 2024. Marketed for its agentic capabilities, Flash Thinking was recently updated to allow file uploads and larger prompts. With the introduction of Gemini 2.5 Pro, however, Google appears to be retiring the “Thinking” label altogether.

According to Google’s announcement about Gemini 2.5, this is because reasoning capabilities will now be integrated natively across all future models. This shift marks a move toward a more unified AI architecture, rather than separating “thinking” features as standalone branding.

The new experimental model combines “a significantly enhanced base model” with “improved post-training.” Google touts its performance at the top of the LMArena leaderboard, which ranks major large language models across various tasks.


Benchmark leader in science, math, and code

Gemini 2.5 Pro excels in academic reasoning benchmarks, scoring 86.7% on AIME 2025 (mathematics) and 84.0% on the GPQA Diamond benchmark (science). On Humanity’s Last Exam, a broad test featuring thousands of questions across mathematics, science, and the humanities, the model leads with a score of 18.8%.

Notably, Google says these results were achieved without costly test-time techniques such as majority voting, in which a model is sampled several times per question and the most common answer is kept.
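To illustrate why test-time techniques raise cost, here is a minimal sketch of majority voting, one well-known technique of this kind: the model is sampled several times per question and the most common answer is kept, so each question costs several model calls instead of one. The function name and the sample answers are hypothetical and stand in for real model outputs; this is not a description of how any vendor runs its evaluations.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most common answer among several sampled responses."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    # most_common(1) returns a list with one (answer, count) pair.
    winner, _count = Counter(answers).most_common(1)[0]
    return winner


# Hypothetical answers from five independent samples of the same question.
samples = ["42", "41", "42", "42", "17"]
print(majority_vote(samples))  # 42
```

A score reported without such techniques reflects a single attempt per question, which is closer to what a user sees in ordinary use.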

In software development benchmarks, Gemini 2.5 Pro’s performance is mixed. It scored 68.6% on the Aider Polyglot benchmark for code editing, outperforming most top-tier models. However, it scored 63.8% on SWE-bench Verified, placing second to Claude 3.7 Sonnet in broader programming tasks.

Despite this, Google says Gemini 2.5 Pro “excels at creating visually compelling web apps and agentic code applications,” as evidenced by its ability to create a video game from a single prompt.

The model supports a context window of one million tokens, meaning it can process the equivalent of a 750,000-word prompt, or the first six Harry Potter books. Google plans to increase this threshold to two million tokens in due course.
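The token-to-word conversion above can be sketched as simple arithmetic. This is a rough estimate only: the ratio of 0.75 words per token is an assumption taken from the article's own figures (750,000 words per one million tokens), and real ratios vary by tokenizer, language, and content. The function names are hypothetical.

```python
# Assumption: about 0.75 English words per token, the ratio implied by
# the article's figures (750,000 words / 1,000,000 tokens).
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens, words_per_token=WORDS_PER_TOKEN):
    """Estimate how many English words fit in a given token budget."""
    return round(tokens * words_per_token)


def fits_in_context(word_count, context_tokens, words_per_token=WORDS_PER_TOKEN):
    """Check whether a text of word_count words fits the estimated budget."""
    return word_count <= tokens_to_words(context_tokens, words_per_token)


print(tokens_to_words(1_000_000))  # 750000
print(tokens_to_words(2_000_000))  # 1500000
```

Under the same assumption, the planned two-million-token window would hold roughly 1.5 million words.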

Gemini 2.5 Pro is currently available through the Gemini Advanced app, which requires a $20-a-month subscription, and to developers and enterprises through Google AI Studio. In the coming weeks, Gemini 2.5 Pro will be made available on Vertex AI, Google’s machine-learning platform for developers, and pricing details for different rate limits will also be introduced.


