Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions

Elon Musk cries antitrust as X & Grok can’t compete with OpenAI

IBM relocates thousands of employees to One Madison Ave

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
VentureBeat AI

OpenAI’s GPT-5 rollout is not going smoothly

By Advanced AI EditorAugust 8, 2025No Comments7 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now

The launch of OpenAI’s long anticipated new model, GPT-5, is off to a rocky start to say the least.

Even forgiving errors in charts and voice demoes during yesterday’s livestreamed presentation of the new model (actually four separate models, and a ‘Thinking’ mode that can be engaged for three of them), a number of user reports have emerged since GPT-5’s release showing it erring badly when solving relatively simple problems that preceding OpenAI models — and rivals from competing AI labs — answer correctly.

For example, data scientist Colin Fraser posted screenshots showing GPT-5 getting a math proof wrong (whether 8.888 repeating is equal to 9 — it is of course, not).

It also failed on a simple algebra arithmetic problem that elementary schoolers could probably nail, 5.9 = x + 5.11.

AI Scaling Hits Its Limits

Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:

Turning energy into a strategic advantage

Architecting efficient inference for real throughput gains

Unlocking competitive ROI with sustainable AI systems

Secure your spot to stay ahead: https://bit.ly/4mwGngO

Using GPT-5 to judge OpenAI’s own erroneous presentation charts also did not yield helpful or correct responses.

It also failed on this trickier math word problem below (which, to be fair, stumped this human at first…though Elon Musk’s Groq 4 AI answered it correctly. For a hint, think of the fact that flagstones in this case can’t be divided into smaller portions. They must remain in tact as 80 separate units, so no halves or quarters).

Not as good at coding as benchmarks indicate

Even though OpenAI’s internal benchmarks and some third-party external ones have shown GPT-5 to outperform all other models at coding, it appears that in real world usage, Anthropic’s recently updated Claude Opus 4.1 seems to do a better job at “one-shotting” certain tasks, that is, completing the user’s desired application or software build to their specifications. See an example below from developer Justin Sun posted to X :

Opus 4.1’s one-shot attempt at “create a 3d capybara petting zoo” – 8 minutes total

This was honestly pretty insane, not only are the capybaras way cuter and moving, there are individual pet affinity levels, a day/night switcher, feeding, and even a screenshot feature pic.twitter.com/FiKTO3FKK4

— justin (@justinsunyt) August 7, 2025

Unfortunately, OpenAI is slowly deprecating those older models — including the former default GPT-4o and the powerful reasoning model o3 — for users of ChatGPT, though they’ll continue to be available in the application programming interface (API) for developers for the foreseeable future.

In addition, a report from security firm SPLX found that OpenAI’s internal safety layer left major gaps in areas like business alignment and vulnerability to prompt injection and obfuscated logic attacks. 

While anecdotal, the checking the temperature on how the model is faring with early AI adopters seems to indicate a chilly reception.

AI influencer and former Googler Bilawal Sidhu posted a poll on X asking for a “vibe check” from his followers and the wider userbase, and so far, with 172 votes in, the overwhelming response is “Kinda mid.”

Alright, GPT-5 vibe check

— Bilawal Sidhu (@bilawalsidhu) August 7, 2025

And as the pseudonymous AI Leaks and News account wrote, “The overwhelming consensus on GPT-5 from both X and the Reddit AMA are overwhelmingly negative.”

The overwhelming consensus on GPT-5 from both X and the Reddit AMA are overwhelmingly negative

Most users are disgruntled about the broken model picker and non-pro users not having access to legacy models

What are your initial thoughts on GPT-5?

— AI Leaks and News (@AILeaksAndNews) August 8, 2025

Tibor Blaho, lead engineer at AIPRM and a popular AI leaks and news poster on X, summarized the many problems with the ChatGPT-5 rollout in an excellent post, highlighting that one of the new marquee features — an automatic “router” in ChatGPT that chooses a thinking or non-thinking mode for the underlying GPT-5 model depending on the difficulty of the query — has become one of the chief complaints, given the model seemed to default to non-thinking mode for many users.

A bit sad how the GPT-5 launch is going so far, especially after the long wait and high expectations

– The automatic switching between models (the router) seems partly broken/unreliable

– It’s unclear exactly which model you’re actually interacting with (standard or mini,…

— Tibor Blaho (@btibor91) August 8, 2025

Competition waiting in the wings

Thus, the sentiment toward ChatGPT-5 is far from universally positive, highlighting a major problem for OpenAI as it faces increasing competition from major U.S. rivals like Google and Anthropic, and a growing list of free, open source and powerful Chinese LLMs offering features that many U.S. models lack.

Take the Alibaba Qwen Team of AI researchers, who just today updated their highly performant Qwen 3 model to have 1 million token context — giving users the ability to exchange nearly 4x as much information with the model in a single back/forth interaction as GPT-5 offers.

Given OpenAI’s other big release this week — that of new open source gpt-oss models — also received a mixed reception from early users, things are not looking up for the number one dedicated AI company by users right now (700 million weekly active users of ChatGPT as of this month).

Indeed, this is also exemplified by users of the betting marketplace Polymarket overwhelmingly deciding following the release of GPT-5 that Google would likely have the best AI model by the end of this month, August 2025.

Other power users like Otherside AI co-founder and CEO Matt Schumer, who received early access to GPT-5 and blogged about it favorably in a review here, opined that views would shift as more people figured out the best ways to use the new model and adjusted their integration approaches:

A lot of folks who are having a bad experience are using GPT-5 in agent harnesses that aren’t yet optimized for it.

For every new model release, there’s a time lag between release + when companies that integrate the model have it truly working well.

Agent companies rush to…

— Matt Shumer (@mattshumer_) August 8, 2025

While it’s still early days for GPT-5 — and the sentiment could change dramatically as more users get their hands on it and try it for different tasks — the early indications are not looking like this is a “home run” release for OpenAI in the same way that prior releases such as GPT-4, or even the newer 4o and o3, were. And that’s a concerning indicator for a company that just raised yet another funding round, yet remains unprofitable due to its high costs of research and development.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.





Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleBook your exhibit table before your competitor does at Disrupt 2025
Next Article Public Safety Leads, While Healthcare And Fintech Also See Big Deals
Advanced AI Editor
  • Website

Related Posts

Study warns of security risks as ‘OS agents’ gain control of computers and phones

August 12, 2025

TD Securities taps Layer 6 and OpenAI to deliver real-time equity insights to sales and trading teams

August 11, 2025

OpenAI is editing its GPT-5 rollout on the fly

August 11, 2025

Comments are closed.

Latest Posts

Midjourney Slams Lawsuit Filed by Disney to Prevent AI Training

Smithsonian Updates Museum Display on Impeachment To Include Trump

Funder Tried to Hijack Kandinsky Art Theft Suits, Says Collector

How to Stylize Your Images with Flux Kontext in ComfyUI

Latest Posts

HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions

August 12, 2025

Elon Musk cries antitrust as X & Grok can’t compete with OpenAI

August 12, 2025

IBM relocates thousands of employees to One Madison Ave

August 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions
  • Elon Musk cries antitrust as X & Grok can’t compete with OpenAI
  • IBM relocates thousands of employees to One Madison Ave
  • Creating uniquely human digital banking experiences at TD
  • C3 AI Stock Plunges After ‘Completely Unacceptable’ Q1 Sales – C3.ai (NYSE:AI)

Recent Comments

  1. EdwardEnror on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  2. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  3. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. EdwardEnror on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.