Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Skip the AI ‘bake-off’ and build autonomous agents: Lessons from Intuit and Amex

SaaS is in the past. The future belongs to agents, says Narada AI’s CEO.

TU Wien Rendering #30 – Dispersion and Spectral Rendering

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
OpenAI

OpenAI Wants to be a ‘24/7 World-Class Doctor’ in Your Pocket

By Advanced AI EditorMay 13, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


OpenAI is making a serious push into the healthcare sector, with the release of a new benchmark called HealthBench, designed to evaluate the capabilities of AI systems in health. 

The benchmark aims to help large language models (LLMs) support patients and clinicians with health discussions that are trustworthy, meaningful, and open to continuous improvement. HealthBench looks at seven key areas, including emergency care, managing uncertainty, and global health.

“What if you had a world-class doctor in your pocket, 24/7, at no cost? That’s the promise of AI in healthcare, but mistakes can be catastrophic. That’s why OpenAI launched HealthBench, a new benchmark to test how well AI models handle real, complex medical conversations,”  Matthew Berman, CEO of Forward Future, wrote on X. 

Developed in partnership with 262 physicians from 60 countries, HealthBench includes 5,000 realistic health-related conversations, each paired with a custom physician-created rubric for grading model responses.

OpenAI shared in its blog that it used HealthBench to evaluate how well its latest models perform on healthcare tasks. According to the company, recent models have improved quickly, with o3 outperforming others, including Claude 3.7 Sonnet and Gemini 2.5 Pro (March 2025 version) in the tests.

OpenAI also mentioned that small models have gotten much better lately. GPT‑4.1 nano, for example, beats the August 2024 GPT‑4o model—even though it’s 25 times less expensive.

Compared to written responses from doctors, LLMs were found to write better answers for many of the instances. By April this year, the newest models had reached a point where physician responses no longer improved the quality of the answers.

Online, many users have shared stories of how ChatGPT helped them make sense of complicated health problems, ranging from chronic back pain to unexplained jaw issues.

“I’ve had half a dozen healthcare-related issues in my family over the last few months, and ChatGPT has been more helpful than the physician…,” said Joe Flaherty, a former Wired staff writer, in a post on X.

“ChatGPT outperforms human doctors for me. It diagnosed a condition I have and recommended the correct treatment after two human specialists failed. Perfect use-case for LLMs as it requires knowledge & pattern matching,” another user said on X. 

However, experts warn of the over-dependence on AI. “Using artificial intelligence for diagnosis and even for prescriptions, one has to be really cautious, because physical examination is missing,” Dr CN Manjunath, senior cardiologist and director of the Sri Jayadeva Institute of Cardiovascular Sciences and Research, Bengaluru, told AIM in an earlier interaction.

He further emphasised that, despite the widespread use of technology in healthcare, physical evaluation remains a cornerstone of accurate diagnosis. Though medications may alleviate symptoms, he advised always following up with a qualified medical practitioner for comprehensive care. He explained that once a particular diagnosis has been made, patients can follow up with ChatGPT.

OpenAI’s growing interest in healthcare is reflected in its job openings, which include roles such as health AI research engineer and healthcare software engineer.

This development comes against the backdrop of OpenAI appointing Fidji Simo as the CEO of applications, allowing Sam Altman to focus more on research, compute, and safety. Time and time again, Altman has reiterated that he is most excited about scientific discoveries with the help of AI. 

“I’m personally most excited about AI for science at this point. I’m a big believer that the most important driver of the world and people’s lives getting better and better is new scientific discovery,” said Altman in a recent TED talk. He added that they hear from scientists about how the latest AI models have been making them more productive and impacting what they are able to discover.

“I deeply believe that AGI can extend human life by broadening trustworthy access to care and accelerating longevity research,” said Karina Nguyen, researcher at OpenAI, in a post on X. 

Even Bryan Johnson, known for his radical approach to longevity and anti-ageing, weighed in on OpenAI’s development. He pointed out that AI-assisted physicians had outperformed human physicians without reference materials, adding that by April, the responses were so strong that physicians could no longer improve them.

Google is Stepping Up in Healthcare AI

OpenAI is not alone in focusing on healthcare. Google recently launched TxGemma, a new suite of open-source language models built to support therapeutic development. The models are intended to improve tasks such as drug candidate assessment, molecule property prediction, and clinical trial outcome estimation by applying LLM capabilities to biomedical data.

In 2024, Google developed Med-Gemini, a next-generation set of healthcare models that combine Gemini’s advanced multimodal and reasoning capabilities by fine-tuning on de-identified medical data.

To support care providers, Google, in 2023, introduced MedLM and Search for Healthcare. These are built to handle medical queries and are available on the Google Cloud Vertex AI platform. They help clinicians make better-informed decisions and enable patients to receive more accurate and personalised care.

Anthropic chief Dario Amodei, a rival of OpenAI, has also expressed excitement about AI’s potential in biology. “I’m optimistic that diseases which have plagued us for thousands of years—such as cancer, Alzheimer’s, and ageing itself—may be treatable,” he said. 

In his recent essay ‘Machines of Loving Grace’, Amodei outlined a future in which AI could “double our lifespans, cure all diseases, and create untold global economic wealth”.Anthropic recently launched the AI for Science Program to support scientific research and discovery by giving researchers access to its API. The program offers free API credits for high-impact projects, with a focus on biology and life sciences.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleTesla is building Cortex 2.0 supercomputer facility in Giga Texas
Next Article Google Launches AI Futures Fund to Support Next Wave of AI Startups
Advanced AI Editor
  • Website

Related Posts

OpenAI Is Trying to Reset

July 10, 2025

OpenAI’s open language model is imminent

July 9, 2025

Ex-OpenAI Exec Mira Murati’s New Startup Offers $500,000 Salaries As Meta And Sam Altman Fight To Keep AI Talent

July 9, 2025
Leave A Reply

Latest Posts

Is the Summer Group Show Dead or are Galleries Are Getting Smarter?

Adam Lindemann to Close Venus Over Manhattan After 14 Years

Ed Sheeran Is Ripping Off Jackson Pollock with His Paintings

Crystal Bridges and Art Bridges Acquire 90 Works of Contemporary Native Art

Latest Posts

Skip the AI ‘bake-off’ and build autonomous agents: Lessons from Intuit and Amex

July 10, 2025

SaaS is in the past. The future belongs to agents, says Narada AI’s CEO.

July 10, 2025

TU Wien Rendering #30 – Dispersion and Spectral Rendering

July 10, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Skip the AI ‘bake-off’ and build autonomous agents: Lessons from Intuit and Amex
  • SaaS is in the past. The future belongs to agents, says Narada AI’s CEO.
  • TU Wien Rendering #30 – Dispersion and Spectral Rendering
  • Robot Surgeon Executes Key Phase of Surgery Without Human Assistance
  • Configure fine-grained access to Amazon Bedrock models using Amazon SageMaker Unified Studio

Recent Comments

  1. "oppna binance-konto on Trump crypto czar Sacks stablecoin bill unlock trillions for Treasury
  2. Account binance on itel debuts CITY series with CITY 100 new model: A stylish, durable & DeepSeek AI-powered smartphone for Gen Z

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.