Advanced AI News

How OpenAI Trained for Its Big Coding Victory

By Advanced AI Editor | October 3, 2025



OpenAI and Google DeepMind stood out at the International Collegiate Programming Contest world finals in Baku, Azerbaijan, last week, with OpenAI’s models solving all 12 of the problems at the world’s top university coding competition.

In attendance was OpenAI Research Lead Ahmed El-Kishky, who spoke to our colleagues at The Neuron in a podcast interview about the win and what OpenAI learned from the competition. 

OpenAI’s preparation for the contest 

Generative AI has made significant progress in competitive math and programming over the last year. Earlier models, such as GPT-4, did so poorly that their attempts would crash the sandbox computer used in the competitions, El-Kishky said.

This year, “people were nervous” on the small OpenAI team that traveled to Azerbaijan, El-Kishky noted. 

Still, circumstances were very different from those in 2024. The researchers had pre-trained the models they would use for the competition, including those based on reinforcement learning.

A week before the event, the team did a dry run, “spending long nights trying to make sure everything’s working,” El-Kishky said. 

Two models tackled the problems: GPT-5 and an experimental reasoning model that is not publicly available.  

Testing GPT-5 against a new reasoning model

“In this case, we knew that GPT-5 can go a long way, but we wanted to not just limit it there,” said El-Kishky. “We wanted to also test how well reasoning models perform. So we tried out both.” 

Both models attempted the problems multiple times, and the experimental reasoning model decided which solution to submit. GPT-5 was faster, but the experimental model was more powerful. GPT-5 couldn’t complete the last problem, but the experimental model solved it on its second attempt.
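
For readers who want a concrete picture of the "several attempts, one submission" pattern described above, here is a minimal Python sketch. It is an illustration only, not OpenAI's pipeline: the `Candidate` class, the `collect_candidates` and `pick_submission` functions, and the idea of a numeric selector score are all hypothetical names and assumptions introduced for this example. The article says only that the models made multiple attempts and that the experimental model chose what to submit.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Candidate:
    source: str   # hypothetical label for the model that produced the attempt
    code: str     # the candidate solution text
    score: float  # hypothetical selector score; higher means "prefer this one"


def collect_candidates(
    problem: str,
    generators: Dict[str, Callable[[str, int], str]],
    score: Callable[[str, str], float],
    attempts: int = 3,
) -> List[Candidate]:
    """Ask every generator for `attempts` solutions and score each one."""
    candidates: List[Candidate] = []
    for name, generate in generators.items():
        for attempt in range(attempts):
            code = generate(problem, attempt)
            candidates.append(Candidate(name, code, score(problem, code)))
    return candidates


def pick_submission(candidates: List[Candidate]) -> Optional[Candidate]:
    """Return the highest-scored candidate, or None if there are none."""
    if not candidates:
        return None
    # On ties, max() keeps the first candidate it encountered.
    return max(candidates, key=lambda c: c.score)
```

In this sketch, the generators stand in for the two models and the `score` callable stands in for whatever judgment the selecting model applies. How that judgment actually worked at the contest is not described in the interview.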

The competition was an interesting example of problems that are difficult for humans but not so difficult for an AI — and vice versa, El-Kishky said. 

It also showed how far AI reasoning capabilities have come. Reasoning models and models with reinforcement learning work better because of their similarity to human problem-solving, El-Kishky explained.

“When you have a difficult problem, no human just immediately gives an answer,” he said. “If I asked you to multiply a four-digit number by a five-digit number, you wouldn’t give me an answer instantly.” 

The breakthrough was reinforcement learning, or the ability to “mimic how a human thinks through these problems. They try things out, and maybe they go down wrong directions to dead ends, they course correct,” El-Kishky said. 

Programming competitions are a chance to benchmark AI progress 

True to OpenAI’s mission statement, El-Kishky said the real purpose of AI models is to increase human knowledge, not win programming contests.

“The competitions are not really an end goal in themselves,” El-Kishky said. “They’re just sort of a way to benchmark progress.” 

The experimental model used in the competition won’t be released to the public, but the research behind it may one day be applied to other models and, eventually, to ChatGPT. 

Teaching models to code autonomously is an important research topic for the company as a whole, El-Kishky noted. OpenAI aims to develop AI agents that can operate independently for days, weeks, or months. 

More from El-Kishky 

Check out The Neuron’s full interview with El-Kishky, which includes what it’s like to work at OpenAI and details about other competitions employees have participated in.

Spotlight on another episode of The Neuron podcast: Microsoft’s AI chief Mustafa Suleyman outlines his vision for the future of artificial intelligence and the responsibilities that come with it. His remarks could shape industry expectations on innovation and governance.


