Advanced AI News
Nvidia Scales AI Beyond The Data Center

By Advanced AI Editor, August 22, 2025

Bill Dally, Nvidia chief scientist

Nvidia

The annual HotChips conference starts this Sunday, Aug. 24, in San Francisco. Nvidia is scheduled to present six sessions covering topics of interest to AI data center users and operators, and will make several key announcements that I’ll cover in this article. (Like most AI semiconductor-related companies, Nvidia is a client of Cambrian-AI Research.)

NVLink Fusion is perhaps the most fascinating topic: it lets other CPU and accelerator vendors build chips that connect to NVLink, the company’s secret sauce for interconnecting up to 72 accelerators and 36 CPUs in a rack. I’m working on another article that specifically covers how Qualcomm is using NVLink Fusion to enter the data center with its super-fast Arm-based Oryon CPUs. Here I’ll focus on how Nvidia is enabling AI to expand beyond a single data center, and on a new 4-bit format that could improve the efficiency of training AI models by as much as four-fold.

Nvidia will present six technical sessions at this year’s HotChips conference in San Francisco.

Nvidia

Connecting Multiple Data Centers for Massive AI

As older data centers struggle to grow their AI capacity due to power constraints, many operators are looking for a way to break through the walls and distances that separate their sites and link them into a single network, delivering on the promise of AI and growing their business. Nvidia has launched a new Ethernet technology called Spectrum-XGS to enable these data centers to enter the world of giga-scale AI. This scale is needed for training large AI models, but it is increasingly also used for agentic AI and reasoning models. Nvidia claims this network can nearly double the performance of multi-site AI workloads.

Nvidia has introduced new Ethernet technology to support multi-data-center integration.

Nvidia

NVFP4: 4-Bit AI Training as Accurate as 16-Bit?

Nvidia is unusual in the industry in having a large in-house research organization, led by Bill Dally, the company’s chief scientist and senior vice president. Dr. Dally’s team has developed many of the breakthroughs that have kept Nvidia in the lead and left its competitors rushing to close its multi-year head start.

Last year at HotChips ’24, Dr. Dally said he thought there was more gold to mine in the realm of “quantization”: moving to ever-smaller data formats, each of which can double or even quadruple performance efficiency. While we may be nearing the end of that road, the new 4-bit floating-point format, NVFP4, is a pretty remarkable way to finish the story. NVFP4 will be available on all Blackwell and future Nvidia GPUs.

Nvidia has developed a new 4-Bit format for AI training that the company claims is as accurate as the 16-bit format used in nearly all AI training, enabling a four-fold increase in efficiency.

Nvidia
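To make the idea of a 4-bit floating-point format concrete, here is a minimal Python sketch of block-scaled 4-bit quantization. It is a generic illustration, not Nvidia's NVFP4 specification: the value grid, the block length of 16, and all function names are assumptions chosen for the example, and the per-block scale is kept as an ordinary Python float rather than a compact hardware format.

```python
# Illustrative block-scaled 4-bit quantization (NOT Nvidia's NVFP4 specification).
# Assumption: magnitudes snap to a small E2M1-style grid, with one scale per block.

FP4_GRID = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]  # assumed representable magnitudes
BLOCK_SIZE = 16  # assumed number of values sharing one scale


def quantize_block(block):
    """Return (scale, codes); each code is a signed value from FP4_GRID."""
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 1.0, [0.0] * len(block)
    # Map the largest magnitude in the block onto the largest grid value.
    scale = max_abs / FP4_GRID[-1]
    codes = []
    for x in block:
        target = abs(x) / scale
        nearest = min(FP4_GRID, key=lambda g: abs(g - target))
        codes.append(nearest if x >= 0 else -nearest)
    return scale, codes


def quantize(values, block_size=BLOCK_SIZE):
    """Split values into blocks and quantize each block independently."""
    blocks = [values[i:i + block_size] for i in range(0, len(values), block_size)]
    return [quantize_block(b) for b in blocks]


def dequantize(quantized):
    """Rebuild approximate values from (scale, codes) pairs."""
    out = []
    for scale, codes in quantized:
        out.extend(scale * c for c in codes)
    return out


if __name__ == "__main__":
    weights = [i / 10 - 1.6 for i in range(32)]
    restored = dequantize(quantize(weights))
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"worst-case rounding error: {worst:.4f}")
```

Each stored value needs only 4 bits (one sign bit plus one of eight magnitudes) instead of 16, which is a four-fold reduction per value before counting the per-block scale. The rounding error shown by the sketch is the price of that saving, and whether training tolerates it is exactly the accuracy question Nvidia's claim addresses.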

In another of Nvidia’s research results, the company discussed the use of speculative decoding, where the GPU drafts candidate next tokens and then uses AI (duh!) to verify whether each draft token is valid. Speculative execution has been used for decades in CPUs, and it is now increasingly being considered as a way to deploy AI more efficiently. Note that Cerebras has disputed the representation of its numbers on the graph below.

Speculative decoding uses draft models to propose potential next tokens.

Nvidia
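The draft-then-verify loop behind speculative decoding can be sketched in a few lines of Python. This is a toy, greedy version under stated assumptions, not Nvidia's implementation: `target_next` and `draft_next` are hypothetical stand-ins for a large model and a small, cheap draft model, each mapping a token list to the next token, and the verification step is written as a plain loop where real hardware would check all drafted positions in one batched pass.

```python
# Toy greedy speculative decoding. The two "models" are hypothetical stand-ins:
# plain functions that map a list of integer tokens to the next token.

def target_next(tokens):
    """Stand-in for the large, accurate model."""
    return (tokens[-1] * 3 + 1) % 7


def draft_next(tokens):
    """Stand-in for the small draft model; deliberately wrong after token 5."""
    if tokens[-1] == 5:
        return 0
    return (tokens[-1] * 3 + 1) % 7


def greedy_decode(next_token, prompt, num_new_tokens):
    """Baseline: call the model once per generated token."""
    tokens = list(prompt)
    for _ in range(num_new_tokens):
        tokens.append(next_token(tokens))
    return tokens


def speculative_decode(target, draft, prompt, num_new_tokens, k=4):
    """Draft up to k tokens, keep the prefix the target agrees with."""
    tokens = list(prompt)
    goal = len(prompt) + num_new_tokens
    while len(tokens) < goal:
        # 1. The draft model cheaply proposes a short run of tokens.
        proposed = []
        for _ in range(min(k, goal - len(tokens))):
            proposed.append(draft(tokens + proposed))
        # 2. The target model checks each proposed position in order.
        for i, tok in enumerate(proposed):
            expected = target(tokens + proposed[:i])
            if tok != expected:
                # Keep the agreed prefix, then use the target's own token.
                tokens.extend(proposed[:i])
                tokens.append(expected)
                break
        else:
            # Every proposed token was accepted.
            tokens.extend(proposed)
    return tokens


if __name__ == "__main__":
    print(speculative_decode(target_next, draft_next, [1], 10))
```

In this greedy form the output is identical to what the target model would produce on its own, so the draft model affects only speed, never the result. Production systems typically add refinements the sketch omits, such as sampling-based acceptance and a bonus token from the target when every draft token is accepted.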

Nvidia Keeps its Research Dial Turned Up to 11

I hope you can attend the many fine sessions being offered next week at HotChips. I will, at least online! This is the hottest conference every year for the geekiest of the industry, both those presenting and those in attendance. It is this sort of sharing of ideas and research results that feeds our industry and enables the USA’s leadership in semiconductors.

The Nvidia roadmap through 2028

Nvidia

Disclosures: This article expresses the opinions of the author and is not to be taken as advice to purchase from or invest in the companies mentioned. My firm, Cambrian-AI Research, is fortunate to have many semiconductor companies as our clients, including Baya Systems, BrainChip, Cadence, Cerebras Systems, D-Matrix, Esperanto, Flex, Groq, IBM, Intel, Micron, NVIDIA, Qualcomm, Graphcore, SiMa.ai, Synopsys, Tenstorrent, Ventana Microsystems, and scores of investors. I have no investment positions in any of the companies mentioned in this article. For more information, please visit our website at https://cambrian-AI.com.


