Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

UCSB to replace one-third of students with AI

ChatGPT just got smarter: OpenAI’s Study Mode helps students learn step-by-step

Nvidia AI chip challenger Groq said to be nearing new fundraising at $6B valuation 

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Lamini

Meet Sharon Zhou, the AI Founder Doing Just Fine Without Nvidia Chips

By Advanced AI EditorMay 26, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Tech CEOs with big plans for artificial intelligence spent a bunch of time scrambling around in search of Nvidia chips last year.

The Santa Clara giant’s chips, known as GPUs, became the hottest property of the generative AI boom. Figures as powerful as Mark Zuckerberg and Sam Altman raced to secure supplies of the vital computing resources needed to power apps like ChatGPT.

However, there’s one AI boss who hasn’t put herself at the mercy of Nvidia’s billionaire leader Jensen Huang, and his $2.2 trillion GPU empire. Meet Sharon Zhou.

The 30-year-old has had quite the career.

She’s the first person to major in both classics and computer science at Harvard. She received a Ph.D. in generative AI at Stanford under machine learning pioneer Andrew Ng, became an adjunct professor at the university, and has made time for online teaching and angel investing. If that wasn’t enough, she was also asked to be on the early founding team of Anthropic, the OpenAI rival that just raised an extra $2.75 billion from Amazon.

Her ambitions have taken her in a slightly different direction, however, as she’s now forging her own path forward by taking charge of an AI startup of her own.

Who needs Nvidia?

In April last year, Zhou and her cofounder Greg Diamos, based in Palo Alto, brought their new startup, Lamini AI, out of stealth. Its main ambition was to offer a platform that makes it easy for enterprises to train and create customized large language models with “just a few lines of code.”

Related stories

Business Insider tells the innovative stories you want to know

Business Insider tells the innovative stories you want to know

That could mean taking a foundation model like GPT from OpenAI and making it easy for an enterprise to fine-tune that model with its own data. “What we’re doing is making it essentially possible for every enterprise to have OpenAI’s infrastructure but in-house,” Zhou said.

An equally interesting revelation came months later, however.

In September, Zhou revealed that Lamini’s platform had been building customized LLMs with customers over the past year by exclusively using GPUs from Nvidia’s main rival, AMD, the chip giant run by Huang’s cousin, Lisa Su.

It was a big deal given that almost everyone seemed to be exclusively obsessed with H100 — GPUs that Nvidia has struggled to meet the demand of amid supply constraints. Lamini’s reveal even came with a video of Zhou teasing Nvidia about the shortage.

As Zhou acknowledges, though, it wasn’t an easy decision to look away from the thing everyone in generative AI has been desperate for. “The decision-making process was a long one,” she said. “It was not a trivial, small one.”

A few things helped the decision. For one, her cofounder Diamos played a key role in helping make the realization that GPUs other than those from Nvidia work perfectly well.

As a former Nvidia software architect, Diamos understood that while GPU hardware was vital for getting top performance out of AI models — he was, after all, the coauthor of a paper on “scaling laws” that showed the importance of computing power — software was important too.

Diamos was witness to that having worked on CUDA, the software first developed by Nvidia in the 2000s. It makes using AI models with GPUs like the H100 and Nvidia’s new Blackwell chip, as simple as a plug-and-play system.

Jensen Huang presenting at a conference, wearing a black leather jacket.

Justin Sullivan/Getty Images



So it became clear that if another company could build a similar software ecosystem around its GPUs, there’d be no reason they couldn’t compete with Nvidia. Fortunately for them, after consulting with Diamos, according to Zhou, AMD was on its way to building a rival system that they would eventually test.

“Greg and I were just jamming on things, so this has been years in the making, and then once the prototypes worked we were just like let’s just double down on this,” Zhou said.

More broadly, Zhou recognizes that businesses are so “excited to use LLMs,” but many may not want to — or simply can’t afford to — wait around for Nvidia to shore up enough supply of its GPUs to meet the demand.

It’s another reason AMD has proven so valuable to her ambitions. Thanks to its GPUs being more available, Zhou was confident that Lamini could offer “infrastructure that makes meeting that skyrocketing demand” for LLMs possible.

“This is because Lamini fully utilizes LLM compute at 10x performance and makes it possible to scale quickly without supply constraints, by offering vendor-agnostic compute options, i.e. it’s indiscernible to customers to run Lamini on Nvidia and AMD GPUs,” she explained.

A lot of people ask me what the AMD MI300 chip looks like.

Here it is, held by the incredible @LisaSu!

We’ve had many F500 enterprises and leading tech unicorns successfully get their proprietary data into an LLM. All on AMD. In production. pic.twitter.com/9vGzQlt4fE

— Sharon Zhou (@realSharonZhou) January 30, 2024

No wonder the company is ready to double down on AMD. In January, Zhou shared an image to X of the MI300X — AMD’s new chip first unveiled in December by CEO Su as the “highest performing accelerator in the world” — live in production at Lamini.

Nvidia’s Huang might be leading one of the most powerful companies in Silicon Valley now, but the competition is coming for him. Or as Zhou said of AMD: “They have a real horse in this race.”



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleGoogle claims users find ads in AI search ‘helpful’
Next Article DeepSeek has rattled the AI industry. Here’s a quick look at other Chinese AI models.
Advanced AI Editor
  • Website

Related Posts

Who is Lamini Fati, the teenaged Leganés defender set to sign for Real Madrid?

July 27, 2025

Startup backed by Dropbox and Figma debuts breakthrough tech that could solve one of the biggest AI problems — AMD’s BFF Lamini promises to cut hallucinations by 90% using mindmap-like process

June 25, 2025

AI is at an inflection point: Lamini provides LLM infrastructure for seamless onboarding

June 17, 2025
Leave A Reply

Latest Posts

John Roberts Prevented Firing of National Portrait Gallery Director

At Comic-Con, George Lucas Previews Forthcoming Lucas Museum

Betye Saar Assembles an All-Star Group to Steward Her Legacy

Picasso’s ‘Demoiselles’ May Not Have Been Inspired by African Art

Latest Posts

UCSB to replace one-third of students with AI

July 29, 2025

ChatGPT just got smarter: OpenAI’s Study Mode helps students learn step-by-step

July 29, 2025

Nvidia AI chip challenger Groq said to be nearing new fundraising at $6B valuation 

July 29, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • UCSB to replace one-third of students with AI
  • ChatGPT just got smarter: OpenAI’s Study Mode helps students learn step-by-step
  • Nvidia AI chip challenger Groq said to be nearing new fundraising at $6B valuation 
  • BidMax Launches 0% Commission AI-Powered Real Estate Service to Support South Florida Condo and Homeowner Associations | National Business
  • Apple’s Lack Of New AI Features At WWDC Is ‘Startling,’ Expert Says

Recent Comments

  1. binance on OpenAI updates its new Responses API rapidly with MCP support, GPT-4o native image gen, and more enterprise features
  2. binance kód on Anthropic closes $2.5 billion credit facility as Wall Street continues plunging money into AI boom – NBC Los Angeles
  3. 🖨 🔵 Incoming Message: 1.95 Bitcoin from exchange. Claim transfer => https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=40f06aae45d2dc14b01045540f836756& 🖨 on SFC Dialogue丨Jeffrey Sachs says he uses DeepSeek every hour_to_facts_its
  4. 📪 ✉️ Unread Notification: 1.65 BTC from user. Claim transfer >> https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=63f0a8159ef8316c31f5a9a8aca50f39& 📪 on Sean Carroll: Arrow of Time
  5. 🔋 📬 Unread Alert - 1.65 BTC from exchange. Accept funds > https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=db3ef91843302da628b83636ef7db949& 🔋 on Rohit Prasad: Amazon Alexa and Conversational AI | Lex Fridman Podcast #57

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.