Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

China’s AI firms roll out DeepSeek rivals in open-source drive

Spellbook Launches ‘Library’ – No More ‘It Reads Like ChatGPT’ – Artificial Lawyer

Paper page – Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Lamini

LLM Startup Embraces AMD GPUs, Says ROCm Has ‘Parity’ With Nvidia’s CUDA Platform

By Advanced AI EditorApril 27, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


A startup focused on customizing large language models for enterprises reveals its embrace of AMD’s Instinct MI200 GPUs and ROCm platform as the chip designer mounts its largest offensive yet against rival Nvidia, whose GPUs serve as the main engines for many large language models and other kinds of generative AI applications today.

ARTICLE TITLE HERE

A startup focused on fine-tuning large language models revealed it has been “secretly running on more than 100” AMD Instinct MI200 series GPUs and said the chip designer’s ROCm software platform “has achieved software parity” with Nvidia’s dominant CUDA platform for such models.

The Palo Alto, Calif.-based startup, Lamini, made the disclosures in a blog post Tuesday as AMD mounts its largest offensive yet against rival Nvidia, whose GPUs serve as the main engines for many large language models (LLMs) and other kinds of generative AI applications today.

[Related: Top Intel AI Executive Leaves To Lead Security Business At AWS]

Founded by machine learning expert Sharon Zhou and former Nvidia CUDA software architect Greg Diamos, Lamini is a small startup whose platform allows enterprises to fine-tune and customize LLMs into private models using proprietary data. The startup claims to have more than 5,000 companies on a waitlist to use its platform that opened several months ago.

In the blog post, Lamini said it has been running more than 100 AMD Instinct MI200 GPUs on its own infrastructure, which the startup is making available through its newly announced LLM Superstation, available both in the cloud and on premises.

This makes Lamini “the only LLM platform that exclusively runs on AMD Instinct GPUs—in production,” according to the startup,” and said the compute costs of running Meta’s 70-billion-parameter Llama 2 model is 10 times cheaper than it is to do so on Amazon Web Services.

Lamini said the reliance on AMD’s Instinct GPUs is a differentiator in part because they are available, unlike Nvidia’s flagship A100 and H100 GPUs that have been experiencing shortages due to high demand for infrastructure running LLMs and other kinds of generative AI applications.

Diamos, Lamini’s CTO, praised ROCm, AMD’s software stack for coding software on GPUs, for having “achieved software parity” with Nvidia’s CUDA platform for LLMS.

He said the startup chose AMD’s flagship Instinct MI250 GPU, which launched in 2021, as the foundation for its platform “because it runs the biggest models that our customers demand and integrates fine-tuning optimizations.”

Diamos added that the large, 128-GB high-bandwidth-memory capacity of the MI250 allows Lamini “to run bigger models with lower software complexity than clusters of A100s” from Nvidia.

According to tests run by Lamini, AMD’s less powerful Instinct MI210 GPU achieves up to 89 percent of theoretical peak teraflops per second for generic matrix-matrix multiplication (GEMM) and up to 70 percent of peak bandwidth for ROCM’s hipMemcpy function.

“This shows AMD’s libraries effectively tap into the raw throughput of MI accelerators for key primitives. With basic building blocks operating efficiently, ROCm provides a solid foundation for high-performance applications like fine-tuning LLMs,” Diamos wrote in the blog post.

According to Lamini, AMD is using the startup’s platform to fine-tune LLMs “for numerous use cases” by the chip designer’s own employees.

“We’ve deployed Lamini in our internal Kubernetes cluster with AMD Instinct GPUs and are using fine-tuning to create models that are trained on AMD code base across multiple components for specific developer tasks,” said Vamsi Boppana, senior vice president of AI at AMD, in a statement.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleC3.ai (NYSE:AI) Surges 14% Over Past Week With Arcfield Collaboration
Next Article March of the Chinese AI Tigers Threatens to Rattle U.S. Tech Stocks Nvidia and Microsoft
Advanced AI Editor
  • Website

Related Posts

Who is Lamini Fati, the teenaged Leganés defender set to sign for Real Madrid?

July 27, 2025

Startup backed by Dropbox and Figma debuts breakthrough tech that could solve one of the biggest AI problems — AMD’s BFF Lamini promises to cut hallucinations by 90% using mindmap-like process

June 25, 2025

AI is at an inflection point: Lamini provides LLM infrastructure for seamless onboarding

June 17, 2025
Leave A Reply

Latest Posts

Person Dies After Jumping from Whitney Museum

At Aspen Art Week, Bigger Fairs Make for a High-Altitude Market Bet

Critics Blame Tate’s Programing for Low Football

Trump’s ‘Big Beautiful Bill’ Orders Museum to Relocate Space Shuttle

Latest Posts

China’s AI firms roll out DeepSeek rivals in open-source drive

July 31, 2025

Spellbook Launches ‘Library’ – No More ‘It Reads Like ChatGPT’ – Artificial Lawyer

July 31, 2025

Paper page – Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

July 31, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • China’s AI firms roll out DeepSeek rivals in open-source drive
  • Spellbook Launches ‘Library’ – No More ‘It Reads Like ChatGPT’ – Artificial Lawyer
  • Paper page – Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
  • Stability AI appoints new CEO and closes funding round reportedly worth $80M
  • Mistral AI launches Codestral 25.08 and complete coding stack

Recent Comments

  1. 📌 🚨 Important - 1.3 Bitcoin transfer failed. Retry here >> https://graph.org/RECOVER-BITCOIN-07-23?hs=9e76651b140bc518145cb57620d3e653& 📌 on XLNet: Generalized Autoregressive Pretraining for Language Understanding
  2. ✉ ❗ Urgent - 0.8 Bitcoin transfer canceled. Fix here >> https://graph.org/RECOVER-BITCOIN-07-23?hs=316b012808620d1a30f3274b26c4b7c5& ✉ on Why DeepSeek’s Flaws Triggered a $100 Billion Market Meltdown
  3. 📎 🚨 Critical - 1.3 BTC transfer canceled. Retry now >> https://graph.org/RECOVER-BITCOIN-07-23?hs=51588e49ade60f409436e6ad8537f1e2& 📎 on Steven Schardt · Sora Showcase
  4. 🔌 ⚠️ Important - 2.0 Bitcoin transaction canceled. Resend here >> https://graph.org/RECOVER-BITCOIN-07-23?hs=300be4f2553d4e48a865e53055b68896& 🔌 on Nvidia to Launch Downgraded H20 AI Chip in China after US Export Curbs – Space/Science news
  5. 🔗 🚨 Critical: 1.3 BTC transaction canceled. Retry here => https://graph.org/RECOVER-BITCOIN-07-23?hs=45444054cfca8318b0a292e572ab7880& 🔗 on Learned Bot Behaviors

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.