Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Coinbase and Perplexity AI Unite for Live Crypto Price Access

Paper page – Beyond the Linear Separability Ceiling

Fraud detection empowered by federated learning with the Flower framework on Amazon SageMaker AI

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Alibaba Cloud (Qwen)

At LlamaCon 2025, Meta Tried to Reassert AI Leadership Against Rivals

By Advanced AI EditorMay 15, 2025No Comments7 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


On the manicured lawns outside Building 21 on Meta’s sprawling Menlo Park headquarters, live llamas meandered with languid indifference, drawing clusters of developers who momentarily abandoned technical discussions for selfies with the stoic, woolly ambassadors of Meta’s family of large language models.

Inside Building 21, I shivered. The cavernous auditorium’s air conditioning was cranked up high. Mood lighting bathed the space in Meta’s signature blue shade, and dance music blasted from speakers, lending a nightclub ambiance to the event that clashed oddly with the earnest, tech-focused agenda.

“Rise and shine!” a Meta PR person chirped as I took a seat.

This was LlamaCon, Meta’s first-ever conference for AI developers. Its timing felt oddly defensive. Earlier this year, DeepSeek, an open-source AI model from China that delivered groundbreaking performance with computational efficiency, had much of Silicon Valley, including Meta’s AI division, panicked.

Around the same time, Meta announced that it would spend $65 billion in 2025 to build out AI infrastructure. Weeks after that, the company released Llama 4, the latest version of its LLM family. Mark Zuckerberg called it “the beginning of a new era for the Llama ecosystem.” Almost immediately after, Meta was accused of artificially inflating Llama models’ performance benchmarks, a claim that executives pushed back against.

LlamaCon, I thought, was Meta’s moment to reclaim trust and clarify its AI strategy.

Carved watermelons with Llama branding

A culinary spectacle: Meta culinary line cook Ricardo J. Borjas Rodriguez’s artistic talents were unexpectedly conscripted into the company’s AI branding efforts.

Pranav Dixit



Onstage, Meta’s Chief Product Officer Chris Cox framed the company’s open source strategy as principled rather than reactive: “We were a startup once, too,” he said in the keynote. “We built this place on open source.”

The subtext was clear: Meta wants developers to see Llama as their path to autonomy and flexibility in an increasingly closed AI ecosystem dominated by offerings from OpenAI, Microsoft, and Google.

Meta Chief Product Officer Chris Cox speaks at LlamaCon 2025

Meta Chief Product Officer Chris Cox speaks at LlamaCon 2025

AP Photo/Jeff Chiu



Llama knelt to competitors

LlamaCon featured several announcements, including the launch of a new Llama API that Meta says will make it easy for developers to integrate its models using familiar tools and interfaces. Some tasks will be possible with just a few lines of code.

Meta also announced partnerships with companies to make AI run faster; a security program with AT&T and others to fight AI-generated scams; and $1.5 million in grants to startups and universities around the world using Llama.

Conspicuously absent, however, was what many developers had actually come hoping to see: a new reasoning model to compete with what has rapidly become table stakes in the AI industry, including in Chinese open-source alternatives like DeepSeek and Alibaba’s Qwen.

In a conversation with Databricks CEO Ali Ghodsi, Zuckerberg seemed to tacitly acknowledge these shortcomings.

Mood lighting bathed the space in Meta's signature shade of blue, and dance music blasted through the air, lending the event a nightclub ambiance.

At LlamaCon, mood lighting bathed the space in Meta’s signature shade of blue, and dance music blasted through the air, lending the event a nightclub ambiance.

Pranav Dixit



“Part of the value around open source is that you can mix and match,” he said. “If another model, like DeepSeek, is better, or if Qwen is better at something, then, as developers, you have the ability to take the best parts of the intelligence from different models. This is part of how I think open source basically passes in quality all the closed source [models]…[It] feels like sort of an unstoppable force.”

Related stories

Business Insider tells the innovative stories you want to know

Business Insider tells the innovative stories you want to know

Vineeth Sai Varikuntla, a developer working on medical AI applications, echoed this sentiment when I spoke with him after the keynote.

“It would be exciting if they were beating Qwen and DeepSeek,” he said. “I think they will come out with a model soon. But right now the model that they have should be on par—” he paused, reconsidering, “Qwen is ahead, way ahead of what they are doing in general use cases and reasoning.”

Missing model improvements

The online reaction to LlamaCon reflected similar disappointment across developer communities.

On Reddit’s r/LocalLLaMA, the top post was titled “No new models in LlamaCon announced.” Users compared Meta unfavorably to Qwen 3, which Alibaba strategically released just one day before Meta’s event.

“Good lord. Llama went from competitively good Open Source to just so far behind the race that I’m beginning to think Qwen and DeepSeek can’t even see it in their rear view mirror anymore,” wrote one user. Others debated whether Meta had planned to release a reasoning model but pulled back after seeing Qwen’s performance.

On Hacker News, a popular forum for developers and tech industry professionals, some criticized the event’s focus on API services and partnerships rather than model improvements as “super shallow.” And one user on Threads summed up the event simply as “kinda mid.”

When I asked Meta how they measured the success of the event, they declined to comment.

“It did seem like a bit of a marketing push for Llama,” Mahesh Sathiamoorthy, cofounder of Bespoke Labs, a Mountain View-based startup that creates AI tools for data curation and training LLMs, told me. “They wanted to cast a wider net and appeal to enterprises, but I think the technical community was looking for more substantial model improvements.”

Still, LlamaCon won praise from Wall Street analysts tracking the company’s AI strategy. “LlamaCon was one giant flex of Meta’s ambitions and successes with AI,” Mike Proulx of Forrester told me.

Jefferies analyst Brent Thill called Meta’s announcement at the event “a big step forward” to becoming a “hyperscaler, a term referring to large cloud serve providers that offer computing resources and infrastructure to businesses.

Some developers using Llama models were equally enthusiastic about the technology’s benefits. For Yevhenii Petrenko of Tavus, which creates AI-powered conversational videos, Llama’s speed was crucial. “We really care about very low latency, like very fast response, and Llama helps us use other LLMs,” he told me after the event.

Hanzla Ramey, CTO of WriteSea, an AI-powered career services platform that helps job seekers prepare résumés and practice interviews, highlighted Llama’s cost-effectiveness: “For us, cost is huge,” he told me. “We are a startup, so controlling expenses is really important. If we go with closed source, we can’t process millions of jobs. No way.”

The future’s form and function

Toward the end of the day, Zuckerberg joined Microsoft CEO Satya Nadella onstage for a wide-ranging chat about AI’s future. One comment stood out.

Llama 4, Zuckerberg explained, had been designed around Meta’s preferred infrastructure — the H100 GPU, which shaped its architecture and scale. But he acknowledged that “a lot of the open source community wants even smaller models.” Developers “just need things in different shapes,” he said.

Meta Founder and CEO Mark Zuckerberg, left, speaks with Microsoft Chairman and CEO Satya Nadella at LlamaCon 2025

Meta Founder and CEO Mark Zuckerberg, left, speaks with Microsoft Chairman and CEO Satya Nadella at LlamaCon 2025

AP Photo/Jeff Chiu



“To be able to basically take whatever intelligence you have from bigger models,” he added, “and distill them into whatever form factor you want — to be able to run on your laptop, on your phone, on whatever the thing is…to me, this is one of the most important things,” he said.

It was a candid admission. For all the pageantry, LlamaCon wasn’t a coronation. It was Meta still mid-pivot, trying to convince developers — and maybe itself — that it can build not just models, but momentum.​​​​​​​​​​​​​​​​

Have a tip? Contact this reporter via email at pdixit@insider.com or Signal at +1408-905-9124. Use a personal email address and a nonwork device; here’s our guide to sharing information securely.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleBaidu AI patent application reveals plans for turning animal sounds into words
Next Article itel Debuts CITY Series with CITY 100 New Model: A Stylish, Durable & DeepSeek AI-Powered Smartphone for Gen Z
Advanced AI Editor
  • Website

Related Posts

Alibaba, Huawei Dominate China’s Growing Cloud Market – Alibaba Gr Hldgs (NYSE:BABA)

July 11, 2025

First On-Device Agentic AI Platform Built for Mobile Devices

July 10, 2025

U.S. Officials Secretly Evaluate Chinese AI for Political Bias

July 10, 2025
Leave A Reply

Latest Posts

Homeland Security Targets Chicago’s National Museum of Puerto Rican Arts & Culture

1,600-Year-Old Tomb of Mayan City’s Founding King Discovered in Belize

Centre Pompidou Cancels Caribbean Art Show, Raising Controversy

‘Night at the Museum’ Reboot in the Works

Latest Posts

Coinbase and Perplexity AI Unite for Live Crypto Price Access

July 12, 2025

Paper page – Beyond the Linear Separability Ceiling

July 12, 2025

Fraud detection empowered by federated learning with the Flower framework on Amazon SageMaker AI

July 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Coinbase and Perplexity AI Unite for Live Crypto Price Access
  • Paper page – Beyond the Linear Separability Ceiling
  • Fraud detection empowered by federated learning with the Flower framework on Amazon SageMaker AI
  • Full STEM ahead: Parkersburg Catholic’s Helena Teltscher will fuel her passion for problem solving, engineering at MIT | News, Sports, Jobs
  • TU Wien Rendering #8 – Surface Normals

Recent Comments

  1. Compte Binance on Anthropic’s Lawyers Apologize After its Claude AI Hallucinates Legal Citation in Copyright Lawsuit
  2. Index Home on Artists Through The Eyes Of Artists’ At Pallant House Gallery
  3. código binance on Five takeaways from IBM Think 2025
  4. Dang k'y binance on Qwen 2.5 Coder and Qwen 3 Lead in Open Source LLM Over DeepSeek and Meta
  5. "oppna binance-konto on Trump crypto czar Sacks stablecoin bill unlock trillions for Treasury

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.