Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

TU Wien Rendering #26 – Low Discrepancy Sequences

Snapsheet And Foundation AI To Enhance Claims Document Management With New Product

Introducing Skiniglow-6 Matrix™ Collection from the Cohere Beauty Innovation Collaborative

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Meta AI Llama

Meta Lets Its Largest Llama AI Model Loose Into The Open Field

By Advanced AI EditorJuly 9, 2025No Comments7 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email



A scant three months ago, when Meta Platforms released the Llama 3 AI model in 8B and 70B versions, which correspond to the billions of parameters they can span, we asked the question we ask of every open source tool or platform since the dawn of Linux: Who’s going to profit from it and how are they going to do it?

The hyperscaler and social network put itself on the open source AI track when it launched the first Llama model in early 2023, and it has since continued to pour hundreds of millions of dollars into building better and more capable models that chief executive officer Mark Zuckerberg and other executives said rivaled the performance of the closed and proprietary models of their for-profit counterparts, who are vying to take leadership shares of a global generative AI market that could reach as high as $356 billion by 2030.

But Zuckerberg believes that the open source model is not only good for Meta but also for the world in general, saying in an open letter this week that the constraints placed on the company by Apple while building its services was a formative experience.

To quote Zuckerberg: “It’s clear that Meta and many other companies would be freed up to build much better services for people if we could build the best versions of our products and competitors were not able to constrain what we could build. On a philosophical level, this is a major reason why I believe so strongly in building open ecosystems in AI and AR/VR [augmented and virtual reality] for the next generation of computing.”

Llama 3.1 Takes On The World

His letter came in conjunction with the release of Llama 3.1 – an update that was hinted at during the release of Llama 3 – its largest large-language model (LLM) that the company says can outperform Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4 and GPT-4o models. With this latest release, Meta upgraded its 8B and 70B versions, but the focus is the 405B model, which Meta called the first “frontier-level” open source AI model. It was trained on more than 15 trillion tokens using more than 16,000 expensive and hard-to-find Nvidia H100 GPUs.

At that scale, Meta scientists said they chose to develop the 405B as a standard decoder-only transformer model architecture – though with minor adaptations – rather than a mixture-of-experts, a move aimed to ensure training stability.

“We adopted an iterative post-training procedure, where each round uses supervised fine-tuning and direct preference optimization,” the Meta researchers wrote in the announcement post. “This enabled us to create the highest quality synthetic data for each round and improve each capability’s performance.”

All three models within the 3.1 release – the 405B as well as the enhanced 8B and 70B – are getting enhancements, such as an extended context length of 128,000, up from 8,000, and support for eight languages (English, French, German, Hindi, Italian, Portuguese, Spanish, and Thai). Users in the United States can try out the 405B Llama model on WhatsApp and at meta.ai.

Open Is The Way To Go

Amid all this, Zuckerberg and Meta are continuing the open source drumbeat, with the company researchers noting that the goal is to make the models part of a larger system that can juggle multiple components, all with the plan to give developers the technologies they need to create their own custom AI tools, an idea they said was introduced last year when Meta first incorporated components that were outside of the LLMs.

Along with Llama 3.1, Meta is releasing a compete reference system and new components like Llama Guard 3 to give developers a safeguard by more easily detecting content that violated standards, detect cyberattacks, and prevent malicious code to be put out by the models. In addition, Prompt Guard helps filter out prompt injections, which threat groups use to bypass security controls in LLMs.

Meta also is looking to build out what it’s calling the “Llama Stack,” APIs that will make it easier for third-party developers to use the Llama LLMs. The company also has posted a request for comment on GitHub for suggestions on what the stack should look like.

“The implementation of components in this Llama System vision is still fragmented,” the researchers wrote. “That’s why we’ve started working with industry, startups, and the broader community to help better define the interfaces of these components. Our hope is for these to become adopted across the ecosystem, which should help with easier interoperability.”

Pulling In Partners

As with most open systems, creating a community of tech partners is a key part of Meta’s plans. With Llama 3.1, the company has more than two dozen vendors offering services, with Zuckerberg writing that “as the community grows and more companies develop new services, we can collectively make Llama the industry standard and bring the benefits of AI to everyone.”

The release of Llama 3.1 tightened Meta’s relationship with Nvidia, which will be on full display next week when Zuckerberg and Jensen Huang, the GPU maker’s chief executive officer, sit down to talk about generative AI and its use for building virtual worlds.

Nvidia released its AI Foundry for the Llama 3.1 models, which lets developers build and deploy custom AI models best suited to their specific needs using its accelerated computing and software, including DGX Cloud, foundation models, and NeMo software. There also are consulting services from the likes of Accenture, Deloitte, Infosys, and Wipro, and with DGX Cloud offering increasing capacity on such cloud services as Amazon Web Services, Microsoft Azure, Google Cloud, and Oracle Cloud Infrastructure.

In addition, organizations also can use Nvidia’s NIM inference microservices with all three Llama 3.1 models.

AWS is making the three models available in its Amazon Bedrock AI managed service and Groq is running the models on its LPU inference technology. Other tech partners include Dell, Microsoft, Google, Databricks, and Snowflake.

Such partnerships will improve the services Meta offers in its various businesses, which include Facebook, WhatsApp, and Instagram, which will only benefit the company, Zuckerberg wrote. The company needs to build up the ecosystem of integrated tools, silicon optimizations, and other components, he said, adding that “if we were the only company using Llama, this ecosystem wouldn’t develop and we’d fare no better than the closed variants of Unix.”

In addition, a rapidly evolving AI market means that Llama will have to be highly competitive and open if it is to become the industry standard. Also, selling access to AI models isn’t part of Meta’s business plan, so releasing Llama won’t hurt its revenue.

Zuckerberg compared what Meta is doing with Llama with what it did when it founded the Open Compute Project in 2011, releasing its server, storage, networking, and datacenter designs and in the course of that saving billions of dollars with its “vanity free” iron and innovative datacenters.

“We benefited from the ecosystem’s innovations by open sourcing leading tools like PyTorch, React, and many more tools,” he said. “This approach has consistently worked for us when we stick with it over the long term.”

Zuckerberg is now betting that the same approach will work in the high-stakes and highly competitive world of AI.

Sign up to our Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

Related Articles



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleIBM Aims To Simplify AI Deployment With New Chips, Servers – IBM (NYSE:IBM)
Next Article Perplexity AI proposes TikTok merger with 50% U.S. government ownership stake
Advanced AI Editor
  • Website

Related Posts

Meta debuts newest Llama AI model with help from Nvidia and cloud partners – NBC 5 Dallas-Fort Worth

July 8, 2025

Meta reports rapid growth in popularity for its Llama AI models, with nearly 350M downloads

July 8, 2025

Meta Wins Artificial Intelligence Copyright Case on Fair Use Grou

July 7, 2025

Comments are closed.

Latest Posts

Is the Summer Group Show Dead or are Galleries Are Getting Smarter?

Supreme Court Greenlights Mass Layoffs of Federal Workers Under Trump

Adam Lindemann to Close Venus Over Manhattan After 14 Years

Ed Sheeran Is Ripping Off Jackson Pollock with His Paintings

Latest Posts

TU Wien Rendering #26 – Low Discrepancy Sequences

July 10, 2025

Snapsheet And Foundation AI To Enhance Claims Document Management With New Product

July 10, 2025

Introducing Skiniglow-6 Matrix™ Collection from the Cohere Beauty Innovation Collaborative

July 10, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • TU Wien Rendering #26 – Low Discrepancy Sequences
  • Snapsheet And Foundation AI To Enhance Claims Document Management With New Product
  • Introducing Skiniglow-6 Matrix™ Collection from the Cohere Beauty Innovation Collaborative
  • Information concerning the total number of voting rights and shares in the share capital as of 31 january 2025.
  • Paper page – Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Recent Comments

  1. "oppna binance-konto on Trump crypto czar Sacks stablecoin bill unlock trillions for Treasury
  2. Account binance on itel debuts CITY series with CITY 100 new model: A stylish, durable & DeepSeek AI-powered smartphone for Gen Z

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.