Advanced AI News

Mistral AI debuts new Magistral series of reasoning LLMs

By Advanced AI Editor, June 10, 2025


Mistral AI SAS today introduced Magistral, a new lineup of reasoning-optimized large language models.

The LLM series includes two models at launch. The first, Magistral Small, is available under an open-source license and features 24 billion parameters. It’s joined by a more capable, proprietary model called Magistral Medium that will be available through Mistral AI’s cloud services.

Mistral AI is a Paris-based OpenAI competitor backed by more than $1 billion in funding. Alongside the newly launched reasoning-optimized models, it offers general-purpose LLMs and neural networks optimized for specialized tasks such as solving math problems. The launch of Magistral comes amid rumors that the company is seeking to raise another $1 billion from investors.

The two models in the Magistral series share several features. Both understand multiple languages and ship with a chain-of-thought feature, which allows them to break down complex tasks into simpler substeps. Moreover, they can display the substeps involved in generating a prompt response, which enables users to verify its accuracy.

Magistral Medium, Mistral’s other new reasoning model, generates higher-quality output. The company compared it with Magistral Small by asking the models to solve problems from a qualifying exam for the 2024 U.S. Math Olympiad. Magistral Medium scored 73.6% with default settings and 90% with a configuration designed to boost output quality. Magistral Small scored 70.7% and 83.3%, respectively.
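The article does not say what the quality-boosting configuration is. Mistral's announcement reports the higher figures under majority voting over 64 samples, in which the model answers the same problem many times and the most frequent final answer is kept. A minimal sketch of that idea follows; the function name and the sample answers are hypothetical and serve only as illustration.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most frequent answer among several sampled completions."""
    counts = Counter(answers)
    # most_common(1) returns a one-element list of (answer, count).
    return counts.most_common(1)[0][0]


# Hypothetical final answers extracted from five samples of one problem.
samples = ["204", "204", "113", "204", "96"]
print(majority_vote(samples))
```

Voting of this kind trades extra inference cost for accuracy, which is consistent with both models scoring higher under that configuration than with default settings.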

Magistral Medium also includes speed optimizations not available in its open-source namesake. When users access the former model through Le Chat, Mistral’s chatbot service, they can activate two settings called Think mode and Flash Answers. According to Mistral, the settings allow Magistral Medium to answer prompts nearly 10 times faster than competing models.

In a paper accompanying the launch of Magistral, Mistral detailed how the LLM series was developed. The company used a popular AI training method known as reinforcement learning, or RL. “Instead of relying on existing implementations and RL traces distilled from prior models, we follow a ground up approach, relying solely on our own models and infrastructure,” Mistral researchers wrote in the paper.

The typical RL project involves two models: the LLM being trained and a so-called critic model that guides the training process by providing the LLM with feedback. According to Mistral, Magistral was trained using an RL method that removes the need for a critic model. This arrangement can improve the quality of LLM prompt responses.
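The article does not name the critic-free method. One common approach, assumed here purely for illustration, is to sample several answers to the same prompt and score each one against the average reward of its own group, so that the group average plays the role the critic's estimate would otherwise play. The function name and the reward values below are hypothetical.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled answer relative to the other answers in its group.

    The group mean stands in for the baseline a critic model would supply.
    """
    baseline = mean(rewards)
    spread = pstdev(rewards)
    # eps avoids division by zero when every answer earns the same reward.
    return [(r - baseline) / (spread + eps) for r in rewards]


# Hypothetical rewards for four answers to one prompt (1.0 = correct).
rewards = [1.0, 0.0, 0.0, 1.0]
print(group_relative_advantages(rewards))
```

Answers that beat their group's average receive a positive advantage and are reinforced; answers below it receive a negative one. Mistral's actual training recipe may differ in its normalization and other details.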

Mistral developed programs called generators and verifiers to manage the training process. Magistral used the generators to answer the practice questions in its training dataset. The verifiers, in turn, checked the accuracy of the model’s answers. The generators and verifiers spread the calculations involved in the workflow across a cluster of graphics cards.
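The division of labor between answering and checking can be sketched in a few lines. Everything below is a hypothetical stand-in: a toy generator that guesses sums in place of the model, and a rule-based verifier that awards 1.0 for a correct answer. The sketch also runs sequentially on one machine rather than across a cluster of graphics cards.

```python
import random


def generate(question, rng):
    """Toy stand-in for the model: returns the sum, sometimes off by one."""
    a, b = question
    return a + b + rng.choice([0, 0, 1])


def verify(question, answer):
    """Rule-based verifier: reward 1.0 for a correct answer, else 0.0."""
    a, b = question
    return 1.0 if answer == a + b else 0.0


def collect_rewards(questions, samples_per_question=4, seed=0):
    """Generate several answers per question and have the verifier score each."""
    rng = random.Random(seed)
    results = {}
    for q in questions:
        answers = [generate(q, rng) for _ in range(samples_per_question)]
        results[q] = [verify(q, ans) for ans in answers]
    return results


print(collect_rewards([(2, 3), (10, 5)]))
```

Because the verifier checks answers against a rule rather than a learned judgment, the rewards it produces can be computed automatically for every practice question.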

During the project, Mistral created several versions of the training workflow with which it trained Magistral and compared them. The company says that the test produced several new discoveries about RL. “We contribute insights that add to, or contradict, existing RLVR literature, for example on whether RL can improve upon the distillation SFT baseline for small models,” the company’s researchers wrote.

Mistral’s first discovery was that a version of Magistral trained solely on a coding dataset proved surprisingly adept at solving math problems. The opposite was true as well, the company determined. “The model demonstrates strong performance to out-of-domain tasks, showcasing the generalization ability of RL,” Mistral’s researchers wrote. The ability to apply knowledge from one field to another is important for many reasoning tasks.

An earlier research paper observed that small models trained solely with RL can’t compete with small models distilled from larger LLMs. According to Mistral, its AI training tests showed that’s not always the case. “We achieved strong results even with pure RL,” the company’s researchers detailed.

Mistral has released the weights for Magistral Small on Hugging Face. Magistral Medium, in turn, is available through Le Chat and the company’s application programming interface for developers.

