Aleph Alpha: New AI architecture for sovereign LLMs

By Advanced AI Editor, May 24, 2025


Common large language models (LLMs) can be adapted to new requirements through fine-tuning. According to Aleph Alpha, however, this often delivers “unsatisfactory results when adapted to new languages or highly specialized industry knowledge”. The Heidelberg-based start-up has developed a new AI architecture that aims to change this, and it is cooperating with AMD, Silo AI and Schwarz Digits on the effort.

During training, LLMs learn patterns from a tokenized version of the training texts: the texts are broken down into tokens, their structure is analyzed, and probabilities are ultimately derived from it. Once training is complete, the resulting LLM can only be adapted further through fine-tuning, which builds on the existing model. The problem arises when the text used for fine-tuning differs greatly from the text the LLM was originally trained on. Then, as Aleph Alpha writes, “it cannot be tokenized efficiently”.
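To make the tokenization problem concrete, here is a minimal sketch in Python. It is not Aleph Alpha's tokenizer or any production one: `greedy_tokenize` and `TOY_VOCAB` are hypothetical, and the greedy longest-match rule is only a simple stand-in for real subword schemes. The vocabulary is assumed to have been fitted to English text, so a Finnish sentence of similar meaning falls apart into single characters.

```python
def greedy_tokenize(text, vocab):
    """Greedy longest-match segmentation (toy stand-in for a subword tokenizer).

    Any span that is not covered by the vocabulary falls back to single characters.
    """
    tokens = []
    max_len = max(len(piece) for piece in vocab)
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if length == 1 or piece in vocab:
                tokens.append(piece)
                i += length
                break
    return tokens


def chars_per_token(text, vocab):
    """Average number of characters covered by one token (higher is more efficient)."""
    return len(text) / len(greedy_tokenize(text, vocab))


# Hypothetical vocabulary, assumed to come from English-only training text.
TOY_VOCAB = {"the", " model", " learn", "s", " pattern", " from", " text"}

english = "the model learns patterns from text"
finnish = "malli oppii kuvioita tekstistä"  # roughly the same sentence in Finnish

if __name__ == "__main__":
    for sample in (english, finnish):
        tokens = greedy_tokenize(sample, TOY_VOCAB)
        print(len(sample), "characters ->", len(tokens), "tokens")
```

In this toy setup the English sentence is covered by a handful of multi-character tokens, while the Finnish one needs one token per character. Longer token sequences mean more computation per sentence during fine-tuning, which is the inefficiency the quote refers to.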

A new tokenizer-free architecture is intended to change this. It is arranged hierarchically and combines processing at the character and word levels. The published paper states: “It uses a lightweight character-level encoder to convert character sequences into word embeddings, which are then processed by a word-level backbone model and decoded back into characters via a compact character-level decoder.”
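The data flow described in the quote can be sketched roughly as follows. This is an untrained toy in plain Python, not the architecture from the paper: `split_words`, `char_vector`, `encode_word`, `backbone` and the width `DIM` are hypothetical names, mean pooling stands in for the learned character-level encoder, and a causal running mean stands in for the word-level backbone. The character-level decoder is left out, because without trained weights it would only emit meaningless characters.

```python
import re

DIM = 4  # hypothetical embedding width, chosen only for readability


def split_words(text):
    """Split text into words, keeping leading whitespace attached to each word."""
    return re.findall(r"\s*\S+", text)


def char_vector(ch):
    """Stand-in for a learned character embedding, derived from the code point."""
    code = ord(ch)
    return [((code * (k + 1)) % 97) / 97.0 for k in range(DIM)]


def encode_word(word):
    """Character-level encoder stand-in: mean-pool character vectors into one word embedding."""
    vectors = [char_vector(ch) for ch in word]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def backbone(word_embeddings):
    """Word-level backbone stand-in: causal running mean over the word embeddings.

    Position n only sees words 1..n, as a causal language model would.
    """
    states = []
    running = [0.0] * DIM
    for n, embedding in enumerate(word_embeddings, start=1):
        running = [r + e for r, e in zip(running, embedding)]
        states.append([r / n for r in running])
    return states


if __name__ == "__main__":
    text = "Tokenizer-free models read characters"
    words = split_words(text)
    states = backbone([encode_word(word) for word in words])
    # The backbone runs over one position per word, not one per character.
    print(len(text), "characters ->", len(states), "backbone positions")
```

The point of the sketch is the sequence length: the expensive backbone sees one position per word regardless of alphabet or language, while only the small character-level stages touch individual characters. No fixed vocabulary is involved at any step.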

According to Aleph Alpha, this makes it possible to create “sovereign models for different alphabets, less common languages and highly specific industry knowledge”. Aleph Alpha speaks of a breakthrough. Until now, successful fine-tuning has required a great deal of data, and for many languages there is not enough data available to achieve good results that way. The new architecture is said to be significantly more efficient, which saves computing power and therefore resources.

AMD, Silo AI and the Schwarz Group join in

Aleph Alpha is also cooperating with AMD and Silo AI, a Finnish start-up that AMD acquired in summer 2024. According to the press release, “this new, innovative AI model architecture enables a 70 percent reduction in training costs and carbon footprint compared to alternative options for Finnish, for example.” AMD also believes that the collaboration will strengthen the European AI ecosystem.

[Figure: Comparative values for training effectiveness (Image: Aleph Alpha)]

The offering is initially aimed at European public authorities, which Aleph Alpha has been targeting as customers for some time. Its AI operating system for authorities is called Pharia. The initiative is also supported by the data centers of Stackit, the cloud solution from Schwarz Digits, which is the IT and digital division of the Schwarz Group (Lidl, Kaufland).

(emw)


This article was originally published in German.

It was translated with technical assistance and editorially reviewed before publication.
