Paper page - How new data permeates LLM knowledge and how to dilute it

Large language models learn and continually learn through the accumulation of
gradient-based updates, but how individual pieces of new information affect
existing knowledge, leading to both beneficial generalization and problematic
hallucination, remains poorly understood. We demonstrate that when learning new
information, LLMs exhibit a “priming” effect: learning a new fact can cause the
model to inappropriately apply that knowledge in unrelated contexts. To
systematically study this phenomenon, we introduce “Outlandish,” a carefully
curated dataset of 1320 diverse text samples designed to probe how new
knowledge permeates through an LLM’s existing knowledge base. Using this
dataset, we show that the degree of priming after learning new information can
be predicted by measuring the token probability of key words before learning.
This relationship holds robustly across different model architectures (PALM-2,
Gemma, Llama), sizes, and training stages. Finally, we develop two novel
techniques to modulate how new knowledge affects existing model behavior: (1) a
“stepping-stone” text augmentation strategy and (2) an “ignore-k” update
pruning method. These approaches reduce undesirable priming effects by 50-95\%
while preserving the model’s ability to learn new information. Our findings
provide both empirical insights into how LLMs learn and practical tools for
improving the specificity of knowledge insertion in language models. Further
materials: https://sunchipsster1.github.io/projects/outlandish/

Source link

What's Hot

Meta’s AI spending spree is Wall Street’s focus in second-quarter earnings – NBC New York

Nvidia CEO cashes out shares—Is it time to rethink your position?

As AI Throws Education Into Chaos, OpenAI Introduces ‘Study Mode’ to Help Students ‘Learn’

Paper page – How new data permeates LLM knowledge and how to dilute it

Paper page – Music Arena: Live Evaluation for Text-to-Music

Paper page – Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI

Discovering and using Spelke segments

Artlogic, ArtCloud Merge in Bid to Shape Art World’s Digital Backbone

John Roberts Prevented Firing of National Portrait Gallery Director

At Comic-Con, George Lucas Previews Forthcoming Lucas Museum

Betye Saar Assembles an All-Star Group to Steward Her Legacy

Meta’s AI spending spree is Wall Street’s focus in second-quarter earnings – NBC New York

Nvidia CEO cashes out shares—Is it time to rethink your position?

As AI Throws Education Into Chaos, OpenAI Introduces ‘Study Mode’ to Help Students ‘Learn’

What's Hot

Paper page – How new data permeates LLM knowledge and how to dilute it

Related Posts

Subscribe to Updates