[2408.02288] Spin Glass Model Of In-context Learning

[Submitted on 5 Aug 2024 (v1), last revised 18 Apr 2025 (this version, v3)]

View a PDF of the paper titled Spin glass model of in-context learning, by Yuhao Li and 2 other authors

View PDF
HTML (experimental)

Abstract:Large language models show a surprising in-context learning ability — being able to use a prompt to form a prediction for a query, yet without additional training, in stark contrast to old-fashioned supervised learning. Providing a mechanistic interpretation and linking the empirical phenomenon to physics are thus challenging and remain unsolved. We study a simple yet expressive transformer with linear attention and map this structure to a spin glass model with real-valued spins, where the couplings and fields explain the intrinsic disorder in data. The spin glass model explains how the weight parameters interact with each other during pre-training, and further clarifies why an unseen function can be predicted by providing only a prompt yet without further training. Our theory reveals that for single-instance learning, increasing the task diversity leads to the emergence of in-context learning, by allowing the Boltzmann distribution to converge to a unique correct solution of weight parameters. Therefore the pre-trained transformer displays a prediction power in a novel prompt setting. The proposed analytically tractable model thus offers a promising avenue for thinking about how to interpret many intriguing but puzzling properties of large language models.

Submission history

From: Haiping Huang [view email]
[v1]
Mon, 5 Aug 2024 07:54:01 UTC (1,087 KB)
[v2]
Wed, 13 Nov 2024 07:13:36 UTC (1,894 KB)
[v3]
Fri, 18 Apr 2025 08:16:22 UTC (970 KB)

Source link

7 Comments

swot-analiz-692 on September 4, 2025 2:58 am

swot анализ бизнеса https://swot-analiz1.ru
russkoe-porno-789 on September 7, 2025 10:07 pm

порно русские милфы русское порно сиськи
porno-849 on September 7, 2025 10:40 pm

Want to have fun? porno girl Watch porn, buy heroin or ecstasy. Pick up whores or buy marijuana. Come in, we’re waiting
promocod-iherb-964 on September 7, 2025 10:52 pm

Новые актуальные промокод iherb на заказ для выгодных покупок! Скидки на витамины, БАДы, косметику и товары для здоровья. Экономьте до 30% на заказах, используйте проверенные купоны и наслаждайтесь выгодным шопингом.
kursovaya-rabota-767 on September 12, 2025 10:24 pm

заказ диплома курсовой где заказать курсовую работу
zaym onlayn 882 on September 12, 2025 11:09 pm

взять займы онлайн без карты быстрый займ онлайн
onlayn zaym 977 on September 12, 2025 11:13 pm

займ на карту онлайн мгновенно взять займ онлайн без

What's Hot

New MIT Tech Sees Underwater As if the Water Weren’t There

AI fuels false claims after Charlie Kirk’s death, CBS News analysis reveals

Google is a ‘bad actor’ says People CEO, accusing the company of stealing content

[2408.02288] Spin glass model of in-context learning

LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents

From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection

VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots

7 Comments

Ohio Auction of Two Paintings Looted By Nazis Halted By Foundation

Lee Ufan Painting at Center of Bribery Investigation in Korea

Nicholas Galanin Pulls Out of Smithsonian Event, Claiming Censorship

Two More Staffers Fired from Kennedy Center after Trump Takeover

New MIT Tech Sees Underwater As if the Water Weren’t There

AI fuels false claims after Charlie Kirk’s death, CBS News analysis reveals

Google is a ‘bad actor’ says People CEO, accusing the company of stealing content

What's Hot

[2408.02288] Spin glass model of in-context learning

Submission history

Related Posts

7 Comments

Subscribe to Updates