In-context learning in large language models can effectively model sequences generated by hidden Markov models, achieving near-optimal predictive accuracy and revealing new scaling trends, which demonstrates its potential as a diagnostic tool for complex scientific data.
Hidden Markov Models (HMMs) are foundational tools for modeling sequential
data with latent Markovian structure, yet fitting them to real-world data
remains computationally challenging. In this work, we show that pre-trained
large language models (LLMs) can effectively model data generated by HMMs via
in-context learning (ICL), i.e., their ability to infer patterns from
examples within a prompt. On a diverse set of synthetic HMMs, LLMs achieve
predictive accuracy approaching the theoretical optimum. We uncover novel
scaling trends influenced by HMM properties, and offer theoretical conjectures
for these empirical observations. We also provide practical guidelines for
scientists on using ICL as a diagnostic tool for complex data. On real-world
animal decision-making tasks, ICL achieves performance competitive with models
designed by human experts. To our knowledge, this is the first demonstration
that ICL can learn and predict HMM-generated sequences, an
advance that deepens our understanding of in-context learning in LLMs and
establishes its potential as a powerful tool for uncovering hidden structure in
complex scientific data.
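
To illustrate the evaluation setup summarized above, the sketch below samples a small synthetic HMM and computes the theoretically optimal (forward-algorithm) next-symbol predictor against which in-context predictions would be compared. The HMM parameters, sequence length, and prompt serialization noted in the final comment are illustrative assumptions, not the exact configuration used in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Small synthetic HMM (hypothetical parameters, for illustration only) ---
n_states, n_obs = 3, 4
A = rng.dirichlet(np.ones(n_states) * 2.0, size=n_states)  # transition matrix, rows sum to 1
B = rng.dirichlet(np.ones(n_obs) * 2.0, size=n_states)     # emission matrix, rows sum to 1
pi = np.ones(n_states) / n_states                           # uniform initial state distribution

def sample_sequence(T):
    """Sample an observation sequence of length T from the HMM."""
    z = rng.choice(n_states, p=pi)
    obs = []
    for _ in range(T):
        obs.append(rng.choice(n_obs, p=B[z]))
        z = rng.choice(n_states, p=A[z])
    return np.array(obs)

def oracle_next_token_probs(obs):
    """Forward-algorithm filtering: p(x_{t+1} | x_{1:t}) under the true HMM.

    This is the theoretical optimum that in-context predictions are measured against.
    """
    preds = []
    belief = pi.copy()                 # p(z_t | x_{1:t-1}); starts as p(z_1)
    for x in obs:
        preds.append(belief @ B)       # predictive distribution over the next symbol
        belief = belief * B[:, x]      # condition on the observed symbol
        belief = belief / belief.sum()
        belief = belief @ A            # propagate the hidden state one step
    return np.array(preds)             # shape (T, n_obs)

seq = sample_sequence(512)
preds = oracle_next_token_probs(seq)
optimal_nll = -np.mean(np.log(preds[np.arange(len(seq)), seq]))
print(f"optimal per-symbol negative log-likelihood: {optimal_nll:.3f}")

# An LLM's ICL prediction would be obtained by serializing seq[:t] into a prompt
# (e.g., as space-separated symbols) and reading off its next-token probabilities;
# its per-symbol NLL can then be compared against `optimal_nll` above.
```

The oracle predictor here serves only as the reference point: the closer the LLM's in-context negative log-likelihood gets to this value, the nearer it is to the theoretical optimum for that HMM.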