Paper Page - Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework For LLM's Instruction-Following Capabilities

The study investigates the role of sparse computational components in the instruction-following capabilities of Large Language Models through systematic analysis and introduces HexaInst and SPARCOM for better understanding.

The finetuning of Large Language Models (LLMs) has significantly advanced
their instruction-following capabilities, yet the underlying computational
mechanisms driving these improvements remain poorly understood. This study
systematically examines how fine-tuning reconfigures LLM computations by
isolating and analyzing instruction-specific sparse components, i.e., neurons
in dense models and both neurons and experts in Mixture-of-Experts (MoE)
architectures. In particular, we introduce HexaInst, a carefully curated and
balanced instructional dataset spanning six distinct categories, and propose
SPARCOM, a novel analytical framework comprising three key contributions: (1) a
method for identifying these sparse components, (2) an evaluation of their
functional generality and uniqueness, and (3) a systematic comparison of their
alterations. Through experiments, we demonstrate functional generality,
uniqueness, and the critical role of these components in instruction execution.
By elucidating the relationship between fine-tuning-induced adaptations and
sparse computational substrates, this work provides deeper insights into how
LLMs internalize instruction-following behavior for the trustworthy LLM
community.

Source link

What's Hot

Microsoft Layoffs Continue | Recruiting News Network

Harmony Version Doubao Empowers, Multi-Modal Large Model Experience Evolves_voice_user_and

DeepSeek-R1 and Tencent Hunyuan Empower Content Platforms_the_user_models

Paper page – Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM’s Instruction-Following Capabilities

Research Paper – Takara TLDR

2D Gaussian Splatting with Semantic Alignment for Image Inpainting – Takara TLDR

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward – Takara TLDR

Long-Lost Painting By Rubens From 1613 Discovered in Paris Mansion

Ken Griffin Loves Pollock’s Blue Poles So Much He Tried to Buy it

Nan Goldin Says Her Market ‘Tanked’ Due to Palestine Activism

Sally Mann Says Her Black Men Photos Are ‘Problematic’ in Hindsight

Microsoft Layoffs Continue | Recruiting News Network

Harmony Version Doubao Empowers, Multi-Modal Large Model Experience Evolves_voice_user_and

DeepSeek-R1 and Tencent Hunyuan Empower Content Platforms_the_user_models

What's Hot

Paper page – Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM’s Instruction-Following Capabilities

Related Posts

Subscribe to Updates