As large language models (LLMs) scale, the question is not only how large
they become, but how much of their capacity is effectively utilized. Existing
scaling laws relate model size to loss, yet overlook how effectively individual
components exploit their latent space. We study feed-forward networks (FFNs)
and recast width
selection as a spectral utilization problem. Using a lightweight diagnostic
suite — Hard Rank (participation ratio), Soft Rank (Shannon rank), Spectral
Concentration, and the composite Spectral Utilization Index (SUI) — we
quantify how many latent directions are meaningfully activated across LLaMA,
GPT-2, and nGPT families. Our key finding is an asymmetric spectral scaling
law: soft rank follows a near-perfect power law in FFN width, whereas hard
rank grows only sublinearly and with high variance. This asymmetry suggests
that widening FFNs mostly adds low-energy tail directions, while dominant-mode
subspaces saturate early. Moreover, at larger widths, variance further
collapses into a narrow subspace, leaving much of the latent space
under-utilized. These results reframe FFN width selection as a principled
trade-off between tail capacity and dominant-mode capacity, offering concrete
guidance for inference-efficient LLM design.
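As a concrete illustration of the diagnostic suite named above, the following is a minimal NumPy sketch. It assumes the standard participation-ratio definition of hard rank and the Shannon-entropy (effective-rank) definition of soft rank, computed from the singular values of an FFN activation matrix; the `spectral_concentration` (top-k energy fraction) and `sui` (soft rank normalized by width) terms are illustrative placeholders, not the paper's exact composite formulas.

```python
import numpy as np

def spectral_diagnostics(activations: np.ndarray, top_k: int = 8) -> dict:
    """Spectral utilization diagnostics for an FFN activation matrix.

    activations: (num_tokens, ffn_width) matrix of hidden activations.
    Hard/soft rank use the standard participation-ratio and Shannon-entropy
    definitions; the concentration and SUI terms below are illustrative
    stand-ins, not necessarily the paper's exact formulas.
    """
    # Singular values of the centered activation matrix.
    s = np.linalg.svd(activations - activations.mean(axis=0), compute_uv=False)
    energy = s ** 2
    p = energy / energy.sum()  # normalized spectrum

    # Hard rank: participation ratio of the spectrum.
    hard_rank = energy.sum() ** 2 / (energy ** 2).sum()
    # Soft rank: exponential of the Shannon entropy of the spectrum.
    soft_rank = np.exp(-(p * np.log(p + 1e-12)).sum())
    # Assumed definition: fraction of variance in the top-k directions.
    concentration = np.sort(p)[::-1][:top_k].sum()
    # Illustrative composite index: utilized directions relative to width.
    width = activations.shape[1]
    sui = soft_rank / width

    return {
        "hard_rank": hard_rank,
        "soft_rank": soft_rank,
        "spectral_concentration": concentration,
        "sui": sui,
    }

# Example: diagnostics for a random activation batch at FFN width 4096.
if __name__ == "__main__":
    acts = np.random.randn(2048, 4096).astype(np.float32)
    print(spectral_diagnostics(acts))
```

In this sketch, a hard rank far below the soft rank at a given width would indicate the asymmetry described above: many weakly activated tail directions but a small dominant-mode subspace.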