arXiv AI

Unveiling Fine-Tuning Dynamics for LLM Selection

By Advanced AI EditorMay 30, 2025No Comments2 Mins Read

[Submitted on 1 May 2025 (v1), last revised 29 May 2025 (this version, v2)]

View a PDF of the paper titled LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection, by Xinyue Zeng and 5 other authors

View PDF
HTML (experimental)

Abstract:The proliferation of open-sourced Large Language Models (LLMs) and diverse downstream tasks necessitates efficient model selection, given the impracticality of fine-tuning all candidates due to computational constraints. Despite the recent advances in LLM selection, a fundamental research question largely remains nascent: how can we model the dynamic behaviors of LLMs during fine-tuning, thereby enhancing our understanding of their generalization performance across diverse downstream tasks? In this work, we propose a novel theoretical framework that provides a proper lens to assess the generalization capabilities of LLMs, thereby enabling accurate and efficient LLM selection for downstream applications. In particular, we first derive a PAC-Bayesian Generalization Bound that unveils fine-tuning dynamics of LLMs and then introduce LENSLLM, a Neural Tangent Kernel (NTK)-based Rectified Scaling Model that enables accurate performance predictions across diverse tasks while maintaining computational efficiency. Extensive empirical results on 3 large-scale benchmarks demonstrate that our model achieves up to 91.1% accuracy and reduces up to 88.5% computational cost in LLM selection, outperforming 5 state-of-the-art methods. We open-source our proposed LENSLLM model and corresponding results at this http URL.

Submission history

From: Xinyue Zeng [view email]
[v1]
Thu, 1 May 2025 15:07:32 UTC (3,192 KB)
[v2]
Thu, 29 May 2025 14:15:49 UTC (1,794 KB)

Previous ArticleStanford HAI’s 2025 AI Index Reveals Record Growth in AI Capabilities, Investment, and Regulation

Next Article Changes to Google Search, Claude Gets a Voice

Advanced AI Editor

Leave A Reply