Delta Activations: A Representation For Finetuned Large Language Models - Takara TLDR

The success of powerful open source Large Language Models (LLMs) has enabled
the community to create a vast collection of post-trained models adapted to
specific tasks and domains. However, navigating and understanding these models
remains challenging due to inconsistent metadata and unstructured repositories.
We introduce Delta Activations, a method to represent finetuned models as
vector embeddings by measuring shifts in their internal activations relative to
a base model. This representation allows for effective clustering by domain and
task, revealing structure in the model landscape. Delta Activations also
demonstrate desirable properties: it is robust across finetuning settings and
exhibits an additive property when finetuning datasets are mixed. In addition,
we show that Delta Activations can embed tasks via few-shot finetuning, and
further explore its use for model selection and merging. We hope Delta
Activations can facilitate the practice of reusing publicly available models.
Code is available at https://github.com/OscarXZQ/delta_activations.

Source link

What's Hot

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers – Takara TLDR

Anthropic blocks Chinese-controlled firms from Claude AI — cites ‘legal, regulatory, and security risks’

AI Futures Project: 2027 AI Forecast Report_The_years_OpenAI

Delta Activations: A Representation for Finetuned Large Language Models – Takara TLDR

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers – Takara TLDR

Towards a Unified View of Large Language Model Post-Training – Takara TLDR

DeepResearch Arena: The First Exam of LLMs’ Research Abilities via Seminar-Grounded Tasks – Takara TLDR

Tony Shafrazi and the Art of the Comeback

Basquiats Linked to 1MDB Scandal Auctioned by US Government

US Ambassador to UK Fills Residence with Impressionist Masters

New Code of Ethics Implores UK Museums to End Fossil Fuel Sponsorships

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers – Takara TLDR

Anthropic blocks Chinese-controlled firms from Claude AI — cites ‘legal, regulatory, and security risks’

AI Futures Project: 2027 AI Forecast Report_The_years_OpenAI

What's Hot

Delta Activations: A Representation for Finetuned Large Language Models – Takara TLDR

Related Posts

Subscribe to Updates