Configurable Preference Tuning enables language models to dynamically adjust their behavior based on human-interpretable directives, using rubric-guided preference data for fine-tuning and inference-time modulation.
Models of human feedback for AI alignment, such as those underpinning Direct
Preference Optimization (DPO), often bake in a singular, static set of
preferences, limiting adaptability. This paper challenges the assumption of
monolithic preferences by introducing Configurable Preference Tuning (CPT), a
novel framework for endowing language models with the ability to dynamically
adjust their behavior based on explicit, human-interpretable directives. CPT
leverages synthetically generated preference data, conditioned on system
prompts derived from structured, fine-grained rubrics that define desired
attributes like writing style. By fine-tuning with these rubric-guided
preferences, the LLM learns to modulate its outputs at inference time in
response to the system prompt, without retraining. This approach not only
offers fine-grained control but also provides a mechanism for modeling more
nuanced and context-dependent human feedback. Several experimental artifacts,
such as training code, generated datasets, and fine-tuned models, are released
at https://github.com/vicgalle/configurable-preference-tuning
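Below is a minimal sketch of the core idea: preference pairs are conditioned on rubric-derived system prompts, with the "chosen"/"rejected" labels flipped depending on the directive, and the model is then fine-tuned with DPO. It assumes the Hugging Face `trl` library's `DPOTrainer`; the rubric texts, example responses, base model checkpoint, and variable names are illustrative placeholders rather than the released pipeline, and the exact trainer arguments vary across `trl` versions.

```python
# Sketch: rubric-conditioned preference pairs + DPO fine-tuning (illustrative only).
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Rubric-derived system prompts: each directive names the desired attribute
# (here, writing style) that the "chosen" response should satisfy.
rubrics = {
    "terse": "You are a writer who scores 10/10 on conciseness: short sentences, no filler.",
    "ornate": "You are a writer who scores 10/10 on vivid, elaborate prose with rich imagery.",
}

user_prompt = "Describe a sunset over the sea."
terse_response = "The sun sank. The sea turned copper, then grey."
ornate_response = (
    "The sun dissolved into the horizon in ribbons of vermilion and gold, "
    "gilding every restless wave before the grey of evening claimed the water."
)

# The same pair of responses is labelled in opposite directions depending on the
# system prompt, so the model learns to modulate its output from the directive.
pairs = [
    {"prompt": f"{rubrics['terse']}\n\n{user_prompt}",
     "chosen": terse_response, "rejected": ornate_response},
    {"prompt": f"{rubrics['ornate']}\n\n{user_prompt}",
     "chosen": ornate_response, "rejected": terse_response},
]
train_dataset = Dataset.from_list(pairs)

model_name = "Qwen/Qwen2-0.5B-Instruct"  # placeholder: any causal LM checkpoint
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# DPOConfig/DPOTrainer arguments differ across trl releases; recent versions
# accept a dataset with "prompt"/"chosen"/"rejected" columns as used above.
args = DPOConfig(output_dir="cpt-dpo", per_device_train_batch_size=1,
                 num_train_epochs=1, beta=0.1)
trainer = DPOTrainer(model=model, args=args, train_dataset=train_dataset,
                     processing_class=tokenizer)
trainer.train()
```

At inference time no further training is needed: the same user prompt is simply prefixed with a different rubric-derived system prompt, and the fine-tuned model shifts its behavior to match the stated directive.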