WaterDrum: Watermarking for Data-centric Unlearning Metric

Large language model (LLM) unlearning is critical in real-world applications
where it is necessary to efficiently remove the influence of private,
copyrighted, or harmful data from some users. However, existing utility-centric
unlearning metrics (based on model utility) may fail to accurately evaluate the
extent of unlearning in realistic settings, such as when (a) the forget and
retain sets have semantically similar content, (b) retraining the model from
scratch on the retain set is impractical, and/or (c) the model owner can
improve the unlearning metric without directly performing unlearning on the
LLM. This paper presents the first data-centric unlearning metric for LLMs
called WaterDrum, which exploits robust text watermarking to overcome these
limitations. We also introduce new benchmark datasets for LLM unlearning that
contain varying levels of similar data points and can be used to rigorously
evaluate unlearning algorithms using WaterDrum. Our code is available at
https://github.com/lululu008/WaterDrum and our new benchmark datasets are
released at https://huggingface.co/datasets/Glow-AI/WaterDrum-Ax.
