arXiv AI

Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint

By Advanced AI EditorMay 26, 2025No Comments2 Mins Read

[Submitted on 26 Jan 2025 (v1), last revised 23 May 2025 (this version, v2)]

View a PDF of the paper titled FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint, by Shuo Shao and 6 other authors

View PDF
HTML (experimental)

Abstract:Model fingerprinting is a widely adopted approach to safeguard the copyright of open-source models by detecting and preventing their unauthorized reuse without modifying the protected model. However, in this paper, we reveal that existing fingerprinting methods are vulnerable to false claim attacks where adversaries falsely assert ownership of third-party non-reused models. We find that this vulnerability mostly stems from their untargeted nature, where they generally compare the outputs of given samples on different models instead of the similarities to specific references. Motivated by this finding, we propose a targeted fingerprinting paradigm (i.e., FIT-Print) to counteract false claim attacks. Specifically, FIT-Print transforms the fingerprint into a targeted signature via optimization. Building on the principles of FIT-Print, we develop bit-wise and list-wise black-box model fingerprinting methods, i.e., FIT-ModelDiff and FIT-LIME, which exploit the distance between model outputs and the feature attribution of specific samples as the fingerprint, respectively. Experiments on benchmark models and datasets verify the effectiveness, conferrability, and resistance to false claim attacks of our FIT-Print.

Submission history

From: Shuo Shao [view email]
[v1]
Sun, 26 Jan 2025 13:00:58 UTC (860 KB)
[v2]
Fri, 23 May 2025 07:19:40 UTC (842 KB)

Previous ArticleStanford HAI’s 2025 AI Index Reveals Record Growth in AI Capabilities, Investment, and Regulation

Next Article AI Will Disrupt Jobs Within 5 Years—Start Preparing Now

Advanced AI Editor

Leave A Reply