View a PDF of the paper titled FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint, by Shuo Shao and 6 other authors
View PDF
HTML (experimental)
Abstract:Model fingerprinting is a widely adopted approach to safeguard the copyright of open-source models by detecting and preventing their unauthorized reuse without modifying the protected model. However, in this paper, we reveal that existing fingerprinting methods are vulnerable to false claim attacks where adversaries falsely assert ownership of third-party non-reused models. We find that this vulnerability mostly stems from their untargeted nature, where they generally compare the outputs of given samples on different models instead of the similarities to specific references. Motivated by this finding, we propose a targeted fingerprinting paradigm (i.e., FIT-Print) to counteract false claim attacks. Specifically, FIT-Print transforms the fingerprint into a targeted signature via optimization. Building on the principles of FIT-Print, we develop bit-wise and list-wise black-box model fingerprinting methods, i.e., FIT-ModelDiff and FIT-LIME, which exploit the distance between model outputs and the feature attribution of specific samples as the fingerprint, respectively. Experiments on benchmark models and datasets verify the effectiveness, conferrability, and resistance to false claim attacks of our FIT-Print.
Submission history
From: Shuo Shao [view email]
[v1]
Sun, 26 Jan 2025 13:00:58 UTC (860 KB)
[v2]
Fri, 23 May 2025 07:19:40 UTC (842 KB)