Abstract
Transfer learning, successful in knowledge translation across related tasks, faces a substantial privacy threat from membership inference attacks (MIAs). These attacks, despite posing significant risk to ML model’s training data, remain limited-explored in transfer learning. The interaction between teacher and student models in transfer learning has not been thoroughly explored in MIAs, potentially resulting in an under-examined aspect of privacy vulnerabilities within transfer learning. In this paper, we propose a new MIA vector against transfer learning, to determine whether a specific data point was used to train the teacher model while only accessing the student model in a white-box setting. Our method delves into the intricate relationship between teacher and student models, analyzing the discrepancies in hidden layer representations between the student model and its shadow counterpart. These identified differences are then adeptly utilized to refine the shadow model’s training process and to inform membership inference decisions effectively. Our method, evaluated across four datasets in diverse transfer learning tasks, reveals that even when an attacker only has access to the student model, the teacher model’s training data remains susceptible to MIAs. We believe our work unveils the unexplored risk of membership inference in transfer learning. © 2024 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 6441-6454 |
| Journal | IEEE Transactions on Information Forensics and Security |
| Volume | 19 |
| Online published | 12 Jun 2024 |
| DOIs | |
| Publication status | Published - 2024 |
Funding
This work was supported in part by the National Key Research and Development Program of China under Grant 2021YFB2700200, in part by the National Natural Science Foundation of China under Grant 62076187, in part by the Key Research and Development Program of Hubei Province under Grant 2021BAA190 and Grant 2022BAA039, and in part by the Key Research and Development Program of Shandong Province under Grant 2022CXPT055.
Research Keywords
- Membership inference attack
- transfer learning