Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2022
DOI: 10.1145/3477495.3531820

Unlearning Protected User Attributes in Recommendations with Adversarial Training

Abstract: Collaborative filtering algorithms capture underlying consumption patterns, including the ones specific to particular demographics or protected information of users, e.g., gender, race, and location. These encoded biases can influence the decision of a recommendation system (RS) towards further separation of the contents provided to various demographic subgroups, and raise privacy concerns regarding the disclosure of users' protected attributes. In this work, we investigate the possibility and challenges of r…
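The adversarial training named in the title broadly works by attaching an attribute classifier to the user representation and reversing its gradients, so the encoder learns to erase the protected signal while still supporting recommendation. Below is a minimal PyTorch sketch of that idea, not the paper's exact architecture: the encoder/decoder shapes, the `AdvRecommender` class, and the unweighted loss sum are all illustrative assumptions.

```python
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; multiplies gradients by -lambda going back."""
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None

class AdvRecommender(nn.Module):
    """Hypothetical encoder/decoder recommender with an adversarial attribute head."""
    def __init__(self, n_items, dim=64, lambd=1.0):
        super().__init__()
        self.encoder = nn.Linear(n_items, dim)   # interaction history -> user embedding
        self.decoder = nn.Linear(dim, n_items)   # user embedding -> item scores
        self.adv_head = nn.Linear(dim, 2)        # user embedding -> protected attribute
        self.lambd = lambd

    def forward(self, x):
        z = torch.relu(self.encoder(x))
        scores = self.decoder(z)
        # Reversed gradients push the encoder to *remove* attribute information
        attr_logits = self.adv_head(GradReverse.apply(z, self.lambd))
        return scores, attr_logits

# One illustrative training step: reconstruction loss + adversary loss.
model = AdvRecommender(n_items=1000)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = (torch.rand(32, 1000) < 0.05).float()   # toy implicit-feedback histories
attr = torch.randint(0, 2, (32,))           # toy binary protected attribute
scores, attr_logits = model(x)
loss = nn.functional.binary_cross_entropy_with_logits(scores, x) \
     + nn.functional.cross_entropy(attr_logits, attr)
opt.zero_grad()
loss.backward()
opt.step()
```

Because gradient reversal flips the sign of the adversary's gradient inside the encoder, the adversary head is trained to predict the attribute while the encoder is simultaneously trained to make that prediction fail, which is what "unlearning" the attribute amounts to in this setting.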

Cited by 16 publications (10 citation statements) · References 26 publications

Citation statements
“…It is well-known that existing LMs trained on English text encode societal biases (Bolukbasi et al., 2016; Caliskan et al., 2017; Rekabsaz et al., 2021b) and stereotypes, and using them in downstream tasks might lead to unfair treatment of various social groups (Zerveas et al., 2022; Krieg et al., 2022; Ganhör et al., 2022; Rekabsaz et al., 2021a; Melchiorre et al., 2021; Rekabsaz and Schedl, 2020; Elazar and Goldberg, 2018). Since we propose a method to transfer the English LMs to new languages, it is highly probable that the existing biases are also transferred to the target LMs.…”
Section: Risks (mentioning)
confidence: 98%
“…Compared with the in-training setting (InT-AU), the post-training setting (PoT-AU) is more challenging. First, PoT-AU allows no interference with the training process, which means that InT-AU methods, i.e., adding a network block [24] or adversarial training [18], are not applicable. Second, even though PoT-AU cuts the connection with the training process, directly manipulating user embeddings by adding artificially designed noise, e.g., differential privacy [1], is inappropriate, because i) it inevitably degrades recommendation performance, and ii) its unlearning ability is not promising, since the functional mechanism of attacking models, including complex machine learning models, is not well understood.…”
Section: Motivation (mentioning)
confidence: 99%
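The "attacking models" referred to above are typically post-hoc probe classifiers: a classifier is trained to recover the protected attribute from the frozen user embeddings, and balanced accuracy near chance (0.5 for a binary attribute) is read as evidence that the attribute has been unlearned. Below is a minimal scikit-learn sketch with synthetic stand-in data; the variable names and the logistic-regression probe are illustrative assumptions, not any specific paper's evaluation protocol.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import balanced_accuracy_score

# Toy stand-ins: in practice these are frozen user embeddings and known labels.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(2000, 64))
gender = rng.integers(0, 2, size=2000)

X_tr, X_te, y_tr, y_te = train_test_split(
    embeddings, gender, test_size=0.25, stratify=gender, random_state=0)

# Train the attacking probe on held-out embeddings and score it.
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
bacc = balanced_accuracy_score(y_te, probe.predict(X_te))
print(f"attacker balanced accuracy: {bacc:.3f}")  # ~0.5 => attribute not recoverable
```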
“…Experiments are conducted on three publicly accessible datasets that contain both input data, i.e., user-item interactions, and user attributes, i.e., gender. Following [18], the provided gender information of the users is limited to female and male.…”
Section: Experimental Settings (mentioning)
confidence: 99%