Xuanli He scite author profile

It has been demonstrated that hidden representation learned by deep model can encode private information of the input, hence can be exploited to recover such information with reasonable accuracy. To address this issue, we propose a novel approach called Differentially Private Neural Representation (DPNR) to preserve privacy of the extracted representation from text. DPNR utilises Differential Privacy (DP) to provide formal privacy guarantee. Further, we show that masking words via dropout can further enhance privacy. To maintain utility of the learned representation, we integrate DP-noisy representation into a robust training process to derive a robust target model, which also helps for model fairness over various demographic variables. Experimental results on benchmark datasets under various parameter settings demonstrate that DPNR largely reduces privacy leakage without significantly sacrificing the main task performance.

show abstract

Privacy-Preserving Collaborative Deep Learning with Application to Human Activity Recognition

Lyu

Law

et al. 2017

View full text Add to dashboard Cite

Fog-Empowered Anomaly Detection in IoT Using Hyperellipsoidal Clustering

Lyu

Jin

Rajasegarar

et al. 2017

IEEE Internet Things J.

104

View full text Add to dashboard Cite

Fog-Embedded Deep Learning for the Internet of Things

Lyu

Bezdek

et al. 2019

IEEE Trans. Ind. Inf.

View full text Add to dashboard Cite

Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation

He¹,

Haffari²,

Norouzi³

2020

View full text Add to dashboard Cite

This paper introduces Dynamic Programming Encoding (DPE), a new segmentation algorithm for tokenizing sentences into subword units. We view the subword segmentation of output sentences as a latent variable that should be marginalized out for learning and inference. A mixed character-subword transformer is proposed, which enables exact log marginal likelihood estimation and exact MAP inference to find target segmentations with maximum posterior probability. DPE uses a lightweight mixed character-subword transformer as a means of pre-processing parallel data to segment output sentences using dynamic programming. Empirical results on machine translation suggest that DPE is effective for segmenting output sentences and can be combined with BPE dropout for stochastic segmentation of source sentences. DPE achieves an average improvement of 0.9 BLEU over BPE (Sennrich et al., 2016) and an average improvement of 0.55 BLEU over BPE dropout (Provilkov et al., 2019) on several WMT datasets including English ↔ (German, Romanian, Estonian, Finnish, Hungarian).

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xuanli He

Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness

Privacy-Preserving Collaborative Deep Learning with Application to Human Activity Recognition

Fog-Empowered Anomaly Detection in IoT Using Hyperellipsoidal Clustering

Fog-Embedded Deep Learning for the Internet of Things

Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation

Contact Info

Product

Resources

About