Ranjana Nadagoudar scite author profile

A large number of cloud services require users to share private data, such as electronic health records, for data analysis or mining, which raises privacy concerns. Anonymizing data sets via generalization to satisfy privacy requirements such as k-anonymity is a widely used category of privacy-preserving techniques. The scale of data in many cloud applications is growing rapidly in line with the Big Data trend, making it difficult for commonly used software tools to capture, manage, and process such large-scale data within an acceptable elapsed time. As a result, existing anonymization approaches struggle to preserve privacy on privacy-sensitive large-scale data sets because they do not scale well. In this paper, we propose a scalable two-phase top-down specialization (TDS) approach to anonymize large-scale data sets using the MapReduce framework on cloud. In both phases of our approach, we design a group of MapReduce jobs that carry out the specialization computation in a highly scalable way. Experimental results demonstrate that our approach significantly improves the scalability and efficiency of TDS over existing approaches.
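To make the k-anonymity requirement mentioned in the abstract concrete, the following is a minimal single-machine sketch, not the paper's algorithm. It generalizes two quasi-identifiers and counts the resulting groups with a map step and a reduce step, which is the kind of counting a MapReduce job could distribute. The records, the field choices, and the names `generalize`, `map_phase`, `reduce_phase`, and `is_k_anonymous` are all hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical toy records: (age, zip_code, diagnosis). Age and zip code are
# treated as quasi-identifiers; diagnosis is the sensitive value.
RECORDS = [
    (34, "89052", "flu"),
    (36, "89052", "asthma"),
    (38, "89014", "flu"),
    (51, "89123", "diabetes"),
    (57, "89123", "flu"),
    (59, "89129", "asthma"),
]


def generalize(record, age_width, zip_digits):
    """Return the generalized quasi-identifier tuple for one record."""
    age, zip_code, _ = record
    low = (age // age_width) * age_width
    age_range = f"{low}-{low + age_width - 1}"
    # Keep the leading digits of the zip code and mask the rest.
    zip_prefix = zip_code[:zip_digits] + "*" * (len(zip_code) - zip_digits)
    return (age_range, zip_prefix)


def map_phase(records, age_width, zip_digits):
    """Emit (generalized key, 1) for every record, like a mapper would."""
    for record in records:
        yield generalize(record, age_width, zip_digits), 1


def reduce_phase(pairs):
    """Sum the counts per generalized key, like a reducer would."""
    counts = Counter()
    for key, value in pairs:
        counts[key] += value
    return counts


def is_k_anonymous(records, k, age_width, zip_digits):
    """True if every generalized group contains at least k records."""
    counts = reduce_phase(map_phase(records, age_width, zip_digits))
    return all(count >= k for count in counts.values())


# Coarse generalization: two groups of three records each.
print(is_k_anonymous(RECORDS, k=3, age_width=20, zip_digits=3))
# Finer generalization: some groups shrink to a single record.
print(is_k_anonymous(RECORDS, k=2, age_width=10, zip_digits=5))
```

A top-down specialization procedure would start from the coarsest level and move toward finer levels only while a check like `is_k_anonymous` still holds. How the paper selects and scores each specialization is not described in the abstract, so that part is left out here.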

Ranjana Nadagoudar

Liver Diseases Prediction using KNN with Hyper Parameter Tuning Techniques

A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization using MapReduce on Cloud
