Yan Yan scite author profile

Yan Yan

4Publications

40Citation Statements Received

88Citation Statements Given

How they've been cited

How they cite others

Affiliations

Lanzhou University of Technology, Macquarie University

Publications

Order By: Most citations

Differential Private Spatial Decomposition and Location Publishing Based on Unbalanced Quadtree Partition Algorithm

et al. 2020

View full text Add to dashboard Cite

Centralized publishing of big location data provides great convenience for various locationbased interactive queries and services. Privacy protection of users' location information is an indispensable issue in the security of big data applications. Partition publishing is an effective way to release statistical information of two-dimensional big location data. By combining with the differential privacy model, it can provide more accurate range counting query service on the premise of ensuring location privacy. In order to further improve the availability of location data subsequent to centralized publishing, this paper analyzes the primary noise sources of partition publishing and discusses the constraints among publishing errors, the spatial partition structure, and privacy budget allocation. An unbalanced quadtree partition algorithm based on regional uniformity is proposed. Accordingly, the gradient privacy budget allocation scheme and adjustment method are designed to ensure the effectiveness of the differential privacy model. Experimental comparison of the real-world datasets proves the advantages of the proposed algorithm in improving the querying accuracy of the published data. INDEX TERMS Privacy preserving data publishing; location privacy; private spatial decomposition; differential privacy; unbalanced quadtree partition; gradient budget allocation.

show abstract

A weighted K-member clustering algorithm for K-anonymization

et al. 2021

View full text Add to dashboard Cite

Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy

et al. 2019

View full text Add to dashboard Cite

Data releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreover, the quality of published data cannot reflect the characteristics of the dynamic changes of big data, which often leads to subsequent data analysis and mining errors. This paper proposes an adaptive sampling mechanism and privacy protection method for the release of big location data. In order to reflect the dynamic change of data in time, we design an adaptive sampling mechanism based on the proportional-integral-derivative (PID) controller according to the temporal and spatial correlation of the location data. To ensure the privacy of published data, we propose a heuristic quad-tree partitioning method as well as a corresponding privacy budget allocation strategy. Experiments and analysis prove that the adaptive sampling mechanism proposed in this paper can effectively track the trend of dynamic changes of data, and the designed differential privacy method can improve the accuracy of counting query and enhance the availability of published data under the premise of certain privacy intensity. The proposed methods can also be readily extended to other areas of big data release applications. INDEX TERMS Big location data, privacy preserving data publishing, adaptive sampling, differential privacy, heuristic quad-tree partitioning.

show abstract

Achieving Differential Privacy Publishing of Location-Based Statistical Data Using Grid Clustering

Yan

Sun

Mahmood

et al. 2022

IJGI

View full text Add to dashboard Cite

Statistical partitioning and publishing is commonly used in location-based big data services to address queries such as the number of points of interest, available vehicles, traffic flows, infected patients, etc., within a certain range. Adding noise perturbation to the location-based statistical data according to the differential privacy model can reduce various risks caused by location privacy leakage while keeping the statistical characteristics of the published data. The traditional statistical partitioning and publishing methods realize the decomposition and indexing of 2D space from top to bottom. However, they can easily cause the over-partitioning or under-partitioning phenomenon, and therefore need multiple times of data scan. This paper proposes a grid clustering and differential privacy protection method for location-based statistical big data publishing scenarios. We implement location-based big data statistics in units of equal-sized grids and perform density classification on uniformly distributed grids by discrete wavelet transform. A bottom-up grid clustering algorithm is designed to perform on the blank and the uniform grids of the same density level based on neighborhood similarity. The Laplacian noise is incorporated into the clustering results according to the differential privacy model to form the published statistics. Experimental comparison of the real-world datasets manifests that the grid clustering and differential privacy publishing method proposed in this paper is superior to other existing partition publishing methods in terms of range querying accuracy and algorithm operating efficiency.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yan Yan

Differential Private Spatial Decomposition and Location Publishing Based on Unbalanced Quadtree Partition Algorithm

A weighted K-member clustering algorithm for K-anonymization

Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy

Achieving Differential Privacy Publishing of Location-Based Statistical Data Using Grid Clustering

Contact Info

Product

Resources

About