Zhiyuan Chen scite author profile

Zhiyuan Chen

5Publications

19Citation Statements Received

78Citation Statements Given

How they've been cited

How they cite others

160

Affiliations

University of Maryland, Baltimore County

Publications

Order By: Most citations

A data recipient centered de-identification method to retain statistical attributes

Gal

Tucker

Gangopadhyay

et al. 2014

Journal of Biomedical Informatics

View full text Add to dashboard Cite

Privacy has always been a great concern of patients and medical service providers. As a result of the recent advances in information technology and the government's push for the use of Electronic Health Record (EHR) systems, a large amount of medical data is collected and stored electronically. This data needs to be made available for analysis but at the same time patient privacy has to be protected through de-identification. Although biomedical researchers often describe their research plans when they request anonymized data, most existing anonymization methods do not use this information when de-identifying the data. As a result, the anonymized data may not be useful for the planned research project. This paper proposes a data recipient centered approach to tailor the de-identification method based on input from the recipient of the data. We demonstrate our approach through an anonymization project for biomedical researchers with specific goals to improve the utility of the anonymized data for statistical models used for their research project. The selected algorithm improves a privacy protection method called Condensation by Aggarwal et al. Our methods were tested and validated on real cancer surveillance data provided by the Kentucky Cancer Registry.

show abstract

A Learning Approach to SQL Query Results Ranking Using Skyline and Users' Current Navigational Behavior

Chen

Sun

2013

IEEE Trans. Knowl. Data Eng.

View full text Add to dashboard Cite

Dynamic Query Forms for Database Queries

Tang

Jiang

et al. 2014

IEEE Trans. Knowl. Data Eng.

View full text Add to dashboard Cite

Abstract-Modern scientific databases and web databases maintain large and heterogeneous data. These real-world databases contain over hundreds or even thousands of relations and attributes. Traditional predefined query forms are not able to satisfy various ad-hoc queries from users on those databases. This paper proposes DQF, a novel database query form interface, which is able to dynamically generate query forms. The essence of DQF is to capture a user's preference and rank query form components, assisting him/her to make decisions. The generation of a query form is an iterative process and is guided by the user. At each iteration, the system automatically generates ranking lists of form components and the user then adds the desired form components into the query form. The ranking of form components is based on the captured user preference. A user can also fill the query form and submit queries to view the query result at each iteration. In this way, a query form could be dynamically refined till the user satisfies with the query results. We utilize the expected F-measure for measuring the goodness of a query form. A probabilistic model is developed for estimating the goodness of a query form in DQF. Our experimental evaluation and user study demonstrate the effectiveness and efficiency of the system.

show abstract

Target-Based, Privacy Preserving, and Incremental Association Rule Mining

Ahluwalia¹,

Gangopadhyay²,

Chen³

et al. 2017

IEEE Trans. Serv. Comput.

View full text Add to dashboard Cite

We consider a special case in association rule mining where mining is conducted by a third party over data located at a central location that is updated from several source locations. The data at the central location is at rest while that flowing in through source locations is in motion. We impose some limitations on the source locations, so that the central target location tracks and privatizes changes and a third party mines the data incrementally. Our results show high efficiency, privacy and accuracy of rules for small to moderate updates in large volumes of data. We believe that the framework we develop is therefore applicable and valuable for mining big data.

show abstract

A generic and distributed privacy preserving classification method with a worst-case privacy guarantee

Banerjee

Chen

Gangopadhyay

2013

Distrib Parallel Databases

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zhiyuan Chen

A data recipient centered de-identification method to retain statistical attributes

A Learning Approach to SQL Query Results Ranking Using Skyline and Users' Current Navigational Behavior

Dynamic Query Forms for Database Queries

Target-Based, Privacy Preserving, and Incremental Association Rule Mining

A generic and distributed privacy preserving classification method with a worst-case privacy guarantee

Contact Info

Product

Resources

About