Uni Eropa menerbitkan sebuah peraturan yang bernama General Data Protection Regulation (GDPR) untuk menjaga privasi warga. Peraturan ini meregulasi penyebaran data-data pribadi seperti nama, nomor telepon atau alamat yang mungkin akan digunakan untuk tujuan tertentu. Salah satu teknik yang dapat digunakan untuk menyebarkan data tanpa melanggar privasi dari subjek pemilik data adalah K-Anonymity. K-Anonymity memodifikasi nilai quasi-identifier hingga subjek tidak dapat dikenali lagi tetapi dataset tetap mengandung informasi yang diperlukan. Artikel ini telah mengimplementasikan K-Anonymity pada data Calon Legislatif untuk Pemilihan Umum Calon Legislatif tahun 2019 yang dihimpun dari laman resmi Komisi Pemilihan Umum. Dengan algoritma Mondrian Multidimensional K-Anonymity hasil anonimisasi menunjukkan bahwa masih terdapat data yang unik. Namun, dari hasil visualisasi terlihat hampir semua data memiliki anonimitas sama, yang dimungkinkan karena jumlah data partisi yang kurang banyak ataupun kurangnya keberagaman data.
Data processing speed in companies is important to speed up their analysis. Entity matching is a computational process that companies can perform in data processing. In conducting data processing, entity matching plays a role in determining two different data but referring to the same entity. Entity matching problems arise when the dataset used in the comparison is large. The deep learning concept is one of the solutions in dealing with entity matching problems. DeepMatcher is a python package based on a deep learning model architecture that can solve entity matching problems. The purpose of this study was to determine the matching between the two datasets with the application of DeepMatcher in entity matching using drug data from farmaku.com and k24klik.com. The comparison model used is the Hybrid model. Based on the test results, the Hybrid model produces accurate numbers, so that the entity matching used in this study runs well. The best accuracy value of the 10th training with an F1 value of 30.30, a precision value of 17.86, and a recall value of 100.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.