Valéria de Abreu da Silva Bastos scite author profile

Introduction

Record linkage has been increasingly used in Brazil. However, only a few studies report the quality of the linkage process. Synthetic test data can be used to evaluate the quality of data linkage.

Objectives and Approach

To develop a synthetic data generator that creates test datasets with attributes and error characteristics similar to those found in Brazilian databases. We analyzed the 2013 mortality database from Rio de Janeiro State to determine the characteristics and frequency distributions of the database attributes (name, mother’s name, sex, date of birth and address). We used Python and C++ to customize and add routines to GeCo (http://dlrep.org/dataset/GeCo), a personal data generation tool developed by Tran et al. (DOI:10.1145/2505515.2508207).

Results

Brazilian names have specific characteristics that distinguish them from naming patterns in other countries: multiple family names are usual, as are composite first names, and homonyms are nevertheless frequent. A person's family names may include all or only part of the father's family names, the mother's family names, or both. As a result, family names vary widely among children, and members of the same family do not necessarily share a common family name.

Conclusion/Implications

Because of the specific national characteristics of name building in Brazil, modeling synthetic data is particularly challenging and requires more flexible rules in order to generate databases that actually allow the quality of data linkage processes to be assessed.
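The family-name variation described in the Results can be illustrated with a minimal Python sketch. It is not the authors' generator and does not use the GeCo API. The name pools, the four inheritance rules, and the function names (`parent_family_names`, `child_family_names`, `generate_person`) are all hypothetical and chosen only for illustration. The study itself derived its frequency distributions from the mortality database.

```python
import random

# Hypothetical toy pools (assumption): the study used frequency distributions
# taken from the 2013 Rio de Janeiro State mortality database instead.
FIRST_NAMES = ["Maria", "Ana", "José", "João", "Maria Clara", "João Pedro"]
FAMILY_NAMES = ["Silva", "Santos", "Oliveira", "Souza", "Pereira", "Costa"]


def parent_family_names(rng, max_parts=2):
    """Draw between one and max_parts distinct family names for a parent."""
    count = rng.randint(1, max_parts)
    return rng.sample(FAMILY_NAMES, count)


def child_family_names(mother, father, rng):
    """Combine the parents' family names under one illustrative rule.

    The four rules are assumptions; they mimic the idea that a child may
    carry all or part of either parent's family names, or of both.
    """
    rule = rng.choice(["both_full", "both_partial", "father_only", "mother_only"])
    if rule == "both_full":
        return mother + father
    if rule == "both_partial":
        # Keep only the last family name of each parent.
        return [mother[-1], father[-1]]
    if rule == "father_only":
        return list(father)
    return list(mother)


def generate_person(rng):
    """Build one synthetic record with the person's name and the mother's name."""
    mother = parent_family_names(rng)
    father = parent_family_names(rng)
    child = child_family_names(mother, father, rng)
    return {
        "name": " ".join([rng.choice(FIRST_NAMES)] + child),
        "mother_name": " ".join([rng.choice(FIRST_NAMES)] + mother),
    }


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so repeated runs give the same records
    for _ in range(5):
        print(generate_person(rng))
```

Because the rule is drawn independently for every record, two children of the same parents can end up with different family names. This is the situation the abstract identifies as a difficulty for record linkage.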

An AprioriAll Based Method for Guiding a Robot in Dynamic Environment

Lima, Lobo, Castellani, et al. 2018. IEEE Latin Am. Trans.

Valéria de Abreu da Silva Bastos

Mother-to-child transmission of Streptococcus mutans: A systematic review and meta-analysis

Oral Health of Babies and Mothers during the Breastfeeding Period

Management of over retention of permanent incisor impacted by compound odontoma: Clinical, radiological, and microscopic evaluation

Synthetic data generator for testing record linkage routines in Brazil.

An AprioriAll Based Method for Guiding a Robot in Dynamic Environment
