2021
DOI: 10.3390/app11178275
|View full text |Cite
|
Sign up to set email alerts
|

Data Harmonization for Heterogeneous Datasets: A Systematic Literature Review

Abstract: As data size increases drastically, its variety also increases. Investigating such heterogeneous data is one of the most challenging tasks in information management and data analytics. The heterogeneity and decentralization of data sources affect data visualization and prediction, thereby influencing analytical results accordingly. Data harmonization (DH) corresponds to a field that unifies the representation of such a disparate nature of data. Over the years, multiple solutions have been developed to minimize… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(7 citation statements)
references
References 81 publications
(117 reference statements)
0
7
0
Order By: Relevance
“…In order to deal with heterogeneity in these forms of data, Data harmonization (DH) is based on its core techniques including Natural Language Preprocessing (NLP), text preprocessing, and some of the machine learning methods and deep learning can be helpful. [ 149 ] ML algorithms play a significant role in this regard. Each algorithm has its own set of benefits and drawbacks, as well as its own specific area of implementation.…”
Section: Challenging Issues Faced With Ai and Microfluidics For Biote...mentioning
confidence: 99%
“…In order to deal with heterogeneity in these forms of data, Data harmonization (DH) is based on its core techniques including Natural Language Preprocessing (NLP), text preprocessing, and some of the machine learning methods and deep learning can be helpful. [ 149 ] ML algorithms play a significant role in this regard. Each algorithm has its own set of benefits and drawbacks, as well as its own specific area of implementation.…”
Section: Challenging Issues Faced With Ai and Microfluidics For Biote...mentioning
confidence: 99%
“…This pertains to electronic medical records (EMRs), which may include all forms of data [ 120 ]. For instance, structured data refers to data that are simple to categorize in a database; they might contain a set of features and records such as patient’s biodata and generic health complaints such as fever or nausea [ 121 ]. On the other hand, some health data are unstructured and are usually in the form of photographs, text files (medical reports and summaries), and audio/visual recordings.…”
Section: Artificial Intelligencementioning
confidence: 99%
“…Realizing and exploiting different forms of data on a large scale would be highly valuable in implementing ML techniques in HMS [ 121 ]. Furthermore, when ML is used successfully, it may assist doctors in making near-perfect diagnoses, determining and improving patients’ overall health, and lowering costs [ 122 ].…”
Section: Artificial Intelligencementioning
confidence: 99%
See 1 more Smart Citation
“…Structuring electronic dental records through DL for a clinical decision support system. Medical informatics’ fundamental and difficult objective is to extract information from unstructured clinical text [ 172 ].…”
Section: Limitations and Future Recommendationsmentioning
confidence: 99%