2018
DOI: 10.1093/comjnl/bxy064
|View full text |Cite
|
Sign up to set email alerts
|

Automating Data Mart Construction from Semi-structured Data Sources

Abstract: The global food and agricultural industry has a total market value of USD 8 trillion in 2016, and decision makers in the Agri sector require appropriate tools and up-to-date information to make predictions across a range of products and areas. Traditionally, these requirements are met with information processed into a data warehouse and data marts constructed for analyses. Increasingly however, data is coming from outside the enterprise and often in unprocessed forms. As these sources are outside the control o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 22 publications
0
6
0
1
Order By: Relevance
“…Finally, the two datasets require integration. A previous work (Scriney et al, 2019 ) proposes a methodology for the determination of an integration strategy by examining the common datasets for each source in order to design a common data model. In this study, the common data model is the FHIR JSON schema.…”
Section: Enhancing Semantic Interoperability With the Semantic Enginementioning
confidence: 99%
“…Finally, the two datasets require integration. A previous work (Scriney et al, 2019 ) proposes a methodology for the determination of an integration strategy by examining the common datasets for each source in order to design a common data model. In this study, the common data model is the FHIR JSON schema.…”
Section: Enhancing Semantic Interoperability With the Semantic Enginementioning
confidence: 99%
“…• Interoperable. Data interoperability means the ability to share and integrate data from different users and sources [16]. This can only happen if a standard (meta) data model is employed to describe data, an important concept which generally requires data engineering skills to deliver.…”
Section: Application Of Fair Metricsmentioning
confidence: 99%
“…However, in many cases, variables that are available are not suited to machine learning algorithms with the identification and preparation Han et al (2011) of the data set seen as a complex process. Another critical task requires the transformation of variables (Components) (King and Jackson, 1999;Westad et al, 2003;Granato et al, 2018;Scriney et al, 2019;Yun et al, 2019) using the appropriate normalization methodology (Milligan and Cooper, 1988). Our research is based on a problem that is common to many retailers: computing CLV values where customer records are separated across different points of sales with no clear method to combine them.…”
Section: Problem Description and Motivationmentioning
confidence: 99%