2019
DOI: 10.1007/978-3-030-33223-5_7
|View full text |Cite
|
Sign up to set email alerts
|

Modeling Data Lakes with Data Vault: Practical Experiences, Assessment, and Lessons Learned

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(14 citation statements)
references
References 10 publications
0
14
0
Order By: Relevance
“…Meanwhile, DL-related research problems are raising massive attention associated with the implementation of DL prototypes. A large range of challenges are discussed such as metadata management [56], data quality [41], data provenance [120], metadata enrichment [10,59], data preparation [82], dataset organization [6,131], modeling [108,98,52], data integration [60,57] and related dataset discovery [43,16,131,14,130]. Such data lake proposals targeting specific research challenges, are also our main focus in this survey, which will be addressed in Sec.…”
Section: -Present: Prosperity and Diversitymentioning
confidence: 99%
See 1 more Smart Citation
“…Meanwhile, DL-related research problems are raising massive attention associated with the implementation of DL prototypes. A large range of challenges are discussed such as metadata management [56], data quality [41], data provenance [120], metadata enrichment [10,59], data preparation [82], dataset organization [6,131], modeling [108,98,52], data integration [60,57] and related dataset discovery [43,16,131,14,130]. Such data lake proposals targeting specific research challenges, are also our main focus in this survey, which will be addressed in Sec.…”
Section: -Present: Prosperity and Diversitymentioning
confidence: 99%
“…The Hadoop Distributed File System (HDFS) is one of the most frequently mentioned data storage systems for data lakes [21,119,13]. HDFS supports a wide range of DATAMARAN [48] Skluma [118] Metadata modeling Generic metadata model (GEMMS [108], Constance [59]) Data vault [98,52] Graph-based metadata model (Diamantiniet al [32,33], Aurum [43], Sawadogoet et al [113])…”
Section: File-based Storage Systemsmentioning
confidence: 99%
“…One may view this as modelling for all possibilities of interactions between data, even those not required by the current business specification document. Proponents of data vaults, Giebler et al (2019) highlight the fact that data are stored in its raw format which improves the auditability of results and flexibility of the system for future adaptability of requirements. The popularity of data vault modelling is increasing and clear principles for the implementation of BISs using data vaults have been developed by industry role players (Stroo & Broekmans, 2018).…”
Section: Theoretical Perspectivesmentioning
confidence: 99%
“…While it has been acknowledged that metadata management is an important aspect in DLs, there are only few works on the modeling of data and metadata in a DL. Data vault is a dimensional modeling technique frequently applied in DW projects; in Giebler et al (2019), this modeling technique is applied to DLs and compared with other techniques. Because of the fragmentation of the data in many different tables, querying is expensive due to many join operations.…”
Section: Data Models and Semantics In Data Lakesmentioning
confidence: 99%