2019
DOI: 10.1007/978-3-030-27615-7_23

Data Lakes: Trends and Perspectives

Abstract: As a relatively new concept, data lake has neither a standard definition nor an acknowledged architecture. Thus, we study the existing work and propose a complete definition and a generic and extensible architecture of data lake. What's more, we introduce three future research axes in connection with our health-care Information Technology (IT) activities. They are related to (i) metadata management that consists of intra- and inter-metadata, (ii) a unified ecosystem for companies' data warehouses and data lakes…

Cited by 65 publications (55 citation statements)
References 19 publications
“…[14] ⧫ □ ✓ ✓ ✓ ✓ Alrehamy and Walker [42] ⧫ ✓ ✓ Data wrangling [41] ⧫ ✓ ✓ ✓ ✓ Constance [16] ⧫ ✓ ✓ GEMMS [32] ◊ ✓ CLAMS [12] ⧫ ✓ Suriarachchi and Plale [40] ⧫ ✓ ✓ Singh, K. et al [39] ⧫ ✓ ✓ ✓ ✓ Farrugia et al [13] ⧫ ✓ GOODS [17] ⧫ ✓ ✓ ✓ ✓ CoreDB [3] ⧫ ✓ ✓ Ground [18] ◊ □ ✓ ✓ ✓ ✓ KAYAK [26] ⧫ ✓ ✓ ✓ CoreKG [4] ⧫ ✓ ✓ ✓ ✓ ✓ Diamantini et al [10] ◊ ✓ ✓ ✓ Ravat, F., Zhao, Y. [34] ⧫ ◊ ✓ ✓ ✓ ✓ ✓ MEDAL [36] ◊ ✓ ✓ ✓ ✓ ✓ ✓ Our proposal ⧫ ◊ ✓ ✓ ✓ ✓ ✓ ✓ ⧫ : Data lake implementation ◊ : Metadata model □ : Model or implementation assimilable to a data lake ✓ : feature is available…”
Section: Discussion (mentioning)
confidence: 99%
“…The implementation and Table 1 shows the state-of-the-art approaches and the associated features provided. Among seventeen (17) proposals, only one approach [34] proposes a data lake implementation associated with a metadata management system. Moreover, this approach, like the majority, has been set up for a very specific case study, and does not allow, or hardly takes into account, the case of complex data such as spatial data (satellite data).…”
Section: Data and Software (mentioning)
confidence: 99%
“…But on the other hand, the pond architecture is also functional because Inmon's specifications consider some storage and process components distributed across data ponds (Figure 2). Ravat and Zhao also propose such an hybrid data lake architecture (Figure 5 [45]).…”
Section: Functional × Maturity Architectures (mentioning)
confidence: 99%
“…Data lakes are also characterized by tools designed to allow efficient data indexing and searching. The data lake concept has emerged very recently and is just starting to be comprehensively analyzed and conceptualized [37]. We will detail in the next chapter the architecture of the data lake that was created to collect the data produced by the sensors deployed on Mt.…”
Section: Introduction and State-of-the-art (mentioning)
confidence: 99%