Proceedings of the 5th International Conference on Internet of Things, Big Data and Security 2020
DOI: 10.5220/0009569804280435
|View full text |Cite
|
Sign up to set email alerts
|

Approaching the (Big) Data Science Engineering Process

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 0 publications
0
10
0
Order By: Relevance
“…Once the highly curated and standardized PSets are released via ORCESTRA, they need to be preprocessed into tables which match the PharmacoDB Entity Relationship Diagram (ERD) before being loaded into the database. To ensure that the data ingestion standards in PharmacoDB adhere to FAIR data principles, the Pharmaco-Data Ingestion (PharmacoDI) project was initiated to create an Extract Transform Load (ETL) pipeline which adheres to modern data engineering best practices (28,29).…”
Section: Implementation Of Reproducible Pipelinesmentioning
confidence: 99%
“…Once the highly curated and standardized PSets are released via ORCESTRA, they need to be preprocessed into tables which match the PharmacoDB Entity Relationship Diagram (ERD) before being loaded into the database. To ensure that the data ingestion standards in PharmacoDB adhere to FAIR data principles, the Pharmaco-Data Ingestion (PharmacoDI) project was initiated to create an Extract Transform Load (ETL) pipeline which adheres to modern data engineering best practices (28,29).…”
Section: Implementation Of Reproducible Pipelinesmentioning
confidence: 99%
“…While initially being referred to as a synonym for large amounts of data that cannot be easily handled by relational databases and technologies of that time, today Big Data covers a variety of advanced data characteristics, technologies, paradigms and methods [1]. During this time, the concept has undergone significant changes that dramatically changed the term from a hype topic [7] to the foundation of most of the data-driven and data-intensive projects known today [4]. Despite that long lasting maturation and a highly active research community [8], no distinct and universally applied definition was found that precisely describes the nature and elements of that term [1].…”
Section: Big Datamentioning
confidence: 99%
“…Apart from the pure lack of experts and qualified staff [13], the comprehensive planning, engineering and integration of architectures represents a cumbersome task [1]. Many practitioners and researchers noted this problem and attempted to reduce the prevailing complexity through the design and developments of promising solutions, such as reference architectures [14], decision support systems [4], automation approaches [15] or the application of new technologies [6]. Especially in times at which highly decentralized or loosely coupled environments are sought more than ever, as in the case of very large business application scenarios, the use of Big Data in combination with the latter remains desirable.…”
Section: Big Datamentioning
confidence: 99%
See 2 more Smart Citations