2020
DOI: 10.2196/15918
|View full text |Cite
|
Sign up to set email alerts
|

Enabling Agile Clinical and Translational Data Warehousing: Platform Development and Evaluation

Abstract: Background Modern data-driven medical research provides new insights into the development and course of diseases and enables novel methods of clinical decision support. Clinical and translational data warehouses, such as Informatics for Integrating Biology and the Bedside (i2b2) and tranSMART, are important infrastructure components that provide users with unified access to the large heterogeneous data sets needed to realize this and support use cases such as cohort selection, hypothesis generation… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 28 publications
(32 reference statements)
0
7
0
Order By: Relevance
“…Since the underlying grant program was competitively bid, the information architecture is the same within consortium sites, but different between the consortia. The technical architecture of the ETL pipeline in DIFUTURE integrates comprehensive event logging through which data quality is viewed in a structured and detailed manner [5]. An audit service allows flexibly configurable quality analyses, which are executed on a SQLbased data mart but are independent of concrete schemas.…”
Section: Resultsmentioning
confidence: 99%
“…Since the underlying grant program was competitively bid, the information architecture is the same within consortium sites, but different between the consortia. The technical architecture of the ETL pipeline in DIFUTURE integrates comprehensive event logging through which data quality is viewed in a structured and detailed manner [5]. An audit service allows flexibly configurable quality analyses, which are executed on a SQLbased data mart but are independent of concrete schemas.…”
Section: Resultsmentioning
confidence: 99%
“…A common approach for integrating data for research purposes is to establish data warehouses, which are specific types of databases designed for analytical processing of heterogeneous data. To reduce the efforts required to transform data into common representations, pay‐as‐you‐go approaches can be utilized 43 . These approaches are based on the principle of not aiming for a fully integrated dataset from the beginning, but instead incrementally harmonizing it into a common representation as it is needed, for example, for a circadian medicine research project together with domain experts.…”
Section: Data Integration Challengesmentioning
confidence: 99%
“…To reduce the efforts required to transform data into common representations, pay-as-you-go approaches can be utilized. 43 These approaches are based on the principle of not aiming for a fully integrated dataset from the beginning, but instead incrementally harmonizing it into a common representation as it is needed, for example, for a circadian medicine research project together with domain experts. This implies that efforts are invested when needed, leading to a more efficient integration process.…”
Section: Clinical Data Integrationmentioning
confidence: 99%
“…A single web server (i.e. Apache 2) is placed in a dedicated gateway container, which manages ingoing and outgoing communication using proxy and reverse proxy configurations for each instance's i2b2 and tranSMART installation [3].…”
Section: Overview Of the Data Warehousing Infrastructurementioning
confidence: 99%
“…The infrastructure is based on container technology and supports setting up new instances and terminating old instances in a scalable manner. For this purpose, we have adopted the open source approach presented by Spengler et al [3], which makes use of the Docker software stack for containerization and management of multiple instances. Another advantage of the approach is that it supports agile processes for data loading based on close feedback cycles between informaticians and medical researchers.…”
Section: Introductionmentioning
confidence: 99%