2012
DOI: 10.1186/1471-2288-12-163
|View full text |Cite
|
Sign up to set email alerts
|

Fitting parametric random effects models in very large data sets with application to VHA national data

Abstract: BackgroundWith the current focus on personalized medicine, patient/subject level inference is often of key interest in translational research. As a result, random effects models (REM) are becoming popular for patient level inference. However, for very large data sets that are characterized by large sample size, it can be difficult to fit REM using commonly available statistical software such as SAS since they require inordinate amounts of computer time and memory allocations beyond what are available preventin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 43 publications
(49 reference statements)
0
16
0
Order By: Relevance
“…One of the major uses for clinical big data is in analysis of the prevalence or trends of a disease or phenotype among different populations. An early big data study evaluated a cohort consisting of 890,394 US veterans with diabetes followed from 2002 through 2006 [ 43 ]. Bermejo-Sanchez et al [ 44 ] observed 326 of the birth defect Amelia among 23 million live births, stillbirths, and fetal anomalies from 23 countries and 4 continents, and found the trend of higher prevalence of Amelia among younger mothers.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…One of the major uses for clinical big data is in analysis of the prevalence or trends of a disease or phenotype among different populations. An early big data study evaluated a cohort consisting of 890,394 US veterans with diabetes followed from 2002 through 2006 [ 43 ]. Bermejo-Sanchez et al [ 44 ] observed 326 of the birth defect Amelia among 23 million live births, stillbirths, and fetal anomalies from 23 countries and 4 continents, and found the trend of higher prevalence of Amelia among younger mothers.…”
Section: Resultsmentioning
confidence: 99%
“…Gebregziabher et al [ 43 ] stated that the datasets generated through many translational research projects to answer questions of public health interest are not self-explanatory due to complexity and inadequate description/documentation of the dataset's parameters and associated metadata. The methodologies for interpreting the data can therefore be subject to all sorts of philosophical debate.…”
Section: Discussionmentioning
confidence: 99%
“…In wound care, common examples include the unit of analysis (e.g., multiple wounds per patient) or patients treated at different investigational sites or settings or by different clinicians. 37,38 Any time significant clustering is encountered in a dataset, not adjusting for the clustering effects is likely to lead to overestimation of the effect size of the intervention or parameter under study. Mixed models in which such clusters are treated as random effects are the most frequently methods used for adjustment, but these methods are computationally intensive, and may be impossible in very big datasets due to available computing power or memory.…”
Section: Other Analytical Issuesmentioning
confidence: 99%
“…Mixed models in which such clusters are treated as random effects are the most frequently methods used for adjustment, but these methods are computationally intensive, and may be impossible in very big datasets due to available computing power or memory. 38 Standard of care (SOC) is likely to vary considerably in large datasets but nevertheless can have a large impact on wound healing. The TIME algorithm, developed from a meeting of wound care experts, outlines an ideal SOC that includes tissue management, infection, moisture imbalance, and edge of the wound.…”
Section: Other Analytical Issuesmentioning
confidence: 99%
“…In general situations, one can partition the data between multiple processors, compute separate parameter estimates for each chunk and then combine the results (Huang and Gelman, ; Gebregziabher et al ., ; Khanna et al ., ; Scott et al ., ). These splitting strategies often require the same total computational cost, but they split the costs between K processors, reducing wall clock time by a factor of K .…”
Section: Introductionmentioning
confidence: 99%