2015
DOI: 10.1371/journal.pcbi.1004071
|View full text |Cite
|
Sign up to set email alerts
|

Accurate Computation of Survival Statistics in Genome-Wide Studies

Abstract: A key challenge in genomics is to identify genetic variants that distinguish patients with different survival time following diagnosis or treatment. While the log-rank test is widely used for this purpose, nearly all implementations of the log-rank test rely on an asymptotic approximation that is not appropriate in many genomics applications. This is because: the two populations determined by a genetic variant may have very different sizes; and the evaluation of many possible variants demands highly accurate c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
30
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 30 publications
(31 citation statements)
references
References 41 publications
1
30
0
Order By: Relevance
“…Age, gender, mutation number and tumor stage were used as covariates. ExaLT, which was specifically designed for unequal sample sizes in TCGA data was compiled and run using the—table command according to the author’s readme on GitHub [ 15 ].…”
Section: Methodsmentioning
confidence: 99%
“…Age, gender, mutation number and tumor stage were used as covariates. ExaLT, which was specifically designed for unequal sample sizes in TCGA data was compiled and run using the—table command according to the author’s readme on GitHub [ 15 ].…”
Section: Methodsmentioning
confidence: 99%
“…To derive a p-value for the logrank test, the χ 2 test for independence, and the Kruskal-Wallis test, the statistic for these three tests is assumed to have χ 2 distribution. However, for the logrank test and χ 2 test this approximation is not accurate for small sample sizes and unbalanced cluster sizes, especially for large values of the test statistic (this was shown for example in [109] for the logrank test in the case of two clusters). Indeed, we encountered in our analysis cases where the approximation gave extreme pvalues (< 10 −10 ) for very small clusters (n = 3).…”
Section: Benchmarkmentioning
confidence: 99%
“…For all methods, for all trials, the resulting clusters are balanced in terms of the number of patients participating in the clusters except two trials of MCCA. Log-rank test is known to result in unrealistically low p-values when one of cluster size is small [47]. In those two trials, MCCA's extremely low p-values are due to cluster sizes of 9 and 14.…”
Section: Comparison With the State-of-the Art Multi-omics Methods Permentioning
confidence: 96%