2018
DOI: 10.1080/10485252.2018.1508678
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized nonparametric multiple testing

Abstract: Consider a big data multiple testing task, where, due to storage and computational bottlenecks, one is given a very large collection of p-values by splitting into manageable chunks and distributing over thousands of computer nodes. This paper is concerned with the following question: How can we find the full data multiple testing solution by operating completely independently on individual machines in parallel, without any data exchange between nodes? This version of the problem tends naturally to arise in a w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…One can also compute pvalues using the χ 2 m null distribution of qDIV. For more details see Mukhopadhyay (2018) and Mukhopadhyay and Wang (2020a).…”
Section: Goodness-of-fit Diagnosticsmentioning
confidence: 99%
See 1 more Smart Citation
“…One can also compute pvalues using the χ 2 m null distribution of qDIV. For more details see Mukhopadhyay (2018) and Mukhopadhyay and Wang (2020a).…”
Section: Goodness-of-fit Diagnosticsmentioning
confidence: 99%
“…fidence from the ability of our theory to tackle statistical problems as diverse as time-series analysis (Mukhopadhyay and Parzen, 2018), copula modeling , empirical Bayes , graph-theory (Mukhopadhyay and Wang, 2020b), multiple testing (Mukhopadhyay, 2018(Mukhopadhyay, , 2016, high-dimensional data analysis (Mukhopadhyay and Wang, 2020a), large-scale distributed learning (Bruce et al, 2019), etc. This is an ongoing and growing movement with the mission to structure the field in an understandable and effective way.…”
Section: Modern Applied Statistics: Theory Practice and Pedagogymentioning
confidence: 99%
“…where f h (x) is the pdf of the unexpected signal and assume its distribution to be normal with center at 37 and width 1.8. Let f b (x) and f s (x) be defined as in (11) and (13), respectively, and let η 1 = 0.15 and η 2 = 0.1. We can start with a nonparametric signal detection stage by setting g(x) = g bs (x) in (30), with f s defined as in (13) dicate a significant departure from the expected backgroundonly model and a prominent peak is observed in correspondence of the signal of interest centered around 25.…”
Section: Semiparametric Signal Characterizationmentioning
confidence: 99%
“…At this stage, if f s was unknown, we could proceed with a semiparametric signal characterization as in Case IIb. Whereas assuming that the distribution of the signal of interest is known and given by (13), we fit (33), aiming to capture a significant deviation in correspondence of the second bump. This is precisely what we observe in the bottom right panel of Fig.…”
Section: Semiparametric Signal Characterizationmentioning
confidence: 99%
See 1 more Smart Citation