2017
DOI: 10.1093/mnras/stx1807

Local two-sample testing: a new tool for analysing high-dimensional astronomical data

Abstract: Modern surveys have provided the astronomical community with a flood of high-dimensional data, but analyses of these data often occur after their projection to lower-dimensional spaces. In this work, we introduce a local two-sample hypothesis test framework that an analyst may directly apply to data in their native space. In this framework, the analyst defines two classes based on a response variable of interest (e.g. higher-mass galaxies versus lower-mass galaxies) and determines at arbitrary points in predict…

Citations: cited by 7 publications (6 citation statements)
References: 54 publications (66 reference statements)
“…The objective of DA-seq is to identify regions in R^m where f_0 is significantly larger than f_1 and vice versa, by analyzing the set of samples {x_i, y_i}_{i=1}^n. One approach to find DA regions is based on local two-sample tests (49)(50)(51)(52). A global two-sample test determines whether two sets of samples were generated by the same distribution.…”
Section: Methods (mentioning)
confidence: 99%
“…Therefore, aside from detecting that a difference between the two distributions exists, it is of interest to localize it. Indeed, localizing the difference between two distributions is useful in various scientific applications, such as single-cell RNA sequencing [65], astronomy [25], and climate reanalysis [48].…”
Section: Scientific Discovery In Two-sample Datasets (mentioning)
confidence: 99%
“…One approach to find DA regions is based on local two sample tests [8,10,19]. A global two sample test determines whether two sets of samples were generated by the same distribution.…”
Section: Problem Setup (mentioning)
confidence: 99%
“…al. [10]. These methods compute a test statistic in local neighborhoods around randomly selected cells {x_i}.…”
Section: Problem Setup (mentioning)
confidence: 99%
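
Illustrative sketch (not taken from the paper or from the citing works): the citation statements above describe computing a test statistic in local neighborhoods around selected points. One simple way to do that, assuming a k-nearest-neighbor neighborhood and a normal-approximation z-score for a binomial proportion, is shown below. The function name `local_two_sample_z`, the parameter `k`, and the choice of statistic are all assumptions made for illustration; the original method may differ in every one of these respects.

```python
import math
import numpy as np

def local_two_sample_z(X, labels, centers, k=50):
    """Hypothetical helper: z-score of the local class-1 fraction at each center.

    X       : (n, d) array of samples from both classes
    labels  : (n,) array of 0/1 class labels
    centers : (m, d) array of points at which to test
    k       : number of nearest neighbors defining each local neighborhood
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels, dtype=int)
    centers = np.asarray(centers, dtype=float)

    # Global class-1 fraction, used as the null value of the local proportion.
    p_global = labels.mean()
    # Standard error of a proportion estimated from k draws under the null.
    se = math.sqrt(p_global * (1.0 - p_global) / k)

    z = np.empty(len(centers))
    for j, c in enumerate(centers):
        dist = np.linalg.norm(X - c, axis=1)
        # Indices of the k samples closest to this center (order not needed).
        neighbors = np.argpartition(dist, k - 1)[:k]
        p_local = labels[neighbors].mean()
        z[j] = (p_local - p_global) / se
    return z
```

A large positive z-score marks a point where class 1 is locally over-represented, and a large negative one marks over-representation of class 0. The sketch ignores two issues a real analysis would need to handle: neighborhoods of nearby centers overlap, so the tests are dependent, and testing many centers calls for a multiple-comparison correction.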