2020
DOI: 10.1093/mnras/staa207
|View full text |Cite
|
Sign up to set email alerts
|

Identifying new X-ray binary candidates in M31 using random forest classification

Abstract: Identifying X-ray binary (XRB) candidates in nearby galaxies requires distinguishing them from possible contaminants including foreground stars and background active galactic nuclei. This work investigates the use of supervised machine learning algorithms to identify high-probability X-ray binary candidates. Using a catalogue of 943 Chandra X-ray sources in the Andromeda galaxy, we trained and tested several classification algorithms using the X-ray properties of 163 sources with previously known types. Amongs… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 53 publications
(100 reference statements)
0
10
0
Order By: Relevance
“…The precision of XRBs and their recall seem to be slightly higher and lower, respectively, but this depends on the weight chosen for the XRB class. The f 1 score of XRBs is also higher than a recent random forest classifier of X-ray sources in the literature (0.769 for Arnason et al 2020, however not based on the same source properties). We compared random forest performance to our classification on the test sample, both statistically and with manual inspection of a small subsample.…”
Section: Comparison To Machine Learning Resultsmentioning
confidence: 55%
See 2 more Smart Citations
“…The precision of XRBs and their recall seem to be slightly higher and lower, respectively, but this depends on the weight chosen for the XRB class. The f 1 score of XRBs is also higher than a recent random forest classifier of X-ray sources in the literature (0.769 for Arnason et al 2020, however not based on the same source properties). We compared random forest performance to our classification on the test sample, both statistically and with manual inspection of a small subsample.…”
Section: Comparison To Machine Learning Resultsmentioning
confidence: 55%
“…In the literature, no classification work has addressed the 2SXPS catalog so far. However, other similar catalogs, such as the XMM-Newton serendipitous source catalog (Webb et al 2020 in its latest version), are subject to classification works, for example Lin et al (2012) and Farrell et al (2015), respectively, using the "threshold rules" approach and a random forest algorithm, while Arnason et al (2020) tested several machine learning methods to classify Chandra X-ray sources in M31. Besides, a study preliminary to this work (Primorac 2015) shows that the Swift catalog has an interesting potential for X-ray source classification, with a classification based on selection criteria forming a decision tree with three classes: AGN, stars, and stellar-mass compact objects.…”
Section: Swift-xrt Catalog: 2sxpsmentioning
confidence: 99%
See 1 more Smart Citation
“…After optimizing the algorithms and training the models, we evaluated their performance on the test set. For the evaluation of the algorithms, we used the confusion matrix, which is widely used in problems of statistical classification (e.g., Mahabal et 2017; Arnason et al 2020). The confusion matrix has a table layout that visualizes the performance of a classifier using the test subset.…”
Section: Evaluating the Performance Of The Best Rf And Prf Modelsmentioning
confidence: 99%
“…The former use known and labeled data as inputs, while the latter try to learn the various groups of the data from the data itself (for a full overview, see Baron 2019). Recently, thanks to improvements on software development and computational power, an increasing number of studies apply machine learning to a variety of problems in astronomy, taking advantage of its computational speed, as well as, its ability to handle large volumes of data (e.g., Mahabal et al 2008;Laurino et al 2011;Castro et al 2018;Pearson et al 2019;Clarke et al 2020;Arnason et al 2020).…”
Section: Introductionmentioning
confidence: 99%