Classification of <i>Fermi</i>-LAT unidentified gamma-ray sources using <scp>catboost</scp> gradient boosting decision trees

Coronado-Blázquez, Javier

doi:10.1093/mnras/stac1950

Cited by 16 publications

(8 citation statements)

References 55 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We use KNN (Arthur & Vassilvitskii 2006;Xu et al 2020), which determines the category of the sample to be divided according to the category of the nearest one or several samples. We use gradient boosting + categorical features (CB; Prokhorenkova et al 2017;Coronado-Blázquez 2022, which supports categorical variables and high accuracy gradient boosting decision tree (GBDT) framework. LR aims to map the results of linear regression to the interval from 0 to 1 through the logistic function, and the classification of the data is obtained by comparing with 0.5.…”

Section: Algorithmsmentioning

confidence: 99%

“…Supervised machine learning (SML) is a useful and alternative classification method and it provide reference for classification results. SML had been used by many scholars (Ackermann et al 2012;Chiaro et al 2016;Saz Parkinson et al 2016;Salvetti et al 2017;Lefaucheur & Pita 2017;Yi et al 2017;Kovačević et al 2019Kovačević et al , 2020Kang et al 2019a;Xu et al 2020;Xiao et al 2020;Zhu et al 2021b;Coronado-Blázquez 2022) to classify BCUs from the Fermi-LAT catalogs.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Classification of Fermi BCUs Using Machine Learning

Xiao,

Xie,

Zeng

et al. 2023

ApJ

View full text Add to dashboard Cite

The Fermi Large Area Telescope (LAT) has detected 6659 γ-ray sources in the incremental version (4FGL-DR3, for Data Release 3) of the fourth Fermi-LAT catalog of γ-ray sources and 3743 of them are blazars, including 1517 blazar candidates of uncertain type (BCUs). Blazars are generally classified by properties of emission lines into BL Lac objects and flat spectrum radio quasars (FSRQs). However, BCUs are difficult to classify because of the lack of spectrum. In this work we apply five different machine-learning algorithms (K-nearest neighbors, logistic regression, support vector machine, random forest, CatBoost) to evaluate the classification of 1517 BCUs based on the observational data of 4FGL-DR3. The results indicate that the use of recursive feature elimination cross-validation can effectively improve the accuracy of models and reduce computation time. We use our models to predict the BCUs from 4FGL-DR3 and the results of the overlapping of the five models are as follows: 811 BL Lac objects, 397 FSRQs, and 309 BCUs.

show abstract

Section: Algorithmsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Classification of Fermi BCUs Using Machine Learning

Xiao,

Xie,

Zeng

et al. 2023

ApJ

View full text Add to dashboard Cite

show abstract

“…Decision trees are a top-down, divide-and-conquer recursive process, where the comparison of attributes is also the comparison of node attribute values within the decision tree [11][12]. Each path from the root node to the leaf nodes forms a classification rule that recurses to the leaf nodes to obtain a conclusion.…”

Section: Decision Tree Modelmentioning

confidence: 99%

“…Jinshan Lin, Min Lin and Hang Xu. Applied Mathematics and Nonlinear Sciences, 9(1) (2024)[1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17] …”

mentioning

confidence: 99%

A study of algorithms for solving nonlinear two-level programming problems oriented to decision tree models

Lin,

2023

Applied Mathematics and Nonlinear Sciences

View full text Add to dashboard Cite

In this paper, the original two-level planning problem is transformed into a single-level optimization problem by combining the penalty function method for the large amount of data processing involved in the training process of the decision tree model, setting the output as a classification tree in the iterative process of the CART decision tree, and recursively building the CART classification tree with the training set to find the optimal solution set for the nonlinear two-level planning problem. It is verified that the proposed solution method is also stable at a convergence index of 1.0 with a maximum accuracy of 95.37%, which can provide an efficient solution method for nonlinear two-level programming problems oriented to decision tree models.

show abstract

“…However, the abovementioned studies predominantly use machine-learning methods of the unsupervised type, where only the observed features of the GRBs are inputted into the models, but not the labels (the GRBs' physical classes being Type I or II). On the other hand, the other type of machinelearning methods, supervised methods, are also commonly employed by astronomy researchers in the classification of other astronomical objects (e.g., Luo et al 2023;Zhu-Ge et al 2023;Connor & van Leeuwen 2018;Butter et al 2022;Coronado-Blázquez 2022;de Beurs et al 2022;Fan et al 2022;Villa-Ortega et al 2022;Yang et al 2022a;Kaur et al 2023), although studies on the application of supervised methods on GRB are scarce. Since supervised methods take both features and labels as input, and can produce deterministic predictions of the class of new GRBs, they can be helpful in identifying the true physical origin of intermingled GRBs.…”

Section: Introductionmentioning

confidence: 99%

Identifying the Physical Origin of Gamma-Ray Bursts with Supervised Machine Learning

Luo,

Wang,

Zhu-Ge

et al. 2023

ApJ

View full text Add to dashboard Cite

The empirical classification of gamma-ray bursts (GRBs) into long and short GRBs based on their durations is already firmly established. This empirical classification is generally linked to the physical classification of GRBs originating from compact binary mergers and GRBs originating from massive star collapses, or Type I and II GRBs, with the majority of short GRBs belonging to Type I and the majority of long GRBs belonging to Type II. However, there is a significant overlap in the duration distributions of long and short GRBs. Furthermore, some intermingled GRBs, i.e., short-duration Type II and long-duration Type I GRBs, have been reported. A multiparameter classification scheme of GRBs is evidently needed. In this paper, we seek to build such a classification scheme with supervised machine-learning methods, chiefly XGBoost. We utilize the GRB Big Table and Greiner’s GRB catalog and divide the input features into three subgroups: prompt emission, afterglow, and host galaxy. We find that the prompt emission subgroup performs the best in distinguishing between Type I and II GRBs. We also find the most important distinguishing features in prompt emission to be T 90, the hardness ratio, and fluence. After building the machine-learning model, we apply it to the currently unclassified GRBs to predict their probabilities of being either GRB class, and we assign the most probable class of each GRB to be its possible physical class.

show abstract

Classification of Fermi-LAT unidentified gamma-ray sources using catboost gradient boosting decision trees

Cited by 16 publications

References 55 publications

Classification of Fermi BCUs Using Machine Learning

Classification of Fermi BCUs Using Machine Learning

A study of algorithms for solving nonlinear two-level programming problems oriented to decision tree models

Identifying the Physical Origin of Gamma-Ray Bursts with Supervised Machine Learning

Contact Info

Product

Resources

About