O'TRAIN: a robust and flexible Real/Bogus classifier for the study of the optical transient sky

Makhlouf, K.; Turpin, D.; Corre, D.; Karpov, S.; Kann, D. A.; Klotz, A.

doi:10.48550/arxiv.2112.10280

Cited by 1 publication

(2 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This problem is particularly important for the identification of the electromagnetic (EM) counterpart of a Table 1. The architecture of O'TRAIN [36] (top) and WaveNet [37] (bottom) before the classification head. (i: the layer number, k: the filter size, nc i : the number of filters, p: the dropout rate, r dilated : the rate of dilation).…”

Section: Transient-vs-bogus Datasetmentioning

confidence: 99%

“…We conducted Grad-CAM analysis on the CNN-BS Net. The architecture of the image network follows that of O'TRAIN [36]. O'TRAIN performs well on various grayscale images, particularly on the transient versus bogus image classification problem, and is widely used in the astrophysical image analysis community.…”

Section: Network Architecture and Training Configurationmentioning

confidence: 99%

See 1 more Smart Citation

The effects of topological features on convolutional neural networks—an explanatory analysis via Grad-CAM

Lee

Jung

2023

Mach. Learn.: Sci. Technol.

View full text Add to dashboard Cite

Topological data analysis (TDA) characterizes the global structure of databased on topological invariants such as persistent homology, whereas convolutionalneural networks (CNNs) are capable of characterizing local features in the globalstructure of the data. In contrast, a combined model of TDA and CNN, a familyof multimodal networks, simultaneously takes the image and the correspondingtopological features as the input to the network for classification, thereby significantlyimproving the performance of a single CNN. This innovative approach has beenrecently successful in various applications. However, there is a lack of explanationregarding how and why topological signatures, when combined with a CNN, improvediscriminative power. In this paper, we use persistent homology to compute topologicalfeatures and subsequently demonstrate both qualitatively and quantitatively theeffects of topological signatures on a CNN model, for which the Grad-CAM analysisof multimodal networks and topological inverse image map are proposed andappropriately utilized. For experimental validation, we utilized two famous datasets:the transient versus bogus image dataset and the HAM10000 dataset. Using Grad-CAM analysis of multimodal networks, we demonstrate that topological featuresenforce the image network of a CNN to focus more on significant and meaningfulregions across images rather than task-irrelevant artifacts such as background noiseand texture.

show abstract

Section: Transient-vs-bogus Datasetmentioning

confidence: 99%

Section: Network Architecture and Training Configurationmentioning

confidence: 99%