Non-Gaussian information from weak lensing data via deep learning

Gupta, Arushi; Matilla, José Manuel Zorrilla; Hsu, Daniel; Haiman, Zoltán

doi:10.1103/physrevd.97.103515

Cited by 115 publications

(105 citation statements)

References 54 publications

Supporting

Mentioning

104

Contrasting

Order By: Relevance

“…The convolutional neural network (CNN) extracts the characterising features directly from the pixel data of the training mass maps. We have experimented with a number of architectures, including classic topologies which implement a large number of 3 × 3 convolutions inspired by VGG-net (Simonyan & Zisserman 2014), as well as architectures presented in Ravanbakhsh et al (2017) and Gupta et al (2018). The model that worked best for our purposes is almost exclusively based on the Inception layers first presented in Szegedy et al (2014).…”

Section: Convolutional Neural Networkmentioning

confidence: 99%

On the dissection of degenerate cosmologies with machine learning

Merten¹,

Giocoli

Baldi

et al. 2019

Monthly Notices of the Royal Astronomical Society

View full text Add to dashboard Cite

Based on the DUSTGRAIN-pathfinder suite of simulations, we investigate observational degeneracies between nine models of modified gravity and massive neutrinos. Three types of machine learning techniques are tested for their ability to discriminate lensing convergence maps by extracting dimensional reduced representations of the data. Classical map descriptors such as the power spectrum, peak counts and Minkowski functionals are combined into a joint feature vector and compared to the descriptors and statistics that are common to the field of digital image processing. To learn new features directly from the data we use a Convolutional Neural Network (CNN). For the mapping between feature vectors and the predictions of their underlying model, we implement two different classifiers; one based on a nearest-neighbour search and one that is based on a fully connected neural network. We find that the neural network provides a much more robust classification than the nearest-neighbour approach and that the CNN provides the most discriminating representation of the data. It achieves the cleanest separation between the different models and the highest classification success rate of 59% for a single source redshift. Once we perform a tomographic CNN analysis, the total classification accuracy increases significantly to 76% with no observational degeneracies remaining. Visualising the filter responses of the CNN at different network depths provides us with the unique opportunity to learn from very complex models and to understand better why they perform so well.

show abstract

Section: Convolutional Neural Networkmentioning

confidence: 99%

On the dissection of degenerate cosmologies with machine learning

Merten¹,

Giocoli

Baldi

et al. 2019

Monthly Notices of the Royal Astronomical Society

View full text Add to dashboard Cite

show abstract

“…One approach is to apply a standard 2D CNN to a grid discretisation of the sphere [20][21][22]. An alternative is to divide the sphere into small chunks and project those on flat 2D surfaces [9,11,12,23].…”

Section: Introductionmentioning

confidence: 99%

DeepSphere: Efficient spherical convolutional neural network with HEALPix sampling for cosmological applications

Perraudin

Defferrard

Kacprzak

et al. 2019

Astronomy and Computing

158

View full text Add to dashboard Cite

Convolutional Neural Networks (CNNs) are a cornerstone of the Deep Learning toolbox and have led to many breakthroughs in Artificial Intelligence. So far, these neural networks (NNs) have mostly been developed for regular Euclidean domains such as those supporting images, audio, or video. Because of their success, CNN-based methods are becoming increasingly popular in Cosmology. Cosmological data often comes as spherical maps, which make the use of the traditional CNNs more complicated. The commonly used pixelization scheme for spherical maps is the Hierarchical Equal Area isoLatitude Pixelisation (HEALPix). We present a spherical CNN for analysis of full and partial HEALPix maps, which we call DeepSphere. The spherical CNN is constructed by representing the sphere as a graph. Graphs are versatile data structures that can represent pairwise relationships between objects or act as a discrete representation of a continuous manifold. Using the graph-based representation, we define many of the standard CNN operations, such as convolution and pooling. With filters restricted to being radial, our convolutions are equivariant to rotation on the sphere, and DeepSphere can be made invariant or equivariant to rotation. This way, DeepSphere is a special case of a graph CNN, tailored to the HEALPix sampling of the sphere. This approach is computationally more efficient than using spherical harmonics to perform convolutions. We demonstrate the method on a classification problem of weak lensing mass maps from two cosmological models and compare its performance with that of three baseline classifiers, two based on the power spectrum and pixel density histogram, and a classical 2D CNN. Our experimental results show that the performance of DeepSphere is always superior or equal to the baselines. For high noise levels and for data covering only a smaller fraction of the sphere, DeepSphere achieves typically 10% better classification accuracy than the baselines. Finally, we show how learned filters can be visualized to introspect the NN.Code and examples are available at https://github.com/SwissDataScienceCenter/DeepSphere.

show abstract

“…Due to nonlinearities on small scales, the traditional analysis with two-point statistics does not fully capture all the underlying information [5]. Multiple inference methods were proposed to extract more details based on higher order statistics [6, 7], peak statistics [8][9][10][11][12][13], Minkowski functionals [14-16] and recently convolutional neural networks (CNN) [17,18]. Here we present an improved convolutional neural network that gives significantly better estimates of Ω m and σ 8 cosmological parameters from simulated convergence maps than the state of art methods and also is free of systematic bias.…”

mentioning

confidence: 99%

“…The proposed scheme is even more accurate than the neural network on high-resolution noiseless maps. With shape noise and lower resolution its relative advantage deteriorates, but it remains more accurate than peak counting.Following the idea and using the simulation data from a recent study [18] we created an improved convolutional neural network (CNN) architecture (see details in the Methods) which is able to recover cosmological parameters more accurately from simulated weak lensing maps. The input of the network is a set of mock convergence (κ) maps generated by ray-tracing n-body simulations with 96 different values for the matter density Ω m and the scale of the initial perturbations normalized at the late Universe, σ 8 (see [18] and [19] for details of the weak lensing map generation), the outputs of the network were the predicted cosmological parameters.…”

mentioning

confidence: 99%

An improved cosmological parameter inference scheme motivated by deep learning

2018

View full text Add to dashboard Cite

Dark matter cannot be observed directly, but its weak gravitational lensing slightly distorts the apparent shapes of background galaxies, making weak lensing one of the most promising probes of cosmology. Several observational studies have measured the effect, and there are currently running [1,2], and planned efforts [3,4] to provide even larger, and higher resolution weak lensing maps. Due to nonlinearities on small scales, the traditional analysis with two-point statistics does not fully capture all the underlying information [5]. Multiple inference methods were proposed to extract more details based on higher order statistics [6, 7], peak statistics [8][9][10][11][12][13], Minkowski functionals [14-16] and recently convolutional neural networks (CNN) [17,18]. Here we present an improved convolutional neural network that gives significantly better estimates of Ω m and σ 8 cosmological parameters from simulated convergence maps than the state of art methods and also is free of systematic bias. We show that the network exploits information in the gradients around peaks, and with this insight, we construct a new, easy-to-understand, and robust peak counting algorithm based on the 'steepness' of peaks, instead of their heights. The proposed scheme is even more accurate than the neural network on high-resolution noiseless maps. With shape noise and lower resolution its relative advantage deteriorates, but it remains more accurate than peak counting.Following the idea and using the simulation data from a recent study [18] we created an improved convolutional neural network (CNN) architecture (see details in the Methods) which is able to recover cosmological parameters more accurately from simulated weak lensing maps. The input of the network is a set of mock convergence (κ) maps generated by ray-tracing n-body simulations with 96 different values for the matter density Ω m and the scale of the initial perturbations normalized at the late Universe, σ 8 (see [18] and [19] for details of the weak lensing map generation), the outputs of the network were the predicted cosmological parameters. The modifications of the CNN mostly consisted of adding further activations, increasing the number of filters, and introducing a regular block structure, following successful computer vision models [20,21].

show abstract

Non-Gaussian information from weak lensing data via deep learning

Cited by 115 publications

References 54 publications

On the dissection of degenerate cosmologies with machine learning

On the dissection of degenerate cosmologies with machine learning

DeepSphere: Efficient spherical convolutional neural network with HEALPix sampling for cosmological applications

An improved cosmological parameter inference scheme motivated by deep learning

Contact Info

Product

Resources

About