GPU-Accelerated Machine Learning Inference as a Service for Computing in Neutrino Experiments
2021 · DOI: 10.3389/fdata.2020.604083

Abstract: Machine learning algorithms are becoming increasingly prevalent and performant in the reconstruction of events in accelerator-based neutrino experiments. These sophisticated algorithms can be computationally expensive. At the same time, the data volumes of such experiments are rapidly increasing. The demand to process billions of neutrino events with many machine learning algorithm inferences creates a computing challenge. We explore a computing model in which heterogeneous computing with GPU coprocessors is m…
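
The computing model in the abstract places the GPU-backed network behind a remote inference server that experiment jobs query over the network; the published work uses NVIDIA Triton Inference Server in this role. The sketch below shows what such a client call can look like in Python. The server URL, model name, batch shape, and tensor names are illustrative assumptions, not the experiment's actual configuration.

    import numpy as np
    import tritonclient.grpc as grpcclient  # pip install tritonclient[grpc]

    # Connect to a remote Triton inference server (URL is a placeholder).
    client = grpcclient.InferenceServerClient(url="triton.example.org:8001")

    # One batch of detector images; batch size, shape, and tensor names
    # are assumptions for illustration only.
    batch = np.random.rand(16, 48, 48, 3).astype(np.float32)
    inputs = [grpcclient.InferInput("input_0", list(batch.shape), "FP32")]
    inputs[0].set_data_from_numpy(batch)
    outputs = [grpcclient.InferRequestedOutput("output_0")]

    # The client blocks only for the network round trip; the GPU lives
    # server-side and is shared by many CPU-only client jobs.
    result = client.infer(model_name="classifier", inputs=inputs, outputs=outputs)
    scores = result.as_numpy("output_0")
    print(scores.shape)

Because the client is pure CPU code, many grid jobs can share a small, elastically scaled pool of remote GPUs, which is the argument the abstract is making.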

Cited by 21 publications (16 citation statements)
References 29 publications
“…With ML becoming increasingly common in neutrino experiments, the community is further steering its attention toward hardware acceleration of ML-based inference. GPU-accelerated ML inference as a service for computing in neutrino experiments is discussed in [229], while new developments are also targeting GPU- or FPGA-based acceleration for use of machine learning algorithms such as 1D or 2D CNNs in real-time or online processing of raw LArTPC data at the data acquisition and trigger level [197][198][199].…”
Section: B. Neutrino Experiments
confidence: 99%
“…Other heterogeneous computing resources specialized for inference may be even more beneficial. This speed-up may benefit the experiments' computing workflows by accessing these resources as an on-demand, scalable service [46][47][48].…”
Section: Inference Timing
confidence: 99%
“…Significant motivation is taken from the integration of these concepts for usage in high energy physics (HEP), where recent algorithmic advances and the availability of large datasets have greatly facilitated the adoption of ML. Previous work with experiments at the CERN Large Hadron Collider and the ProtoDUNE-SP experiment at the Fermi National Accelerator Laboratory has shown that the as-a-service computing model has the potential to offer impressive speed-ups, improved performance, and reduced complexity relative to traditional computing models (Krupa 2021; Rankin et al. 2020; Wang et al. 2020). These works have also demonstrated the ability to perform inference as-a-service with both GPUs and FPGAs.…”
Section: Appendix
confidence: 99%
“…In order to take full advantage of accelerators, modifications must be made to the standard model of computing, in which pipelines directly manage the accelerated resources they use for execution. An alternative model, which has gained popularity in other fields, is called "as-a-service" (Krupa 2021; Wang et al. 2020). When used specifically to denote accelerated ML inference, it is referred to as Inference-as-a-Service (IaaS).…”
confidence: 99%
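
To make the IaaS contrast concrete: the pipeline holds no GPU context at all; it only issues inference requests, possibly many in flight at once, and the service handles batching, scheduling, and scaling behind a fixed endpoint. A hedged sketch using Triton's HTTP client follows; the URL, model name, and tensor names are hypothetical.

    import numpy as np
    import tritonclient.http as httpclient  # pip install tritonclient[http]

    # No accelerator is managed here; "concurrency" only sizes the HTTP
    # connection pool so several requests can be in flight at once.
    client = httpclient.InferenceServerClient(url="triton.example.org:8000",
                                              concurrency=8)

    def make_inputs(events):
        # Tensor name and dtype are illustrative assumptions.
        inp = httpclient.InferInput("input_0", list(events.shape), "FP32")
        inp.set_data_from_numpy(events)
        return [inp]

    # Fire off eight requests without waiting on each other; the server is
    # free to batch them together and to scale GPU instances independently
    # of the client pipeline.
    pending = [
        client.async_infer("classifier",
                           make_inputs(np.random.rand(16, 48, 48, 3)
                                       .astype(np.float32)))
        for _ in range(8)
    ]
    results = [p.get_result().as_numpy("output_0") for p in pending]

The design point is decoupling: the experiment's event loop scales by adding cheap CPU clients, while inference capacity scales by adding servers, with no change to pipeline code.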