2018 IEEE High Performance Extreme Computing Conference (HPEC) 2018
DOI: 10.1109/hpec.2018.8547629
|View full text |Cite
|
Sign up to set email alerts
|

Interactive Supercomputing on 40,000 Cores for Machine Learning and Data Analysis

Abstract: Interactive massively parallel computations are critical for machine learning and data analysis. These computations are a staple of the MIT Lincoln Laboratory Supercomputing Center (LLSC) and has required the LLSC to develop unique interactive supercomputing capabilities. Scaling interactive machine learning frameworks, such as TensorFlow, and data analysis environments, such as MATLAB/Octave, to tens of thousands of cores presents many technical challenges -in particular, rapidly dispatching many tasks throug… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
113
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4
1

Relationship

3
6

Authors

Journals

citations
Cited by 198 publications
(122 citation statements)
references
References 22 publications
0
113
0
Order By: Relevance
“…This work overcomes obstacles to improved model accuracy by employing a novel hypersparse neural network analysis of "video" stream representations of the Internet traffic ( Figure 3). Utilizing recent innovations in interactive supercomputing [29], [30], matrix-based graph theory [ in fe re n c e i n f e r e n c e t r a i n i n g t r a i n i n g sparse image of internet traffic A t Fig. 3.…”
Section: Approachmentioning
confidence: 99%
“…This work overcomes obstacles to improved model accuracy by employing a novel hypersparse neural network analysis of "video" stream representations of the Internet traffic ( Figure 3). Utilizing recent innovations in interactive supercomputing [29], [30], matrix-based graph theory [ in fe re n c e i n f e r e n c e t r a i n i n g t r a i n i n g sparse image of internet traffic A t Fig. 3.…”
Section: Approachmentioning
confidence: 99%
“…We describe the implementation of TapirXLA and the experimental setup to perform a fair comparison TapirXLA against XLA. We evaluated TapirXLA on a variety of multicore and manycore CPUs on the MIT Supercloud system [43], a heterogenous supercomputing system consisting of compute nodes with a variety of multicore and manycore processors.…”
Section: Discussionmentioning
confidence: 99%
“…We evaluated TapirXLA and XLA on all networks on a variety of multicore and manycore CPUs in the MIT Supercloud system [43], including an Intel Xeon Gold, an Intel Xeon E5, an Intel Xeon Phi, and an AMD Opteron. We followed the TensorFlow guidelines [48] and to set the threads used for intra-and inter-op thread counts equal to the number of processor cores and sockets on the system, respectively.…”
Section: Performance Comparison Of Tapirxla Versus Xlamentioning
confidence: 99%
“…The largest supercomputers currently available almost exclusively run the Linux operating system [8], [9]. Using these Linux powered supercomputers, it is possible to rapidly launch interactive applications on thousands of processors in a matter of seconds [10].…”
Section: Introductionmentioning
confidence: 99%