George Stelle scite author profile

George Stelle

5 Publications

18 Citation Statements Received

80 Citation Statements Given

Affiliations

Los Alamos National Laboratory, University of New Mexico, Sandia National Laboratories California

Publications

Modeling Internet-Scale Policies for Cleaning up Malware

Hofmeyr, Moore, Forrest, et al. 2012

An emerging consensus among policy makers is that interventions undertaken by Internet Service Providers are the best way to counter the rising incidence of malware. However, assessing the suitability of countermeasures at this scale is hard. In this paper, we use an agent-based model, called ASIM, to investigate the impact of policy interventions at the Autonomous System level of the Internet. For instance, we find that coordinated intervention by the 0.2%-biggest ASes is more effective than uncoordinated efforts adopted by 30% of all ASes. Furthermore, countermeasures that block malicious transit traffic appear more effective than ones that block outgoing traffic. The model allows us to quantify and compare positive externalities created by different countermeasures. Our results give an initial indication of the types and levels of intervention that are most cost-effective at large scale.

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout

Kim, Rajamanickam, Stelle, et al. 2016

We introduce a task-parallel algorithm for sparse incomplete Cholesky factorization that utilizes a 2D sparse partitioned-block layout of a matrix. Our factorization algorithm follows the idea of algorithms-by-blocks by using the block layout. The algorithm-by-blocks approach induces a task graph for the factorization. These tasks are inter-related to each other through their data dependences in the factorization algorithm. To process the tasks on various manycore architectures in a portable manner, we also present a portable tasking API that incorporates different tasking backends and device-specific features using an open-source framework for manycore platforms, i.e., Kokkos. A performance evaluation is presented on both Intel Sandybridge and Xeon Phi platforms for matrices from the University of Florida sparse matrix collection to illustrate merits of the proposed task-based factorization. Experimental results demonstrate that our task-parallel implementation delivers about 26.6x speedup (geometric mean) over single-threaded incomplete Cholesky-by-blocks and 19.2x speedup over serial Cholesky performance which does not carry tasking overhead using 56 threads on the Intel Xeon Phi processor for sparse matrices arising from various application problems.

Using a Complementary Emulation-Simulation Co-Design Approach to Assess Application Readiness for Processing-in-Memory Systems

Stelle, Olivier, Stark, et al. 2014

Disruptive changes to computer architecture are paving the way toward extreme scale computing. The co-design strategy of collaborative research and development among computer architects, system software designers, and application teams can help to ensure that applications not only cope but thrive with these changes. In this paper, we present a novel combined co-design approach of emulation and simulation in the context of investigating future Processing in Memory (PIM) architectures. PIM enables co-location of data and computation to decrease data movement, to provide increases in memory speed and capacity compared to existing technologies and, perhaps most importantly for extreme scale, to improve energy efficiency. Our evaluation of PIM focuses on three mini-applications representing important production applications. The emulation and simulation studies examine the effects of locality-aware versus locality-oblivious data distribution and computation, and they compare PIM to conventional architectures. Both studies contribute in their own way to the overall understanding of the application-architecture interactions, and our results suggest that PIM technology shows great potential for efficient computation without negatively impacting productivity.

2014 Hardware-Software Co-Design for High Performance Computing

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout

Kim, Rajamanickam, Stelle, et al. 2016

Preprint

Scheduling Chapel Tasks with Qthreads on Manycore

Evans, Olivier, Barrett, et al. 2017
