2021
DOI: 10.1145/3460233

Impact of On-chip Interconnect on In-memory Acceleration of Deep Neural Networks

Abstract: With the widespread use of Deep Neural Networks (DNNs), machine learning algorithms have evolved in two diverse directions—one with ever-increasing connection density for better accuracy and the other with more compact sizing for energy efficiency. The increase in connection density increases on-chip data movement, which makes efficient on-chip communication a critical function of the DNN accelerator. The contribution of this work is threefold. First, we illustrate that the point-to-point (P2P)-based interconnect…

Cited by 14 publications (4 citation statements)
References 39 publications
“…The interface between NeuroSim and popular ML frameworks such as PyTorch and TensorFlow has also been created to make it more user-friendly [95]. However, one major drawback of NeuroSim is that it assumes H-Tree based bus interconnect for inter-tile communication, which can consume up to 90% of the total energy consumption of DNN inference [96]. To overcome this issue, Krishnan et al…”
Section: That Includes An NoP For On-package Communication, NoC For On…
Citation type: mentioning
Confidence: 99%
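To make the 90% share quoted above concrete, here is a minimal, purely illustrative calculation. All numbers (per-bit interconnect energy, per-MAC energy, traffic volume) are hypothetical placeholders, not values from NeuroSim or reference [96]; the point is only to show how the communication fraction is computed and how it can dominate once moving a bit costs roughly an order of magnitude more than an in-memory MAC.

```python
# Illustrative sketch (hypothetical numbers): what fraction of total inference
# energy goes to inter-tile communication in an IMC accelerator with a bus- or
# H-Tree-style interconnect.

def interconnect_energy_share(bits_moved, energy_per_bit_pj,
                              macs, energy_per_mac_pj):
    """Return the interconnect share of total energy (0..1)."""
    e_comm = bits_moved * energy_per_bit_pj   # pJ spent moving activations
    e_comp = macs * energy_per_mac_pj         # pJ spent on in-memory MACs
    return e_comm / (e_comm + e_comp)

# Hypothetical workload: 1e9 activation bits crossing the interconnect at
# 2 pJ/bit versus 1e9 MACs at 0.2 pJ/MAC inside the crossbars.
share = interconnect_energy_share(bits_moved=1e9, energy_per_bit_pj=2.0,
                                  macs=1e9, energy_per_mac_pj=0.2)
print(f"interconnect share of total energy: {share:.0%}")  # ~91% in this toy case
```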
“…Large NN layers are assigned to multiple PEs, and the partial sums are aggregated in a global buffer (Long et al, 2019;Krishnan et al, 2021). To maximize the processing parallelism of memristor crossbars, some memristor computing systems (Zhu et al, 2020;Wan et al, 2022) proposed to directly transfer the partial outputs of a NN layer to the PEs where its next layer is located.…”
Section: Preliminaries
Citation type: mentioning
Confidence: 99%
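A minimal sketch of the partial-sum dataflow described in this excerpt, under assumed shapes and with NumPy standing in for the PEs: a large layer's weights are partitioned across several PEs, each PE produces a partial sum over its slice of the input dimension, and a global buffer accumulates the partials into the full layer output. This illustrates the general pattern, not the actual dataflow of the cited systems.

```python
import numpy as np

def split_across_pes(weights, n_pes):
    """Split a weight matrix along the input dimension across PEs."""
    return np.array_split(weights, n_pes, axis=0)

def layer_on_pes(x, weights, n_pes):
    w_slices = split_across_pes(weights, n_pes)
    x_slices = np.array_split(x, n_pes)
    # Each PE computes a partial sum over its slice of the input dimension.
    partials = [xs @ ws for xs, ws in zip(x_slices, w_slices)]
    # Global buffer: accumulate the partial sums into the full layer output.
    return np.sum(partials, axis=0)

x = np.random.rand(1024)        # input activations (hypothetical size)
w = np.random.rand(1024, 256)   # weights of one large layer
out = layer_on_pes(x, w, n_pes=4)
assert np.allclose(out, x @ w)  # matches the unpartitioned layer
```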
“…The memory accessing patterns leak the NN structure information. For some memristor computing systems, a similar layer-by-layer processing technique is used (Qiao et al, 2018;Krishnan et al, 2021). Thus, the memory accessing patterns could also be a side-channel vulnerability that adversaries can exploit in memristor computing systems.…”
Section: Thwarting Side-channel Attacks
Citation type: mentioning
Confidence: 99%
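The leakage argument can be made concrete with a toy trace. The sketch below assumes a hypothetical trace format in which an observer counts weight reads between layer boundaries; with layer-by-layer processing, each count equals in_features times out_features, so the layer shapes fall out directly. Nothing here is taken from the cited works; it only illustrates the side-channel reasoning.

```python
# Hypothetical per-layer weight-read counts observed on the memory interface.
observed_reads = [1024 * 512, 512 * 256, 256 * 10]

def infer_layer_sizes(reads, input_dim):
    """Recover (in_features, out_features) per layer from read counts."""
    layers, d_in = [], input_dim
    for r in reads:
        d_out = r // d_in          # reads per layer = in_features * out_features
        layers.append((d_in, d_out))
        d_in = d_out
    return layers

print(infer_layer_sizes(observed_reads, input_dim=1024))
# [(1024, 512), (512, 256), (256, 10)] -> the hidden network topology leaks
```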
“…Thus, RRAM and SRAM-based IMC accelerators have been proposed for DNNs in the literature [19,25]. However, IMC increases on-chip data volume, which increases latency and energy due to on-chip communication [26][27][28][29]. The high density and complexity of GCNs make the on-chip communication for IMC-based accelerators even more critical.…”
Section: Related Work
Citation type: mentioning
Confidence: 99%
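A rough sketch of why IMC shifts the bottleneck toward communication: weights stay resident in the crossbars, so the bits crossing the on-chip interconnect are dominated by inter-layer activations. The layer sizes and bit width below are hypothetical placeholders, not figures from the cited DNN or GCN accelerators.

```python
def activation_traffic_bits(layer_output_sizes, bits_per_activation=8):
    """Total activation bits moved over the interconnect between layers."""
    return sum(n * bits_per_activation for n in layer_output_sizes)

# Hypothetical output sizes (elements) of three consecutive layers.
layer_outputs = [64 * 56 * 56, 128 * 28 * 28, 256 * 14 * 14]
bits = activation_traffic_bits(layer_outputs)
print(f"{bits / 8 / 1024:.0f} KiB of activations cross the on-chip interconnect")
```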