2021 IEEE Hot Chips 33 Symposium (HCS) 2021
DOI: 10.1109/hcs52781.2021.9567191
|View full text |Cite
|
Sign up to set email alerts
|

Aquabolt-XL: Samsung HBM2-PIM with in-memory processing for ML accelerators and beyond

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…There are also variety of prior works that leverage PIM for GEMV operations [25,40,44,48,49,68,83] due to their inherent potential in benefits towards bandwidth bound applications. However, none of these works enable simultaneous execution of PIM and NPU operations, necessary for the efficient execution of LLM inference.…”
Section: Discussionmentioning
confidence: 99%
“…There are also variety of prior works that leverage PIM for GEMV operations [25,40,44,48,49,68,83] due to their inherent potential in benefits towards bandwidth bound applications. However, none of these works enable simultaneous execution of PIM and NPU operations, necessary for the efficient execution of LLM inference.…”
Section: Discussionmentioning
confidence: 99%
“…NPUs: Examples of NPUs include Google's Tensor Processing Unit (TPU) [20], tensor cores in NVIDIA A100 Ampere architecture, Samsung NPU [21], Sambanova's RDU [22], IBM's AI Accelerator [23], Microsoft Brainwave [24], Tesla's Self-Driving computer [25], Facebook's ML accelerator [26], etc. NPU architectures can be standalone, a co-processor, or a near-data processing engine [27]- [29]. Most NPUs are spatial architectures (e.g., Fig.…”
Section: A Npu Design Requirements and Challengesmentioning
confidence: 99%
“…NPUs: Examples of NPUs include Google's Tensor Processing Unit (TPU) [20], tensor cores in NVIDIA A100 Ampere architecture, Samsung NPU [21], Sambanova's RDU [22], IBM's AI Accelerator [23], Microsoft Brainwave [24], Tesla's Self-Driving computer [25], Facebook's ML accelerator [26], etc. NPU architectures can be standalone, a co-processor, or a near-data processing engine [27]- [29]. Most NPUs are spatial architectures (e.g., Fig.…”
Section: A Npu Design Requirements and Challengesmentioning
confidence: 99%