2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00372
|View full text |Cite
|
Sign up to set email alerts
|

Hessian-Aware Pruning and Optimal Neural Implant

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 36 publications
0
10
0
Order By: Relevance
“…It is easy to see this through a second-order Taylor series expansion, where the perturbation is dependent on not just the weight magnitude but also the Hessian (LeCun et al, 1990 ). As such there are several works that use second-order based pruning (LeCun et al, 1990 ; Hassibi and Stork, 1993 ; Hassibi et al, 1993 ; Wang et al, 2019a ; Yu et al, 2021 ).…”
Section: Technology State-of-the-artmentioning
confidence: 99%
“…It is easy to see this through a second-order Taylor series expansion, where the perturbation is dependent on not just the weight magnitude but also the Hessian (LeCun et al, 1990 ). As such there are several works that use second-order based pruning (LeCun et al, 1990 ; Hassibi and Stork, 1993 ; Hassibi et al, 1993 ; Wang et al, 2019a ; Yu et al, 2021 ).…”
Section: Technology State-of-the-artmentioning
confidence: 99%
“…It reduces the DNN model size and lower computation cost by replacing the floating point weights with low precision fixed-point data. Common quantization methods including directly apply uniform quantizers [24], [25], quantization-aware fine-tuning [26] and mixedprecision quantization [7], [8], [27]. Quantization can largely decrease DNN's arithmetic intensity but still cause significant accuracy degradation for ultra-low data precision.…”
Section: B Model Compressionmentioning
confidence: 99%
“…Finally, Q-BERT (Shen et al, 2020) employed approximate information about the Hessian spectrum in order to choose the quantization bit-widths applied to each layer. Follow-up work (Yu et al, 2022) applied a similar approach to structured pruning in the context of convolutional and languange models, using an approximation of the Hessian trace to decide which layers should be pruned. We note that this approach is quite different from the one we employ here, as we use completely different inverse-Hessian approximations to perform pruning decisions.…”
Section: Background and Related Workmentioning
confidence: 99%