2021
DOI: 10.48550/arxiv.2102.00160
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Model Compression based on the Training History

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…VGG16 is a kind of Visual Geometry Group convolutional neural network, and this model was presented by K. Simonyan and A. Zisserman from the University of Oxford in the paper "Very Deep Convolutional Networks for Large-Scale Image Recognition." The VGG-16 network accomplishes 92.7% top-5 train and test accuracy in ImageNet, a dataset of over 14 million images of 1000 classes [34]. In VGG-16, more kernels are changed with the different numbers of 3 Γ— 3 filters to extract complex features cheaply.…”
Section: Vgg-16mentioning
confidence: 99%
“…VGG16 is a kind of Visual Geometry Group convolutional neural network, and this model was presented by K. Simonyan and A. Zisserman from the University of Oxford in the paper "Very Deep Convolutional Networks for Large-Scale Image Recognition." The VGG-16 network accomplishes 92.7% top-5 train and test accuracy in ImageNet, a dataset of over 14 million images of 1000 classes [34]. In VGG-16, more kernels are changed with the different numbers of 3 Γ— 3 filters to extract complex features cheaply.…”
Section: Vgg-16mentioning
confidence: 99%
“…Figure 3(b) depicts the FLOPS performed by the neural networks, as measured using the profiler from e-Lab 2 .FLOPS per convolution is given by 𝐹𝐿𝑂𝑃 π‘π‘œπ‘›π‘£ = 𝐹 β€’ 𝐹 β€’ 𝐢 𝑖𝑛 β€’ 𝐻 π‘œπ‘’π‘‘ β€’ π‘Š π‘œπ‘’π‘‘ β€’ 𝐢 π‘œπ‘’π‘‘ , where 𝐹 β€’ 𝐹 is the spatial dimension of the filter, 𝐢 𝑖𝑛 and 𝐢 π‘œπ‘’π‘‘ are the number of input and output channels, respectively, and 𝐻 π‘œπ‘’π‘‘ β€’ π‘Š π‘œπ‘’π‘‘ is the output shape[45]. As the number of patches (𝑁 𝑝 ) increases, the size of each patch decreases quadratically, (β„Žπ‘– 𝑠𝑖𝑧𝑒 𝑁 𝑝 ⁄ )2 .…”
mentioning
confidence: 99%