2022
DOI: 10.1007/978-3-031-16014-1_59
|View full text |Cite
|
Sign up to set email alerts
|

TT-ViT: Vision Transformer Compression Using Tensor-Train Decomposition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…Image Pre-processing: The original size of input images is 600x400 and consists of red, green, and blue (RGB) color. These images are first pre-processed through three stages namely: reshape [31], rescale [32], and then conversion into tensors [33]. The images are reshaped to 224x224 dimensions (shown in Figure 3 In the next step all the images were rescaled, in this process each pixel of input is rescaled from initial range of 0-255 to 0-1 by dividing every pixel value to 255 as given in equation.…”
Section: Dataset and Environment Setupmentioning
confidence: 99%
“…Image Pre-processing: The original size of input images is 600x400 and consists of red, green, and blue (RGB) color. These images are first pre-processed through three stages namely: reshape [31], rescale [32], and then conversion into tensors [33]. The images are reshaped to 224x224 dimensions (shown in Figure 3 In the next step all the images were rescaled, in this process each pixel of input is rescaled from initial range of 0-255 to 0-1 by dividing every pixel value to 255 as given in equation.…”
Section: Dataset and Environment Setupmentioning
confidence: 99%