2023
DOI: 10.48550/arxiv.2303.11114
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage

Abstract: We need billion-scale images to achieve more generalizable and ground-breaking vision models, as well as massive dataset storage to ship the images (e.g., the LAION-4B dataset needs 240TB storage space). However, it has become challenging to deal with unlimited dataset storage with limited storage infrastructure. A number of storageefficient training methods have been proposed to tackle the problem, but they are rarely scalable or suffer from severe damage to performance. In this paper, we propose a storage-ef… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 47 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?