2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.01070
|View full text |Cite
|
Sign up to set email alerts
|

Clusformer: A Transformer based Clustering Approach to Unsupervised Large-scale Face and Visual Landmark Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(11 citation statements)
references
References 25 publications
0
11
0
Order By: Relevance
“…However, there is still a gap in utilizing large unlabelled images and videos to learn Re-ID models in an unsupervised end-to-end manner, which is more practical in real-world applications. Therefore, we suggest there is a great research opportunity in unsupervised endto-end person re-identification, in particular, leveraging the evolutionary vision transformers [123,124,125,126].…”
Section: Discussion and Future Directionsmentioning
confidence: 99%
“…However, there is still a gap in utilizing large unlabelled images and videos to learn Re-ID models in an unsupervised end-to-end manner, which is more practical in real-world applications. Therefore, we suggest there is a great research opportunity in unsupervised endto-end person re-identification, in particular, leveraging the evolutionary vision transformers [123,124,125,126].…”
Section: Discussion and Future Directionsmentioning
confidence: 99%
“…DA-Net (Guo et al, 2020) conducts clustering by leveraging non-local context information through density-based graph. Clusformer (Nguyen et al, 2021) clusters faces with a transformer. STAR-FC (Shen et al, 2021) develops a structure-preserved sampling strategy to train the edge classification GCN.…”
Section: Related Workmentioning
confidence: 99%
“…To satisfy the two principles, the criterion is proposed according to the F β -score (Rijsbergen, 1979) in information retrieval. Similar to visual grammars (Nguyen et al, 2021), all candidate neighbours are ordered by the similarity with the probe vertex in a sequence. Given candidate neighbours of size j probed by vertex v i , its quality criterion Q (j) is defined as:…”
Section: Candidate Neighbours Quality Criterionmentioning
confidence: 99%
See 2 more Smart Citations