2016
DOI: 10.1007/978-3-319-46448-0_44
|View full text |Cite
|
Sign up to set email alerts
|

Embedding Deep Metric for Person Re-identification: A Study Against Large Variations

Abstract: Person re-identification is challenging due to the large variations of pose, illumination, occlusion and camera view. Owing to these variations, the pedestrian data is distributed as highly-curved manifolds in the feature space, despite the current convolutional neural networks (CNN)'s capability of feature extraction. However, the distribution is unknown, so it is difficult to use the geodesic distance when comparing two samples. In practice, the current deep embedding methods use the Euclidean distance for t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
196
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 248 publications
(196 citation statements)
references
References 41 publications
0
196
0
Order By: Relevance
“…These global metrics [16][17][18][19] project features into low dimension subspace where they tend to maximize the discrimination among different persons; however, these metrics still suffer a great challenge from impostor (an impostor is a person that belongs to the other person and, however, possess higher similarity with the given query than the right Gallery sample) samples [20,21]. Though, in past some attempts are made to eliminate impostors [14,[20][21][22], however, all these attempts have not given due consideration of different transform modals on which the reidentification images lie [23].…”
Section: Introductionmentioning
confidence: 99%
“…These global metrics [16][17][18][19] project features into low dimension subspace where they tend to maximize the discrimination among different persons; however, these metrics still suffer a great challenge from impostor (an impostor is a person that belongs to the other person and, however, possess higher similarity with the given query than the right Gallery sample) samples [20,21]. Though, in past some attempts are made to eliminate impostors [14,[20][21][22], however, all these attempts have not given due consideration of different transform modals on which the reidentification images lie [23].…”
Section: Introductionmentioning
confidence: 99%
“…Compared to first-order alternatives, our energy function is more robust against misalignment between sketch and photo channels, and can accommodate better the more detailed but noisier fine-grained feature map representation. Mahalanobis distance [41,35] is another example of a higher-order energy function in that it does O(N 2 ) comparisons for N channels. However it is based on elementwise difference followed by bilinear product so the effect is to learn which dimension pairs are important to match, rather than compensate for misalignment and noise between the input vectors.…”
Section: Shortcuts and Layer Fusion In Deep Learningmentioning
confidence: 99%
“…They have been used mainly for multi-view fusion, for example, fusing the text and image embeddings in visual question answering [21] and zero-shot recognition [4]. Outer product based distance is also used for formulating higher-order losses in Mahalanobis metric learning [41,35]. Given two vectors x and y, a Mahalanobis distance is defined as:…”
Section: Ranking Scorementioning
confidence: 99%
“…Ahmed et al [15] presented a deep convolutional architecture that captured local relationships between person images based on mid-level features. Generally, deep learning is usually utilized to learn feature representations by using deep convolutional features [14][15][16][17] or from the fully connected features [18][19][20] in person re-identification works.…”
Section: Related Workmentioning
confidence: 99%
“…(i) Feature construction and learning aim at designing or studying discriminative appearance descriptions [8][9][10][11][12][13][14][15][16][17][18][19][20] that are robust for distinguishing different pedestrians across arbitrary cameras. However, handcrafted feature construction is extremely challenging due to miscellaneous and complicated variations.…”
Section: Introductionmentioning
confidence: 99%