2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.564
|View full text |Cite
|
Sign up to set email alerts
|

End-to-End Face Detection and Cast Grouping in Movies Using Erdös-Rényi Clustering

Abstract: Figure 1: Clustering results from Hannah and Her Sisters. Each unique color shows a particular cluster. It can be seen that most individuals appear with a consistent color, indicating successful clustering. AbstractWe present an end-to-end system for detecting and clustering faces by identity in full-length movies. Unlike works that start with a predefined set of detected faces, we consider the end-to-end problem of detection and clustering together. We make three separate contributions. First, we combine a st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 37 publications
(29 citation statements)
references
References 38 publications
0
29
0
Order By: Relevance
“…To link multiple object detections across video frames into temporally consistent tracklets, we use the algorithm from Jin et al (Sec. 3 of [26]) with the MD-Net tracker [38]. Now, given a tracklet that consistently follows an object through a video sequence, when the object detector did not fire (i.e.…”
Section: Automatic Labeling Of the Target Domainmentioning
confidence: 99%
“…To link multiple object detections across video frames into temporally consistent tracklets, we use the algorithm from Jin et al (Sec. 3 of [26]) with the MD-Net tracker [38]. Now, given a tracklet that consistently follows an object through a video sequence, when the object detector did not fire (i.e.…”
Section: Automatic Labeling Of the Target Domainmentioning
confidence: 99%
“…Verification losses. Next, we analyze LDML, contrastive, and triplet losses ( Table 5 rows [10][11][12][13][14][15][16][17][18]. While these losses are often used to perform clustering, they are not designed for it [47].…”
Section: #Chmentioning
confidence: 99%
“…3. We generate tracklets using the method from [26] and show results incorporating hard positives on pedestrian and face detection in the experiments section. The manually calculated purity over 300 randomly sampled frames was 94.46% for faces and 83.13% for pedestrians.…”
Section: Extension To Hard Positive Miningmentioning
confidence: 99%