2012 IEEE International Conference on Multimedia and Expo Workshops 2012
DOI: 10.1109/icmew.2012.116

Bi-Modal Person Recognition on a Mobile Phone: Using Mobile Phone Data

Abstract: This paper presents a novel fully automatic bi-modal, face and speaker, recognition system which runs in real-time on a mobile phone. The implemented system runs in real-time on a Nokia N900 and demonstrates the feasibility of performing both automatic face and speaker recognition on a mobile phone. We evaluate this recognition system on a novel publicly-available mobile phone database and provide a well-defined evaluation protocol. This database was captured almost exclusively using mobile phones and aims to …

Cited by 189 publications (148 citation statements)
References: 14 publications
“…On the other hand, this gives us the opportunity to test our system on a recently published challenging dataset that satisfies our experimental setting choices. The proposed framework is evaluated in a subject-independent fashion on native and non-native speech episodes of English speech from the MOBIO Database [24].…”
Section: Dataset (mentioning)
confidence: 99%
“…Speaker-independent accent classification experiments are conducted first on continuous reading speech samples from the MOBIO Database [24], all captured by mobile phones. In that experiment, the experimental scenario is text-dependent, i.e., all speakers utter the same three-sentence paragraph.…”
Section: Introduction (mentioning)
confidence: 99%
“…We have used four publicly available datasets, namely, MOBIO [36], REPLAY-ATTACK [37], MSU-MFSD [38], and REPLAY-MOBILE [39].…”
Section: Datasets (mentioning)
confidence: 99%
“…The MOBIO dataset [36] was collected for bi-modal (voice and face) biometric verification experiments using mobile devices. Therefore, it contains only licit protocols.…”
Section: Datasets (mentioning)
confidence: 99%
“…The MOBIO database, developed by Marcel et al (2012), is a unique audio-visual corpus, as it was captured almost exclusively using mobile devices. It is composed of over 61 h of recordings of 150 speakers.…”
Section: Review Of Audio-visual Corpora (mentioning)
confidence: 99%