2016
DOI: 10.1109/tcyb.2015.2488592
|View full text |Cite
|
Sign up to set email alerts
|

Discrimination Between Native and Non-Native Speech Using Visual Features Only

Abstract: Abstract-Accent is a soft biometric trait that can be inferred from pronunciation and articulation patterns characterising the speaking style of an individual. Past research has addressed the task of classifying accent, as belonging to a native language speaker or a foreign language speaker, by means of the audio modality only. However, features extracted from the visual stream of speech have been successfully used to extend or substitute audio-only approaches that target speech or language recognition. Motiva… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
2
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 55 publications
(73 reference statements)
1
2
0
Order By: Relevance
“…This is exactly as expected; while shape features are capable of capturing coarse deformations related to facial expression, appearance features are efficient in encapsulating finer movements and tale-telling transient features such as bulges, wrinkles and furrows [8,74,44]. Also, SIFT outperforms DCT.…”
Section: Accepted M Manuscriptsupporting
confidence: 70%
See 1 more Smart Citation
“…This is exactly as expected; while shape features are capable of capturing coarse deformations related to facial expression, appearance features are efficient in encapsulating finer movements and tale-telling transient features such as bulges, wrinkles and furrows [8,74,44]. Also, SIFT outperforms DCT.…”
Section: Accepted M Manuscriptsupporting
confidence: 70%
“…As a matter of fact, the latter have been shown to outperform uni-modal frameworks in various related tasks such as continuous interest prediction [40,16], detection of behavioral mimicry [41], and dimensional and continuous affect prediction [39], to mention but a few. Notably, other challenging problems such as accent classification [42,43,44] and pain intensity estimation [45] have been addressed based exclusively on visual features.…”
Section: Featuresmentioning
confidence: 99%
“…(26) The respective closed-form solutions are obtained by substituting (25) and (26) into (23) or (24).…”
Section: Discussionmentioning
confidence: 99%