2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854519
|View full text |Cite
|
Sign up to set email alerts
|

Visual-only discrimination between native and non-native speech

Abstract: Accent is an important biometric characteristic that is defined by the presence of specific traits in the speaking style of an individual. These are identified by patterns in the speech production system, such as those present in the vocal tract or in lip movements. Evidence from linguistics and speech processing research suggests that visual information enhances speech recognition. Intrigued by these findings, along with the assumption that visually perceivable accent-related patterns are transferred from the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…As a matter of fact, the latter have been shown to outperform uni-modal frameworks in various related tasks such as continuous interest prediction [40,16], detection of behavioral mimicry [41], and dimensional and continuous affect prediction [39], to mention but a few. Notably, other challenging problems such as accent classification [42,43,44] and pain intensity estimation [45] have been addressed based exclusively on visual features.…”
Section: Featuresmentioning
confidence: 99%
“…As a matter of fact, the latter have been shown to outperform uni-modal frameworks in various related tasks such as continuous interest prediction [40,16], detection of behavioral mimicry [41], and dimensional and continuous affect prediction [39], to mention but a few. Notably, other challenging problems such as accent classification [42,43,44] and pain intensity estimation [45] have been addressed based exclusively on visual features.…”
Section: Featuresmentioning
confidence: 99%
“…The lip contour extraction of curves is based on two formulas (3) and (4). Formula (3) presents the lip curve y 1 while formula (4) presents the lip curve y 2 .…”
Section: Face and Lip Detection Processmentioning
confidence: 99%
“…Here, three techniques have been employed for visual feature detection and description. Such techniques are SURF [3] (speeded up robust features), HoG [4] (histogram of oriented gradient) and Haar [5] for extracting lip contour features from visual source. SURF technique works by transforming the source image into coordinates using the integral image algorithm on ECG signals [6], which rapidly calculates summations of pixels over image sub-regions in constant time.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, three techniques have been employed for visual feature detection and description. Such techniques are SURF [3](speeded up robust features), HoG [4] (histogram of oriented gradient) and Haar [5]for extracting lip contour features from visual source. These techniques have been chosen based on their remarkable efficacy and reported accuracy in different domains, for example in [6], [7] and [8].…”
Section: Introductionmentioning
confidence: 99%