2014
DOI: 10.1016/j.csl.2014.02.006
|View full text |Cite
|
Sign up to set email alerts
|

Improving recognition of proper nouns in ASR through generating and filtering phonetic transcriptions

Abstract: Accurate phonetic transcription of proper nouns can be an important resource for commercial applications that embed speech technologies, such as audio indexing and vocal phone directory lookup. However, accurate phonetic transcription is more difficult to obtain for proper nouns than for regular words. Indeed, phonetic transcription of a proper noun depends on both the origin of the speaker pronouncing it and the origin of the proper noun itself.This work proposes a method that allows the extraction of phoneti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 24 publications
(32 reference statements)
0
5
0
Order By: Relevance
“…AER is used as a metric for the entire system assessment. Inspired by WER [9], AER measures the minimum number of operations, i.e., substitutions, deletions, and insertions, to make a word under evaluation match the reference. The AER is calculated as follows:…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…AER is used as a metric for the entire system assessment. Inspired by WER [9], AER measures the minimum number of operations, i.e., substitutions, deletions, and insertions, to make a word under evaluation match the reference. The AER is calculated as follows:…”
Section: Discussionmentioning
confidence: 99%
“…This finding may help bring awareness when a pipeline is designed. Regarding the evaluation of the transcription, we adopt a concept of Word Error Rate (WER) [9] to propose AER for the overall performance index.…”
Section: Automatic Thai Finger Spelling Transcriptionmentioning
confidence: 99%
“…The conversion process needs rules or learning from the seed data given during the process. The data-driven approach derives the pronunciation data from the seed data (Laurent et al, 2014). This study focuses on data-driven modelling techniques, which will be explained further in Data-Driven G2P techniques.…”
Section: Grapheme-to-phoneme Conversionmentioning
confidence: 99%
“…Proper nouns have been identified as a challenging problem in ASR for a while now [4]. Recently some approaches have arisen to tackle this challenge with E2E ASR using a specialised architecture and losses [5] or using specific data and training procedures to better represent contextual information [6].…”
Section: Introductionmentioning
confidence: 99%