Experiments in spoken document retrieval using phoneme n-grams

Ng, Corinna; Wilkinson, Ross; Zobel, Justin

doi:10.1016/s0167-6393(00)00024-8

Cited by 21 publications

(18 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Therefore, the system will be better prepared for working in noisy environments, since it is able to cope not only with spelling errors, but also with out-of-vocabulary words and spelling, morphological or even historical variants Lee and Ahn, 1996;Mustafa and Al-Radaideh, 2004), in contrast with classical conflation techniques based on stemming, lemmatization or morphological analysis, which are negatively affected by these phenomena. This feature is extremely valuable, not only for regular text retrieval tasks, but also for specialized tasks such as spoken document retrieval (SDR) (Ng et al, 2000), or cross-lingual information retrieval (CLIR) over closely-related languages using no translation, but only cognate matching 4 (McNamee and Mayfield, 2004a). The third major factor for the success of n-grams in IR applications comes from their inherent language-independent nature.…”

Section: The N-gram Based Approachmentioning

confidence: 99%

Managing misspelled queries in IR applications

Vilares

Otero

2011

Information Processing & Management

View full text Add to dashboard Cite

Section: The N-gram Based Approachmentioning

confidence: 99%

Managing misspelled queries in IR applications

Vilares

Otero

2011

Information Processing & Management

View full text Add to dashboard Cite

“…Further, [195] shows that ignoring word boundaries when extracting phone-based features does not affect retrieval performance significantly.…”

Section: Phone-sequence Indexing Featuresmentioning

confidence: 99%

“…In [195], different methods for extracting overlapping phonesequence indexing features for SCR are explored in detail. This article arrives at the general conclusion that phone-based retrieval is not as effective as word-based retrieval, but there are certain situations where it is appropriate.…”

Section: Phone-sequence Indexing Featuresmentioning

confidence: 99%

“…Further, if speech recognition must be performed on a platform with limited capacity (i.e., a hand-held device), then a small language model, such as a phoneme bigram model, makes the ASR system lightweight and compact. The authors of [195] find that in terms of phone-sequence-based indexing features, a combination of phone 3-grams and 4-grams proved most effective. This result confirms the findings of [304] that phone-based features derived from word-level transcripts are able to help compensate for word-level error.…”

Section: Phone-sequence Indexing Featuresmentioning

confidence: 99%

See 1 more Smart Citation

Automatic Summarization

Larson

2012

FNT in Information Retrieval

View full text Add to dashboard Cite

“…The first one considers the transcription of speech utterances into phoneme or syllable sequences instead of word sequences by using a phoneme/syllable recognizer [8,9,10]. On the other hand, the second method proposes making use of more than the top-1 transcription hypothesis.…”

Section: Related Workmentioning

confidence: 99%

Combining Word and Phonetic-Code Representations for Spoken Document Retrieval

Reyes-Barragán

Montes-y-Gómez

Villaseñor-Pineda

2011

Computational Linguistics and Intelligent Text Processing

View full text Add to dashboard Cite

Abstract. The traditional approach for spoken document retrieval (SDR) uses an automatic speech recognizer (ASR) in combination with a word-based information retrieval method. This approach has only showed limited accuracy, partially because ASR systems tend to produce transcriptions of spontaneous speech with significant word error rate. In order to overcome such limitation we propose a method which uses word and phonetic-code representations in collaboration. The idea of this combination is to reduce the impact of transcription errors in the processing of some (presumably complex) queries by representing words with similar pronunciations through the same phonetic code. Experimental results on the CLEF-CLSR-2007 corpus are encouraging; the proposed hybrid method improved the mean average precision and the number of retrieved relevant documents from the traditional word-based approach by 3% and 7% respectively.

show abstract

Experiments in spoken document retrieval using phoneme n-grams

Cited by 21 publications

References 20 publications

Managing misspelled queries in IR applications

Managing misspelled queries in IR applications

Automatic Summarization

Combining Word and Phonetic-Code Representations for Spoken Document Retrieval

Contact Info

Product

Resources

About