New techniques for open-vocabulary spoken document retrieval

Wechsler, Martin; Munteanu, Eugen; Schäuble, Peter

doi:10.1145/290941.290950

Cited by 38 publications

(32 citation statements)

References 9 publications

“…For this collection, another 2580 words were added to the pronunciation dictionary to transform all words, including query words, to phoneme sequences. In practice, this is done via an automated process [23]. Here, four versions of the transcriptions were used for retrieval:…”

Section: SDR Collection from TREC-7 (mentioning)

confidence: 99%

“…The rationale for this is that the incorrect transcription may be able to match relevant documents, which may also contain the incorrect transcription. This method is similar in concept to using a confusion matrix based approach by Wechsler [23] on the training collection, which can be used to determine which recognised phoneme is most likely to be recognised incorrectly as another. This technique, though not 100% accurate, is the only feasible approach for a larger collection.…”

Section: Experimental Questions (mentioning)

confidence: 99%

“…Previous experiments by Wechsler [23,24], Ng and Zue [12,13,14] and Smeaton et al [19] on n-gram retrieval from phoneme transcriptions obtained directly from a phoneme recogniser showed that phoneme n-gram retrieval can be effective in practice. Other experiments have shown that phoneme retrieval can be used to complement word retrieval [25] when word recognition has failed, especially in situations where names and unknown words are mis-recognised.…”

Section: Introduction (mentioning)

confidence: 99%

Experiments in spoken document retrieval using phoneme n-grams

Wilkinson

Zobel

2000

Speech Communication

“…Another work [24] suggests to guide word-embeddings with morphologically annotated data and shows achievement using German in a case study. Also, many papers study syllable-based n-gram methods to model language [25,26].…”

Section: Introduction (mentioning)

confidence: 99%

Towards Learning Word Representation

Wiercioch

2017

Abstract. Continuous vector representations, as distributed representations of words, have gained a lot of attention in the Natural Language Processing (NLP) field. Although they are considered valuable methods for modeling both semantic and syntactic features, they may still be improved. For instance, an open issue is how to develop strategies that introduce knowledge about the morphology of words. This is a core point both for dense languages, where many rare words appear, and for texts that contain numerous metaphors or similes. In this paper, we extend a recent approach to representing word information. The underlying idea of our technique is to present a word in the form of a bag of syllable and letter n-grams. More specifically, we provide a vector representation for each extracted syllable-based and letter-based n-gram, and perform concatenation. Moreover, in contrast to the previous method, we accept n-grams of varied length n. Further experiments on tasks such as word similarity ranking and sentiment analysis show that our method is competitive with other state-of-the-art techniques and takes a step toward more informative word representation construction.

“…-spoken document retrieval, in which written queries are used to search speech (e.g., broadcast news audio) archives for relevant speech information [5,6,15,16,17,19,20], -speech-driven (spoken query) retrieval, in which spoken queries are used to retrieve relevant textual information [2,3].…”

Section: Introduction (mentioning)

confidence: 99%

Speech-Driven Text Retrieval: Using Target IR Collections for Statistical Language Model Adaptation in Speech Recognition

Fujii

Itou

Ishikawa

2002

Information Retrieval Techniques for Speech Applications

Abstract. Speech recognition has of late become a practical technology for real world applications. Aiming at speech-driven text retrieval, which facilitates retrieving information with spoken queries, we propose a method to integrate speech recognition and retrieval methods. Since users speak contents related to a target collection, we adapt statistical language models used for speech recognition based on the target collection, so as to improve both the recognition and retrieval accuracy. Experiments using existing test collections combined with dictated queries showed the effectiveness of our method.
