2015
DOI: 10.1109/taslp.2015.2438543
|View full text |Cite
|
Sign up to set email alerts
|

Spoken Content Retrieval—Beyond Cascading Speech Recognition with Text Retrieval

Abstract: Spoken content retrieval refers to directly indexing and retrieving spoken content based on the audio rather than text descriptions. This potentially eliminates the requirement of producing text descriptions for multimedia content for indexing and retrieval purposes, and is able to precisely locate the exact time the desired information appears in the multimedia. Spoken content retrieval has been very successfully achieved with the basic approach of cascading automatic speech recognition (ASR) with text inform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
58
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 100 publications
(58 citation statements)
references
References 289 publications
(299 reference statements)
0
58
0
Order By: Relevance
“…All the queries are distributed across all the speakers such that at least one speaker contains at least one query. The performance of QbE-STD is measured in terms of precision@N (p@N) and Mean Average Precision (MAP) [2]. The value of N varies according to the query (from 7 to 20).…”
Section: A Experimental Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…All the queries are distributed across all the speakers such that at least one speaker contains at least one query. The performance of QbE-STD is measured in terms of precision@N (p@N) and Mean Average Precision (MAP) [2]. The value of N varies according to the query (from 7 to 20).…”
Section: A Experimental Datasetmentioning
confidence: 99%
“…QbE-STD directly exploits the acoustic-level information for matching between spoken documents and a spoken query without transcribing them into phonemes or words. QbE-STD is important for low-resourced languages and under non-mainstream conditions and hence, it was also called an unsupervised STD [1], [2]. As a part of the MediaEval campaign, the Spoken Web Search (SWS) was started in 2011 [3].…”
Section: Introductionmentioning
confidence: 99%
“…These systems can be broadly classified into following categories [2]. Cascaded ASR with text information retrieval: The spoken content is converted into word or sub word sequences or lattices using ASR and then text retrieval techniques are applied.…”
Section: Overviewmentioning
confidence: 99%
“…Nowadays, it is receiving much importance due to the large volume of multimedia information. Research and technology improvements in automated speech recognition successfully achieved the information retrieval by using the transcribed textual form of the spoken contents [1]. Similarly, due to the exponential growth of internet and multimedia contents, the STD methods have been achieving much popularity.…”
Section: Introductionmentioning
confidence: 99%