2022
DOI: 10.1109/access.2022.3162854

Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages

Abstract: This paper presents the resources and benchmarks developed for keyword search (KWS) in spoken audio from six low-resource Indian languages (from two families), namely Gujarati, Hindi, Marathi, Odia, Tamil, and Telugu. The current work on constructing keywords and building benchmark KWS systems is inspired by the popular IARPA Babel program and the subsequent works on low-resource KWS. The keywords are constructed by taking into account their properties, i.e., occurrence, length, and average confusability; and t…

citations
Cited by 3 publications
(1 citation statement)
references
References 35 publications
(43 reference statements)
“…There are many languages in India which are referred to as low-resourced (19) languages. Majority of the people around the globe speak a language that is low in resource where scripted data is not available.…”
Section: Impact Of Low-resource Availability On Processing Of Natural... (mentioning)
confidence: 99%