Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation

Tejedor, Javier; Toledano, Doroteo T.; López-Otero, Paula; Docío-Fernández, Laura; Peñagarikano, Mikel; Rodríguez-Fuentes, Luis Javier; Sandoval, Antonio Moreno

doi:10.1186/s13636-019-0156-x

Cited by 4 publications

(2 citation statements)

References 72 publications

(71 reference statements)


“…These include THUMOS 14 (Jiang et al, 2014) as well as ActivityNet 1.2 and ActivityNet 1.3 challenges (Fabian Caba Heilbron and Niebles, 2015). Another example is query-by-example spoken term detection, as considered e.g., in ALBAYZIN 2018 challenge (Tejedor et al, 2019).…”

Section: Review Of Existing Datasets (mentioning)

confidence: 99%

Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines

Borchmann¹,

Wiśniewski²,

Gretkowski³

et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020


We propose a new shared task of semantic retrieval from legal texts, in which a so-called contract discovery is to be performed, where legal clauses are extracted from documents, given a few examples of similar clauses from other legal acts. The task differs substantially from conventional NLI and shared tasks on legal information extraction (e.g., one has to identify a text span instead of a single document, page, or paragraph). The specification of the proposed task is followed by an evaluation of multiple solutions within the unified framework proposed for this branch of methods. It is shown that state-of-the-art pretrained encoders fail to provide satisfactory results on the task proposed. In contrast, Language Model-based solutions perform better, especially when unsupervised fine-tuning is applied. Besides the ablation studies, we addressed questions regarding detection accuracy for relevant text fragments depending on the number of examples available. In addition to the dataset and reference results, LMs specialized in the legal domain were made publicly available.


“…However, this approach cannot meet the requirements of speed and quality at the same time in practical applications. Thus, to avoid the decoding process of ASR, some methods [4][5][6] directly use the acoustic modeling part of ASR model to extract the features of audio signals, and then compare these features of different lengths by dynamic time warping (DTW) [7].…”

Section: Introduction (mentioning)

confidence: 99%

Fast Query-by-example Speech Search using Attention-based Deep Binary Embeddings

Yuan

Xie

Leung

et al. 2020

IEEE/ACM Trans. Audio Speech Lang. Process.


Traditional Query-by-Example (QbE) speech search approaches usually use methods based on frame-level features, while state-of-the-art approaches tend to use models based on acoustic word embeddings (AWEs) to transform variable-length audio signals into fixed-length feature vector representations. However, these approaches cannot meet the requirements of search quality and speed at the same time. In this paper, we propose a novel fast QbE speech search method based on separable models to fix this problem. First, a QbE speech search training framework is introduced. Second, we design a novel model inference scheme based on RepVGG which can efficiently improve the QbE search quality. Third, we modify and improve our QbE speech search model according to the proposed model inference scheme. Experiments on a keywords dataset show that our proposed method can improve the GPU Real-time Factor (RTF) from 1/150 to 1/2300 by just applying the separable model scheme, and that it outperforms other state-of-the-art methods.
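The citation statement for this paper describes comparing feature sequences of different lengths with DTW. As a rough illustration only, the following is a minimal sketch of the classic whole-sequence DTW recurrence in plain Python. It is not the method of any paper listed here: the function names `euclidean` and `dtw_distance`, the toy two-dimensional frames, and the choice of Euclidean frame distance are all assumptions made for the example. Practical QbE systems typically use a subsequence variant that lets the query match anywhere inside a longer utterance, which this sketch does not do.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature frames."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(query, utterance, dist=euclidean):
    """Accumulated cost of the best monotonic alignment of two frame sequences.

    Assumed step pattern: match, insertion, deletion, all with unit weight.
    The two sequences may have different lengths.
    """
    n, m = len(query), len(utterance)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    # cost[i][j] holds the best alignment cost of query[:i] and utterance[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(query[i - 1], utterance[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # advance in the query only
                cost[i][j - 1],      # advance in the utterance only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


# Toy frames (hypothetical data): the second sequence is a time-stretched
# copy of the first, the third is a different pattern.
query = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
stretched = [[0.0, 0.0], [0.0, 0.0], [1.0, 1.0], [1.0, 1.0], [2.0, 2.0]]
shifted = [[5.0, 5.0], [6.0, 6.0], [7.0, 7.0], [8.0, 8.0]]

print(dtw_distance(query, stretched))
print(dtw_distance(query, shifted))
```

In this toy case the stretched copy aligns with zero accumulated cost even though it has more frames, while the different pattern yields a positive cost. The quadratic table is also a reminder of why frame-level matching is slow compared with comparing fixed-length embeddings.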


Designing an Iterative Adaptive Arithmetic Coding-Based Lossless Bio-signal Compression for Online Patient Monitoring System (IAALBC)

Mondal

Debnath

Tabassum

et al. 2023

Lecture Notes in Networks and Systems

