2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7472835
|View full text |Cite
|
Sign up to set email alerts
|

Approximate search of audio queries by using DTW with phone time boundary and data augmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
11
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 10 publications
1
11
0
Order By: Relevance
“…[35][36][37] propose a logistic regression-based fusion of acoustic keyword spotting and DTW-based systems using language-dependent phoneme recognizers. [38][39][40][41] use a logistic regression-based fusion on DTW-and phone-based systems. Oishi et al [42] uses a DTW-based search at the HMM state-level from syllables obtained from a word-based speech recognizer and a deep neural network (DNN) posteriorgram-based rescoring, and [43] adds a logistic regression-based approach for detection rescoring.…”
Section: Hybrid Approachmentioning
confidence: 99%
“…[35][36][37] propose a logistic regression-based fusion of acoustic keyword spotting and DTW-based systems using language-dependent phoneme recognizers. [38][39][40][41] use a logistic regression-based fusion on DTW-and phone-based systems. Oishi et al [42] uses a DTW-based search at the HMM state-level from syllables obtained from a word-based speech recognizer and a deep neural network (DNN) posteriorgram-based rescoring, and [43] adds a logistic regression-based approach for detection rescoring.…”
Section: Hybrid Approachmentioning
confidence: 99%
“…Our partial matching DTW systems, including fixedwindow [8,16] and phoneme-sequence [17] partial matching systems, were used to deal with T2 and T3 queries. In each fixed-window partial matching system, an analysis window between 70 and 90 frames long was defined.…”
Section: Dtw Systemsmentioning
confidence: 99%
“…Unsupervised acoustic modeling or feature extraction has been studied in [11][12][13][14] to deal with the lack of knowledge about target data. Partial matching techniques [15][16][17] have been developed to deal with different kinds of query matches for the QUESST 2014.…”
Section: Introductionmentioning
confidence: 99%
“…the DTW distance between two speech signals; distances from others variants of DTW such as subsequence DTW [156] and partial DTW [139]…”
Section: Neural Network Classifiermentioning
confidence: 99%
“…Specifically, overlapping sub-sequences, also called partial The proposed partial search approach is motivated by the success of the partial matching approach on the query-by-example [139,140] and repeating sequence detection [141] tasks. However, in [139,140], each partial sequence is a sequence of acoustic vectors rather than a sequence of subword units. In the context of KWS, the idea of deriving sub-sequences from the subword sequence of a keyword is similar to the ngram-based approach [93,142,143].…”
Section: Introductionmentioning
confidence: 99%