Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2384
|View full text |Cite
|
Sign up to set email alerts
|

Sampling Strategies in Siamese Networks for Unsupervised Speech Representation Learning

Abstract: Recent studies have investigated siamese network architectures for learning invariant speech representations using samedifferent side information at the word level. Here we investigate systematically an often ignored component of siamese networks: the sampling procedure (how pairs of same vs. different tokens are selected). We show that sampling strategies taking into account Zipf's Law, the distribution of speakers and the proportions of same and different pairs of words significantly impact the performance o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(23 citation statements)
references
References 18 publications
0
23
0
Order By: Relevance
“…The BNFs are in any case competitive with the higher dimensional features, and have the advantage that they can be built using standard Kaldi scripts and do not require any training on the target language, so can easily be deployed to new languages. The competitive result of [43] also shows that in general a system trained on word pairs discovered from a UTD system can perform very well.…”
Section: Evaluation Using Zrsc Data and Measuresmentioning
confidence: 85%
“…The BNFs are in any case competitive with the higher dimensional features, and have the advantage that they can be built using standard Kaldi scripts and do not require any training on the target language, so can easily be deployed to new languages. The competitive result of [43] also shows that in general a system trained on word pairs discovered from a UTD system can perform very well.…”
Section: Evaluation Using Zrsc Data and Measuresmentioning
confidence: 85%
“…One type of neural approach that has received particular attention is Siamese networks [18]- [22]. A Siamese network consists of two identical sub-networks with tied weights taking in a pair of inputs [23].…”
Section: Introductionmentioning
confidence: 99%
“…The probability of selecting a node is a function of its degree, utilizing the function proposed by Riad et al. 29 The sampling compression function is chosen to be the square-root function, which retains the power-law degree distribution while keeping the frequency ranking of each node. When a batch of nodes S is sampled without replacement from this distribution, each node i has a set of positive edges, P i .…”
Section: Model Optimization With Batch Sampling Strategymentioning
confidence: 99%
“…For a test to differentiate between positive interactions and random noise interactions, we also uniformly sample a number of interactions from the set of all possible pairwise interactions to consider as negative interaction using the random node sampling distribution specified in Riad et al . 29 This set is denoted as E n , and the number of negative interactions is sampled such that the ratio of negative to positive interactions is 1.0. At evaluation time, the set of ground truth validation edges E d and random noise E n edges is used to calculate the precision and recall rates.…”
Section: Novel Link Predictionsmentioning
confidence: 99%