Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1011
|View full text |Cite
|
Sign up to set email alerts
|

Improved RawNet with Feature Map Scaling for Text-Independent Speaker Verification Using Raw Waveforms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
49
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 68 publications
(50 citation statements)
references
References 0 publications
0
49
0
1
Order By: Relevance
“…This section provides a brief review of past work that led to the development of RawNet2 [7]. The treatment is upon automatic speaker verification.…”
Section: Previous Workmentioning
confidence: 99%
See 3 more Smart Citations
“…This section provides a brief review of past work that led to the development of RawNet2 [7]. The treatment is upon automatic speaker verification.…”
Section: Previous Workmentioning
confidence: 99%
“…RawNet2 [7] combines the merits of the original RawNet approach (RawNet1) with those of SincNet. The first layer of RawNet2 is essentially the same as that of SincNet, whereas the upper layers consist of the same residual blocks and GRU layer as RawNet1.…”
Section: Previous Workmentioning
confidence: 99%
See 2 more Smart Citations
“…While MOS is a desired measure for audio naturalness/fidelity in singing voice synthesis tasks, voice similarity is more diffi-1 Audio demo: https://nobody996.github.io/FastSVC/. Therefore, we adopt a pre-trained end-to-end speaker recognition model named RawNet2 [33] to objectively measure the voice similarity of the converted singing samples. We measure cosine similarities between embedding vectors of audio samples and the desired target speaker embedding vectors before and after conversion, where all embedding vectors are computed by the RawNet2 and singer/speaker embedding vectors are obtained by averaging his/her training audio samples.…”
Section: Objective Evaluationmentioning
confidence: 99%