2021
DOI: 10.1007/s11042-020-10212-0
|View full text |Cite
|
Sign up to set email alerts
|

Voice conversion spoofing detection by exploring artifacts estimates

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 35 publications
0
4
0
Order By: Relevance
“…Tak et al utilized a Graph Attention Network (GAT) to model the feature quantities of multi-band and multi-temporal data [40]. Hemavathi et al adopted a blind source separation (BSS) based on non-negative matrix factorization to separate artificially synthesized sounds into genuine and fake components and then used a CNN network to achieve the target audio's authenticity identification [41].…”
Section: Related Workmentioning
confidence: 99%
“…Tak et al utilized a Graph Attention Network (GAT) to model the feature quantities of multi-band and multi-temporal data [40]. Hemavathi et al adopted a blind source separation (BSS) based on non-negative matrix factorization to separate artificially synthesized sounds into genuine and fake components and then used a CNN network to achieve the target audio's authenticity identification [41].…”
Section: Related Workmentioning
confidence: 99%
“…MFCC [24], LFCC, and CQCC [25] are the main features used for synthetic spoofing detection. Various deep learning architectures like DNN [26] - [27], LCNN [28] and LSTM [29] have been employed for spoofing detection. For the replay attack detection, seven augmentation techniques were tested; out of these, dynamic value change and pitch change showed an 8% improvement in base model accuracy [30].…”
Section: Literature Reviewmentioning
confidence: 99%
“…An alternative approach involves the exploration of artifact estimations, a result that arises when an impostor tries to transform their speech into a genuine version [33]. This study was based on the assumption that all manipulated speech samples would exhibit artifacts.…”
Section: Voice Spoofingmentioning
confidence: 99%