2022
DOI: 10.1016/j.jksuci.2022.02.024
|View full text |Cite
|
Sign up to set email alerts
|

A robust voice spoofing detection system using novel CLS-LBP features and LSTM

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…Furthermore, 9.33%, 7.69%, 8.09%, 9.57%, and 2.502% EER are achieved by MFCC-ResNet [13], CQCC-ResNet [13], LFCC-GMM [58], and CQCC-GMM [58] respectively. Similarly, another model [59] attained 0.58% EER and 0.0160 t-DCF for the PA eval set. Other algorithms, such as [13] and [58], performed various experiments and attained 4.43%, 13.54%, 1.04%, and 0.459% EER.…”
Section: Hcomparative Analysis With Existing Features Extraction-base...mentioning
confidence: 86%
See 1 more Smart Citation
“…Furthermore, 9.33%, 7.69%, 8.09%, 9.57%, and 2.502% EER are achieved by MFCC-ResNet [13], CQCC-ResNet [13], LFCC-GMM [58], and CQCC-GMM [58] respectively. Similarly, another model [59] attained 0.58% EER and 0.0160 t-DCF for the PA eval set. Other algorithms, such as [13] and [58], performed various experiments and attained 4.43%, 13.54%, 1.04%, and 0.459% EER.…”
Section: Hcomparative Analysis With Existing Features Extraction-base...mentioning
confidence: 86%
“…For the LA set, the best EER is 0.045%, and our proposed spoofing detector attains a t-DCF of 0.002. The second-best EER is 0.06, and t-DCF is 0.0017 attained by [59]. Furthermore, 9.33%, 7.69%, 8.09%, 9.57%, and 2.502% EER are achieved by MFCC-ResNet [13], CQCC-ResNet [13], LFCC-GMM [58], and CQCC-GMM [58] respectively.…”
Section: Hcomparative Analysis With Existing Features Extraction-base...mentioning
confidence: 94%
“…To improve the ASV consistency in reverberant conditions [49,50], ASVspoof 2019 [2] comprises simulated replay recordings [51][52][53] in deep acoustic environments as opposed to the ASVspoof 2017 dataset [17], which contained the replay attacks. For the collection of PA samples, physical characteristics were considered, for example, room sizes in which the audios were synthesized, which were divided into three categories: small, medium, and large.…”
Section: Datasetmentioning
confidence: 99%
“…They evaluated their approach on ASVspoof 2019 and VSDC datasets. Dawood et al, (2022) suggested a new feature descriptor Center Lop-Sided Local binary patterns (CS-LBP) to represent audio files in the best manner. These features were also fed into the long short-term memory network for the detection of audio forgery.…”
Section: Related Workmentioning
confidence: 99%