Jiakang Li scite author profile

Jiakang Li

5Publications

62Citation Statements Received

55Citation Statements Given

How they've been cited

How they cite others

104

Affiliations

PLA Army Engineering University

Publications

Order By: Most citations

Joint Decision of Anti-Spoofing and Automatic Speaker Verification by Multi-Task Learning With Contrastive Loss

Sun

Zhang

et al. 2020

IEEE Access

View full text Add to dashboard Cite

Automatic speaker verification (ASV) is an emerging biometric verification technique with more and more applications. However, both verification accuracy and anti-spoofing should be considered carefully before putting ASV into practice, where anti-spoofing is also called replay detection in which voice is recorded, stored and replayed to deceive ASV systems. Cascaded decision of anti-spoofing and ASV is a straightforward solution to tackle the two issues. In this paper, joint decision of anti-spoofing and ASV was investigated in a multi-task learning framework with contrastive loss in order to improve the cascaded decision approach. A modified triplet loss was firstly constructed to supervise deep neural networks to extract embedding vectors containing information of both speaker identity and spoofing. The embedding vectors were subsequently taken as input features by back-end classifiers towards speaker and spoofing classification. The experimental results on both ASVspoof 2017 and ASVspoof 2019 showed that the proposed joint decision approach with triplet loss outperformed the corresponding baselines, a recent work on joint decision with Gaussian back-end fusion and our previous joint decision approach with cross-entropy loss.

show abstract

Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection

Sun

Zhang

2019

View full text Add to dashboard Cite

When Automatic Voice Disguise Meets Automatic Speaker Verification

Zheng

Sun

et al. 2021

IEEE Trans.Inform.Forensic Secur.

View full text Add to dashboard Cite

Attention-Based LSTM Algorithm for Audio Replay Detection in Noisy Environments

Zhang

Sun

et al. 2019

Applied Sciences

View full text Add to dashboard Cite

Even though audio replay detection has improved in recent years, its performance is known to severely deteriorate with the existence of strong background noises. Given the fact that different frames of an utterance have different impacts on the performance of spoofing detection, this paper introduces attention-based long short-term memory (LSTM) to extract representative frames for spoofing detection in noisy environments. With this attention mechanism, the specific and representative frame-level features will be automatically selected by adjusting their weights in the framework of attention-based LSTM. The experiments, conducted using the ASVspoof 2017 dataset version 2.0, show that the equal error rate (EER) of the proposed approach was about 13% lower than the constant Q cepstral coefficients-Gaussian mixture model (CQCC-GMM) baseline in noisy environments with four different signal-to-noise ratios (SNR). Meanwhile, the proposed algorithm also improved the performance of traditional LSTM on audio replay detection systems in noisy environments. Experiments using bagging with different frame lengths were also conducted to further improve the proposed approach.

show abstract

When Automatic Voice Disguise Meets Automatic Speaker Verification

Zheng

Sun

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jiakang Li

Joint Decision of Anti-Spoofing and Automatic Speaker Verification by Multi-Task Learning With Contrastive Loss

Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection

When Automatic Voice Disguise Meets Automatic Speaker Verification

Attention-Based LSTM Algorithm for Audio Replay Detection in Noisy Environments

When Automatic Voice Disguise Meets Automatic Speaker Verification

Contact Info

Product

Resources

About