Kong Aik Lee scite author profile

The ASVspoof initiative was created to promote the development of countermeasures which aim to protect automatic speaker verification (ASV) from spoofing attacks. The first community-led, common evaluation held in 2015 focused on countermeasures for speech synthesis and voice conversion spoofing attacks. Arguably, however, it is replay attacks which pose the greatest threat. Such attacks involve the replay of recordings collected from enrolled speakers in order to provoke false alarms and can be mounted with greater ease using everyday consumer devices. ASVspoof 2017, the second in the series, hence focused on the development of replay attack countermeasures. This paper describes the database, protocols and initial findings. The evaluation entailed highly heterogeneous acoustic recording and replay conditions which increased the equal error rate (EER) of a baseline ASV system from 1.76% to 31.46%. Submissions were received from 49 research teams, 20 of which improved upon a baseline replay spoofing detector EER of 24.77%, in terms of replay/non-replay discrimination. While largely successful, the evaluation indicates that the quest for countermeasures which are resilient in the face of variable replay attacks remains very much alive.

show abstract

Text-dependent speaker verification: Classifiers, databases and RSR2015

Larcher

Lee

et al. 2014

Speech Communication

231

252

View full text Add to dashboard Cite

ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

Todisco¹,

Wang²,

Vestman³

et al. 2019

335

235

View full text Add to dashboard Cite

ASVspoof, now in its third edition, is a series of communityled challenges which promote the development of countermeasures to protect automatic speaker verification (ASV) from the threat of spoofing. Advances in the 2019 edition include: (i) a consideration of both logical access (LA) and physical access (PA) scenarios and the three major forms of spoofing attack, namely synthetic, converted and replayed speech; (ii) spoofing attacks generated with state-of-the-art neural acoustic and waveform models; (iii) an improved, controlled simulation of replay attacks; (iv) use of the tandem detection cost function (t-DCF) that reflects the impact of both spoofing and countermeasures upon ASV reliability. Even if ASV remains the core focus, in retaining the equal error rate (EER) as a secondary metric, ASVspoof also embraces the growing importance of fake audio detection. ASVspoof 2019 attracted the participation of 63 research teams, with more than half of these reporting systems that improve upon the performance of two baseline spoofing countermeasures. This paper describes the 2019 database, protocols and challenge results. It also outlines major findings which demonstrate the real progress made in protecting against the threat of spoofing and fake audio.

show abstract

Improving convergence of the NLMS algorithm using constrained subband updates

Lee

Gan

2004

IEEE Signal Process. Lett.

272

207

View full text Add to dashboard Cite

Spoken Language Recognition: From Fundamentals to Practice

2013

View full text Add to dashboard Cite

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Wang

Yamagishi

Todisco

et al. 2020

Computer Speech & Language

199

120

View full text Add to dashboard Cite

Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech

et al. 2012

View full text Add to dashboard Cite

Voice conversion -the methodology of automatically converting one's utterances to sound as if spoken by another speaker -presents a threat for applications relying on speaker verification. We study vulnerability of text-independent speaker verification systems against voice conversion attacks using telephone speech. We implemented a voice conversion systems with two types of features and nonparallel frame alignment methods and five speaker verification systems ranging from simple Gaussian mixture models (GMMs) to state-of-the-art joint factor analysis (JFA) recognizer. Experiments on a subset of NIST 2006 SRE corpus indicate that the JFA method is most resilient against conversion attacks. But even it experiences more than 5-fold increase in the false acceptance rate from 3.24 % to 17.33 %.

show abstract

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

Todisco²,

et al. 2018

View full text Add to dashboard Cite

The now-acknowledged vulnerabilities of automatic speaker verification (ASV) technology to spoofing attacks have spawned interests to develop so-called spoofing countermeasures. By providing common databases, protocols and metrics for their assessment, the ASVspoof initiative was born to spearhead research in this area. The first competitive ASVspoof challenge held in 2015 focused on the assessment of countermeasures to protect ASV technology from voice conversion and speech synthesis spoofing attacks. The second challenge switched focus to the consideration of replay spoofing attacks and countermeasures. This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation. The paper contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments. These support the analysis of replay detection performance and limits. Also described are new results for the official ASVspoof baseline system which is based upon a constant Q cesptral coefficient frontend and a Gaussian mixture model backend. Reported are enhancements to the baseline system in the form of log-energy coefficients and cepstral mean and variance normalisation in addition to an alternative i-vector backend. The best results correspond to a 48% relative reduction in equal error rate when compared to the original baseline system.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Kong Aik Lee

The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection

Text-dependent speaker verification: Classifiers, databases and RSR2015

ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

Improving convergence of the NLMS algorithm using constrained subband updates

Spoken Language Recognition: From Fundamentals to Practice

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

Contact Info

Product

Resources

About