Wavelet packet approximation of critical bands for speaker verification

Siafarikas, Mihalis; Ganchev, Todor; Fakotakis, Nikos; Kokkinakis, G.

doi:10.1007/s10772-009-9028-6

Cited by 9 publications

(2 citation statements)

References 28 publications

(35 reference statements)


“…It represents the common standard deviation for all Gaussian functions. Moreover, as demonstrated by Specht (1990), the network is tolerant to the choice of the smoothing factor, and provides robust operation for a relatively wide range of values. (2) The PNN copes well with erroneous training vectors. In many cases a few sparse data samples can be sufficient for optimal performance.…”

Section: Characterization Of the Probabilistic Neural Network

mentioning

confidence: 97%

“…In validation tests on the NIST 2001 SRE database, whose design and statistical specifications are entirely different from the ones of Polycost, it was confirmed (Ganchev, Siafarikas, Fakotakis, 2004e) that the proposed speech features indeed facilitate the speaker verification process. They demonstrated lower error rates and decision cost than other successful DFT- and DWPT-based speech features, such as the widely used MFCC, and the features of (Sarikaya and Hansen, 2000) and (Farooq and Datta, 2002). Later, in (Siafarikas, Ganchev, Fakotakis, 2005) the same objective criterion was employed in a systematic search for the best wavelet packet tree (among 16 candidates) in a wider area and relaxed margins and bandwidth. The next subsections present the wavelet packet trees design and selection, and the computation of the wavelet packet-based speech features for speaker recognition.…”

mentioning

confidence: 99%


Speaker recognition

Ganchev¹

Self Cite


Firstly, I would like to acknowledge Prof. Nikos Fakotakis, who served as the supervisor of my Ph.D. study. I enjoyed his comprehensive support from the very first day of my work at the Wire Communications Laboratory. It was his countenance that made possible the successful completion of my study.

I would like to express my gratitude to Prof. George Kokkinakis, whose profound analysis of my prospective submissions helped me to improve both the presentation style and the overall quality of the manuscripts. I would also like to thank Assoc. Prof. John Mourjopoulos, who directed my attention to the psychoacoustic aspects of speech perception; Asst. Prof. Evangelos Dermatas, who inspired my interest in recurrent neural networks; and Prof. Michael Vrahatis, who introduced me to evolutionary computation techniques. Further, I would like to thank Dr. Anastasious Tsopanoglou, with whom I had regular discussions during the first year of my study, and Dr. Ilyas Potamitis, whose insights helped me to avoid many snares during all these years. In addition, I would like to thank all the colleagues who contributed to the comfortable collaborative atmosphere at the Wire Communications Laboratory.

Finally, I would like to acknowledge the State Scholarship Foundation of Greece (IKY), which financially supported my Ph.D. study during the years 2002 to 2005. The IKY scholarship gave me the peace of mind to focus on my study, which I enjoyed very much. I highly appreciate this support.

Contents: Dedication; Acknowledgements; Contents; List of figures; List of tables; List of abbreviations; Notations and operations; PART I. INTRODUCTION TO THE SPEAKER RECOGNITION TECHNOLOGY AND OVERVIEW OF THE STATE-OF-ART


Contemporary Methods for Speech Parameterization

Ganchev¹

2011


The authors of this series have been hand-selected. They comprise some of the most outstanding scientists, drawn from academia and private industry, whose research is marked by its novelty, applicability, and practicality in providing broad-based speech solutions. The SpringerBriefs in Speech Technology series provides the latest findings in speech technology gleaned from comprehensive literature reviews and empirical investigations that are performed in both laboratory and real-life settings. Some of the topics covered in this series include the presentation of real-life commercial deployment of spoken dialog systems, contemporary methods of speech parameterization, developments in information security for automated speech, forensic speaker recognition, use of sophisticated speech analytics in call centers, and an exploration of new methods of soft computing for improving human-computer interaction. Those in academia, the private sector, the self-service industry, law enforcement, and government intelligence are among the principal audience for this series, which is designed to serve as an important and essential reference guide for speech developers, system designers, speech engineers, linguists and others. In particular, a major audience of readers will consist of researchers and technical experts in the automated call center industry, where speech processing is a key component of the functioning of customer care contact centers.

Amy Neustein, Ph.D., serves as Editor-in-Chief of the International Journal of Speech Technology (Springer). She edited the recently published book "Advances in Speech Recognition: Mobile Environments, Call Centers and Clinics" (Springer 2010), and serves as guest columnist on speech processing for Womensenews. Dr. Neustein is Founder and CEO of Linguistic Technology Systems, a NJ-based think tank for intelligent design of advanced natural language based emotion-detection software to improve human response in monitoring recorded conversations of terror suspects and helpline calls. Dr. Neustein's work appears in the peer-reviewed literature and in industry and mass media publications. Her academic books, which cover a range of political, social and legal topics, have been cited in the Chronicles of Higher Education, and have won her a pro Humanitate Literary Award. She serves on the visiting faculty of the National Judicial College and as a plenary speaker at conferences in artificial intelligence and computing. Dr. Neustein is a member of MIR (machine intelligence research) Labs, which does advanced work in computer technology to assist underdeveloped countries in improving their ability to cope with famine, disease/illness, and political and social affliction. She is a founding member of the New York City Speech Processing Consortium, a newly formed group of NY-based companies, publishing houses, and researchers dedicated to advancing speech technology research and development.

© Springer Science+Business Media, LLC 2011. All rights reserved. This work may not be translated or copied in...
