J. Fortuna scite author profile

This paper is concerned with the verification effectiveness in open-set, text-independent speaker identification. The study includes an analysis of the characteristics of this mode of speaker recognition and the potential causes of errors. The use of well-known score normalisation techniques for the purpose enhancing the reliability of the process is described and their relative effectiveness is experimentally investigated. The experiments are based on the dataset proposed for the 1-speaker detection task of the NIST Speaker Recognition Evaluation 2003. Based on the experimental results, it is demonstrated that significant benefits is achieved by using score normalisation in open-set identification, and that the level of this depends highly on the type of the approach adopted. The results also show that better performance can be achieved by using the cohort normalisation methods. In particular, the unconstrained cohort method with a relatively small cohort size appears to outperform all other approaches.

show abstract

Improved support vector classification using PCA and ICA feature space modification

Fortuna

Capson

2004

Pattern Recognition

View full text Add to dashboard Cite

Unsupervised speaker change detection using probabilistic pattern matching

Malegaonkar

Ariyaeeinia

Sivakumaran³

et al. 2006

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

Abstract-This letter presents an investigation into the use of a probabilistic pattern matching approach for detecting speaker changes in audio streams. The experiments are conducted using clean speech as well as broadcast news material. It is shown that, in the proposed approach, the use of bilateral scoring is considerably more effective than unilateral scoring. Appropriate score normalization methods are considered in the study. It is observed that in all the cases, the bilateral scoring approach outperforms the currently popular method of Bayesian information criterion (BIC) for speaker change detection. This letter discusses the principles of the proposed approach and details the experimental investigations.

show abstract

Rigid Structure from Motion from a Blind Source Separation Perspective

Fortuna

Martı́nez

2009

Int J Comput Vis

View full text Add to dashboard Cite

We present an information theoretic approach to define the problem of structure from motion (SfM) as a blind source separation one. Given that for almost all practical joint densities of shape points, the marginal densities are non-Gaussian, we show how higher-order statistics can be used to provide improvements in shape estimates over the methods of factorization via Singular Value Decomposition (SVD), bundle adjustment and Bayesian approaches. Previous techniques have either explicitly or implicitly used only second-order statistics in models of shape or noise. A further advantage of viewing SfM as a blind source problem is that it easily allows for the inclusion of noise and shape models, resulting in Maximum Likelihood (ML) or Maximum a Posteriori (MAP) shape and motion estimates. A key result is that the blind source separation approach has the ability to recover the motion and shape matrices without the need to explicitly know the motion or shape pdf. We demonstrate that it suffices to know whether the pdf is sub-or super-Gaussian (i.e., semi-parametric estimation) and derive a simple formulation to determine this from the data. We provide extensive experimental results on synthetic and real tracked points in order to quantify the improvement obtained from this technique.

show abstract

A comparison of PCA and ICA for object recognition under varying illumination

Fortuna

Schuurman

Capson

View full text Add to dashboard Cite

Perturbation Estimation of the Subspaces for Structure from Motion with Noisy and Missing Data

Jia

Fortuna

Martı́nez

2006

View full text Add to dashboard Cite

show abstract

ICA filters for lighting invariant face recognition

Fortuna

Capson

2004

View full text Add to dashboard Cite

On the enhancement of speaker identification accuracy using weighted bilateral scoring

Malegaonkar¹,

Ariyaeeinia

Sivakumaran

et al. 2008

View full text Add to dashboard Cite

???This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder." ???Copyright IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.??? DOI: 10.1109/CCST.2008.4751310This paper presents investigations into an effective bilateral scoring method in open-set speaker identification. The approach is based on the fact that two different speakers usually are not reciprocal. A difficulty in deploying bilateral scoring is that test utterances are normally much shorter than training utterances. To tackle this problem, the proposed approach provides the final identification score based on a weighted combination of independently normalised forward and reverse scores. Based on the experimental results obtained using clean and telephone quality speech, it is shown that the proposed approach is more effective than the conventional scoring methods in open-set speaker identification

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

J. Fortuna

Verification effectiveness in open-set speaker identification

Improved support vector classification using PCA and ICA feature space modification

Unsupervised speaker change detection using probabilistic pattern matching

Rigid Structure from Motion from a Blind Source Separation Perspective

A comparison of PCA and ICA for object recognition under varying illumination

Perturbation Estimation of the Subspaces for Structure from Motion with Noisy and Missing Data

ICA filters for lighting invariant face recognition

On the enhancement of speaker identification accuracy using weighted bilateral scoring

Contact Info

Product

Resources

About