Byung-Min So scite author profile

Forensic Automatic Speaker Identification System for Korean Speakers

In this paper, we introduce the automatic speaker identification system 'SPO (Supreme Prosecutors Office) Verifier'. SPO Verifier is an automatic speaker recognition system based on a GMM (Gaussian mixture model)-UBM (universal background model) and was developed using Korean speakers' utterances. The system uses a channel compensation algorithm to compensate for recording device characteristics. It also lets users manage reference models built from utterances recorded in various environments, to obtain more accurate recognition results. To evaluate the performance of SPO Verifier on Korean speakers, we compared it with one of the most widely used commercial systems in the forensic field. The results showed that SPO Verifier achieves a lower EER (equal error rate) than the commercial system.
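The EER mentioned above is the operating point at which the false acceptance rate equals the false rejection rate. The following is a minimal sketch of how it can be approximated from trial scores. It is not the evaluation code of SPO Verifier; the function name `equal_error_rate` and the example scores are hypothetical and chosen only for illustration.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Approximate the EER by sweeping a threshold over all observed scores."""
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best_gap, eer = float("inf"), None
    for t in thresholds:
        # False rejection: a genuine-speaker trial scores below the threshold.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # False acceptance: an impostor trial scores at or above the threshold.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the threshold where the two error rates are closest.
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Hypothetical scores (higher means "more likely the same speaker").
target = [2.1, 1.7, 0.9, 1.4, 0.2]
impostor = [-1.0, 0.3, -0.4, 1.0, -0.8]
eer = equal_error_rate(target, impostor)
print(f"EER = {eer:.2f}")
```

With only a handful of trials the two error curves rarely cross exactly, so the sketch reports the average of the two rates at the closest threshold; evaluation toolkits typically interpolate instead.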


Histogram Equalization Using Background Speakers' Utterances for Speaker Identification

Kim¹, Yang², So³ et al. 2012

Phonetics and Speech Sciences


In this paper, we propose a novel approach to improve histogram equalization for speaker identification. Our method collects all speech features of the UBM training data to form a reference distribution. The rank of each feature vector is computed in the sorted list that pools the UBM training data with the test data. We use these ranks to perform order-based histogram equalization. The proposed method improves the accuracy of the speaker recognition system on short utterances. We use four kinds of speech databases to evaluate the proposed speaker recognition system and compare it with cepstral mean normalization (CMN), mean and variance normalization (MVN), and histogram equalization (HEQ). Our system achieved a 33.3% relative reduction in error rate compared with the baseline system.
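The rank-based step described in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: it handles a single feature dimension, the function name `order_based_heq` is hypothetical, and the target distribution is assumed to be a standard Gaussian, which the abstract does not specify.

```python
from bisect import bisect_left, bisect_right
from statistics import NormalDist


def order_based_heq(test_values, background_values):
    """Equalize one feature dimension using ranks in the pooled data."""
    # Pool the background (UBM training) values with the test values and sort.
    pooled = sorted(list(background_values) + list(test_values))
    n = len(pooled)
    target = NormalDist()  # assumption: standard Gaussian target distribution
    equalized = []
    for x in test_values:
        # 1-based mid-rank of x in the pooled list; ties share the average rank.
        lo = bisect_left(pooled, x)
        hi = bisect_right(pooled, x)
        rank = (lo + hi + 1) / 2
        # Empirical CDF estimate, kept strictly inside (0, 1).
        cdf = (rank - 0.5) / n
        # Map the CDF value through the inverse CDF of the target distribution.
        equalized.append(target.inv_cdf(cdf))
    return equalized


# Hypothetical values for one cepstral coefficient.
background = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
test = [0.5, 2.5, -1.5]
out = order_based_heq(test, background)
print(out)
```

Pooling the background data means that even a test utterance with very few frames gets rank estimates drawn from a large sample, which is consistent with the abstract's claim about short utterances. A full system would apply this per dimension to every feature vector.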


Restoration of damaged speech files using deep neural networks

Heo¹, So², Yang³ et al. 2017

The Journal of the Acoustical Society of Korea


In this paper, we propose a method for restoring damaged audio files using a deep neural network. It differs from conventional restoration based on file carving. The purpose of our method is to infer lost information that cannot be restored by existing techniques such as file carving. We have devised methods to automate tasks that are essential for restoration but ill-suited to humans. The study shows that damaged files which conventional file carving could not restore can be restored by using deep neural networks for tasks such as speech/non-speech decision and speech encoder recognition.


Speaker Identification on Various Environments Using an Ensemble of Kernel Principal Component Analysis

Yang¹, Kim², So³ et al. 2012

The Journal of the Acoustical Society of Korea


In this paper, we propose a new approach to speaker identification that uses an ensemble of multiple classifiers (speaker identifiers). KPCA (kernel principal component analysis) enhances the features for each classifier. To reduce processing time and memory requirements, we randomly select a limited number of samples to use as the estimation set for each KPCA basis. The experimental results show that the proposed approach gives higher identification accuracy than GKPCA (greedy kernel principal component analysis).

II. Related Work

Speaker identification using GMM-UBM

GMM-UBM [6] builds a background speaker model (UBM, universal background model) with a Gaussian mixture model (GMM) [7] and, by performing MAP (maximum a posteriori) adaptation with each speaker's training utterances, ...



Byung-Min So

Automated recovery of damaged audio files using deep neural networks

Forensic Automatic Speaker Identification System for Korean Speakers

Histogram Equalization Using Background Speakers' Utterances for Speaker Identification

Restoration of damaged speech files using deep neural networks

Speaker Identification on Various Environments Using an Ensemble of Kernel Principal Component Analysis
