A Robust Wavelet-Based Text-Independent Speaker Identification

Surve, Sunil; Singh, N.M.; Lande, B. K.

doi:10.1109/iccima.2007.149

Cited by 3 publications

(3 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…MFDWC, which were initially defined to improve speech recongintion problems (Tavenei et al [187]), have been subsequently applied to other machine hearing realed applications such as speaker verification/identification (see Tufekci and Gurbuz [188], Nghia et al [189]), and audio-based surveillance systems (Rabaoui et al [24]). …”

Section: Wavelet-based Perceptual Featuresmentioning

confidence: 99%

A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds

2016

View full text Add to dashboard Cite

Endowing machines with sensing capabilities similar to those of humans is a prevalent quest in engineering and computer science. In the pursuit of making computers sense their surroundings, a huge effort has been conducted to allow machines and computers to acquire, process, analyze and understand their environment in a human-like way. Focusing on the sense of hearing, the ability of computers to sense their acoustic environment as humans do goes by the name of machine hearing. To achieve this ambitious aim, the representation of the audio signal is of paramount importance. In this paper, we present an up-to-date review of the most relevant audio feature extraction techniques developed to analyze the most usual audio signals: speech, music and environmental sounds. Besides revisiting classic approaches for completeness, we include the latest advances in the field based on new domains of analysis together with novel bio-inspired proposals. These approaches are described following a taxonomy that organizes them according to their physical or perceptual basis, being subsequently divided depending on the domain of computation (time, frequency, wavelet, image-based, cepstral, or other domains). The description of the approaches is accompanied with recent examples of their application to machine hearing related problems.

show abstract

Section: Wavelet-based Perceptual Featuresmentioning

confidence: 99%

A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds

2016

View full text Add to dashboard Cite

show abstract

“…Tuy nhiên để các ứng dụng xử lý tiếng nói trong máy tính có thể được áp dụng rộng rãi trong thực tế, tính tự nhiên của tiếng nói được xử lý cũng cần được quan tâm [2]. Để đảm bảo tiếng nói sau xử lý (như tiếng nói được tổng hợp) được tự nhiên, một trong những vấn đề quan trọng cần đảm bảo là thông tin về người nói, bao gồm cả các thông tin chung về người nói như giới tính, độ tuổi,…, đến các thông tin chi tiết như thông tin nhận danh chính xác người nói [3][4][5][6][7]. Các hệ thống tổng hợp tiếng nói nhân tạo thường chỉ có thể tổng hợp ra tiếng nói của một số giọng nói đã được thu sẵn và huấn luyện trước cho máy tính.…”

Section: Giới Thiệuunclassified

“…Nếu bỏ qua vấn đề tổng hợp giọng nguồn bằng HMM, bản chất của phương pháp biến đổi giọng người nói HTT là các khung của tiếng nói giọng nguồn được thay thế bằng các khung vật lý giống nhất của giọng đích trong cùng âm vị. Mặc dù việc lựa chọn và thay thế mẫu tiếng nói giọng nguồn bằng mẫu tiếng nói giọng đích đã được đề xuất trước đó [7], hiệu quả biến đổi giọng người nói trong HTT là vượt trội so với các phương pháp thay thế mẫu khác do việc sử dụng các khung tiếng nói rất ngắn thay thế các mẫu tiếng nói dài như âm vị [7] sẽ tối ưu việc tìm được khung/mẫu tiếng nói đích phù hợp nhất.…”

Section: Phƣơng Pháp Biến đổI Giọng Ngƣời Nói Dựa Vào Thay Thế Khungunclassified

Một kỹ thuật biến đổi giọng người nói hiệu quả sử dụng kỹ thuật phân rã tiếng nói theo thời gian

Nghĩa¹

2016

Công nghệ CNTT-TT

View full text Add to dashboard Cite

Voice transformation is an important issue in speech synthesis when we need to synthesize multiple output voices but do not want to rebuid the synthesis system. Speech transformed by the conventional method using Gaussian Mixture Model (GMM) is not high-quality due to the oversmoothness of GMM. Therefore, a number of methods have been proposed to overcome the disadvantages of the conventional method using GMM. Among them, Hidden Markov Model Trajectory Tiling (HTT) and Temporal Decomposition – GMM (TD-GMM) improve the effectiveness of voice transformation. However, they still have drawbacks. In this paper, a voice transformation method using the modified restricted TD (MRTD) is proposed. The experimental results with Vietnamese and English corpus confirm the effectiveness of the proposed method compared with HTT and TD-GMM.

show abstract

A Method of Extracting Feature of Phonic Signal Based on the Matrix Analysis

2013

AMM

View full text Add to dashboard Cite

In the speech recognition technology, feature extraction is essential for the system recognition rate, taking amount of strategies to find the better feature vectors are most researchers target. This paper presents a method of extracting feature of audio signal based on the discrete wavelet transform, then decomposed the coefficient matrix by the matrix analysis way, through this method to find a new thinking on the way of extracting feature vector. The method can be achieved in the procedure. The main purpose is to reduce the dimension of feature vector, make the vector briefer, and then reduce the computing complexity in the embedded system. This method can reduce the feature vectors dimension, accelerated the computing velocity.

show abstract

A Robust Wavelet-Based Text-Independent Speaker Identification

Cited by 3 publications

References 19 publications

A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds

A Review of Physical and Perceptual Feature Extraction Techniques for Speech, Music and Environmental Sounds

Một kỹ thuật biến đổi giọng người nói hiệu quả sử dụng kỹ thuật phân rã tiếng nói theo thời gian

A Method of Extracting Feature of Phonic Signal Based on the Matrix Analysis

Contact Info

Product

Resources

About