[...] norm for comparative rating [...] automatic prediction from the value. In the rest of this paper [...] proposed measure, analyze its scores, and compare it to other [...] (SNRseg), the distortion m[...] several known features of [...] sounds by the human ear, [...]. Frequency scale [...] band integration in the cochlea, [...]
ABSTRACT

A new, perceptually motivated objective measure for estimating the subjective quality of coded speech is presented. It takes into account (i) auditory frequency warping (Bark transformation), (ii) critical-band integration, (iii) amplitude sensitivity variations with frequency, and (iv) conversion from loudness level to loudness. For each 10 ms segment of an utterance, a weighted spectral vector is computed via 15 critical-band filters. The overall distortion, called Bark Spectral Distortion (BSD), is the average squared Euclidean distance between the spectral vectors of the original and coded utterances. In tests with speech distorted by a modulated noise reference unit (MNRU) or coded at rates of 2.4-64 kb/s, the measure predicted mean opinion score (MOS) ratings notably better than segmental SNR. The standard error in estimating MOS scores with the new measure was 0.2-0.3.
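The final averaging step described in the abstract can be illustrated with a minimal sketch. It assumes the per-segment spectral vectors (after Bark warping, critical-band integration, frequency-dependent weighting, and loudness conversion) have already been computed; that front end is not reproduced here. The function name `bark_spectral_distortion`, the constant `NUM_BANDS`, and the sample data are hypothetical and chosen only for illustration. Any normalization applied beyond the plain average is not stated in the abstract and is therefore omitted.

```python
from typing import Sequence

NUM_BANDS = 15  # critical-band filters per 10 ms segment, as stated in the abstract


def bark_spectral_distortion(
    original: Sequence[Sequence[float]],
    coded: Sequence[Sequence[float]],
) -> float:
    """Average, over segments, of the squared Euclidean distance
    between the spectral vectors of the original and coded utterance.

    Assumption: both inputs are time-aligned lists of per-segment
    vectors that already went through the perceptual front end.
    """
    if len(original) != len(coded):
        raise ValueError("utterances must have the same number of segments")
    if not original:
        raise ValueError("at least one segment is required")

    total = 0.0
    for x, y in zip(original, coded):
        if len(x) != NUM_BANDS or len(y) != NUM_BANDS:
            raise ValueError(f"each spectral vector must have {NUM_BANDS} components")
        # Squared Euclidean distance for this segment.
        total += sum((a - b) ** 2 for a, b in zip(x, y))

    # Average over all segments of the utterance.
    return total / len(original)


# Hypothetical data: two segments, the second one distorted by 0.5 in every band.
orig_vectors = [[1.0] * NUM_BANDS, [2.0] * NUM_BANDS]
coded_vectors = [[1.0] * NUM_BANDS, [2.5] * NUM_BANDS]
print(bark_spectral_distortion(orig_vectors, coded_vectors))  # 1.875
```

In this toy case the first segment contributes 0 and the second contributes 15 × 0.25 = 3.75, so the average over two segments is 1.875. Identical utterances give a distortion of 0.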