Mean opinion score ratings of reproduced sound quality typically pool all contributing perceptual factors into a single rating of basic audio quality. In order to improve understanding of the trade-offs between selected sound quality degradations that might arise in systems for the delivery of high quality multichannel audio, it was necessary to evaluate the influence of timbral and spatial fidelity changes on basic audio quality grades. The relationship between listener ratings of degraded multichannel audio quality on one timbral and two spatial fidelity scales was exploited to predict basic audio quality ratings of the same material using a regression model. It was found that timbral fidelity ratings dominated but that spatial fidelity predicted a substantial proportion of the basic audio quality.
Previous research has indicated that the relationship between the interaural cross-correlation coefficient ͑IACC͒ of a narrow-band sound and its perceived auditory source width is dependent on its frequency. However, this dependency has not been investigated in sufficient detail for researchers to be able to properly model it in order to produce a perceptually relevant IACC-based model of auditory source width. A series of experiments has therefore been conducted to investigate this frequency dependency in a controlled manner, and to derive an appropriate model. Three main factors were discovered in the course of these experiments. First, the nature of the frequency dependency of the perceived auditory source width of stimuli with an IACC of 1 was determined, and an appropriate mathematical model was derived. Second, the loss of perceived temporal detail at high frequencies, caused by the breakdown of phase locking in the ear, was found to be relevant, and the model was modified accordingly using rectification and a low-pass filter. Finally, it was found that there was a further frequency dependency at low frequencies, and a method for modeling this was derived. The final model was shown to predict the experimental data well.
The preferences of a large number of naïve listeners were elicited in response to a selection of multichannel audio items that had been degraded in quality by using band-limiting and down-mixing algorithms. Relationships were sought between these preference ratings and the quality judgements of experienced listeners in an attempt to determine whether one could be predicted from the other. Results suggest that a simple regression model can be used to do this with adequate results, but that a better prediction can be successfully based on experienced listener ratings of timbral and spatial fidelity. There is a difference between naïve and experienced listeners in the weightings of the fidelities and their relationship to overall quality.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.