“…(9) Loudness in bark frequency bands was extracted using the algorithm from the Acoustics Research Centre at the University of Salford. 35 The process starts by filtering the sound using a bank of 28 log-spaced filters ranging from 250 Hz to 12.5 kHz, each filter having a one-third octave bandwidth to mirror the critical bands of the auditory periphery. The outputs of each filter are scaled to match human perception at each frequency range.…”
Salience describes the phenomenon by which an object stands out from a scene. While its
underlying processes are extensively studied in vision, mechanisms of auditory salience
remain largely unknown. Previous studies have used well-controlled auditory scenes to shed
light on some of the acoustic attributes that drive the salience of sound events.
Unfortunately, the use of constrained stimuli in addition to a lack of well-established
benchmarks of salience judgments hampers the development of comprehensive theories of
sensory-driven auditory attention. The present study explores auditory salience in a set
of dynamic natural scenes. A behavioral measure of salience is collected by having human
volunteers listen to two concurrent scenes and indicate continuously which one attracts
their attention. By using natural scenes, the study takes a data-driven rather than
experimenter-driven approach to exploring the parameters of auditory salience. The
findings indicate that the space of auditory salience is multidimensional (spanning
loudness, pitch,
spectral shape, as well as other acoustic attributes), nonlinear and highly
context-dependent. Importantly, the results indicate that contextual information about the entire
scene over both short and long scales needs to be considered in order to properly account
for perceptual judgments of salience.
“…(9) Loudness in bark frequency bands was extracted using the algorithm from the Acoustics Research Centre at the University of Salford. 35 The process starts by filtering the sound using a bank of 28 log-spaced filters ranging from 250 Hz to 12.5 kHz, each filter having a one-third octave bandwidth to mirror the critical bands of the auditory periphery. The outputs of each filter are scaled to match human perception at each frequency range.…”
Salience describes the phenomenon by which an object stands out from a scene. While its
underlying processes are extensively studied in vision, mechanisms of auditory salience
remain largely unknown. Previous studies have used well-controlled auditory scenes to shed
light on some of the acoustic attributes that drive the salience of sound events.
Unfortunately, the use of constrained stimuli in addition to a lack of well-established
benchmarks of salience judgments hampers the development of comprehensive theories of
sensory-driven auditory attention. The present study explores auditory salience in a set
of dynamic natural scenes. A behavioral measure of salience is collected by having human
volunteers listen to two concurrent scenes and indicate continuously which one attracts
their attention. By using natural scenes, the study takes a data-driven rather than
experimenter-driven approach to exploring the parameters of auditory salience. The
findings indicate that the space of auditory salience is multidimensional (spanning
loudness, pitch,
spectral shape, as well as other acoustic attributes), nonlinear and highly
context-dependent. Importantly, the results indicate that contextual information about the entire
scene over both short and long scales needs to be considered in order to properly account
for perceptual judgments of salience.
“…LL Z was calculated from the time-averaged 1/3-octave-band SPLs from 25 Hz to 12.5 kHz according to the algorithm described in Ref. 25, The relationships between the loudness score and the values assessed by the respective indicators are given in Fig. 5.…”
Wind turbine noise has become a serious environmental noise issue. To investigate this problem, a research project has been conducted over the three years from fiscal year 2010. In this project, experimental studies on human auditory sensation of wind turbine noise have been conducted together with field measurements and social surveys. Since wind turbine noise is often argued to be a low-frequency noise issue, the experiments were conducted with emphasis on the low-frequency components of wind turbine noise. Among them, an auditory experiment was performed to investigate noise indicators suitable for the assessment of wind turbine noise. In this experiment, not only wind turbine noise but also various environmental noises that were recorded so as to include low-frequency components down to infrasound were reproduced down to 4 Hz and loudness tests were performed. The experimental results were evaluated using the A-and C-weighted sound pressure levels, Zwicker loudness level, and Moore-Glasberg loudness level. It has been found that the A-weighted sound pressure level is a simple and appropriate indicator for the loudness assessment of general environmental noises. In addition, the relationships between the noise indicators are discussed on the basis of the results of numerical investigations.
“…Table 2 explains the four items from the viewpoints of the dummy head position, time-course of the event, and movements of sound images. Figure 2 shows the time-courses of the loudness for these items estimated by Zwicker's method [16] in 500-ms intervals. Comparing Figs.…”
SUMMARYThe sense of presence is crucial to evaluate the performance of audio-visual (AV) equipment and content. Previously, the overall presence was evaluated for a set of AV content items by asking subjects to judge the presence of the entire content item. In this study, the sense of presence is evaluated for a time-series using the method of continuous judgment by category. Specifically, the audio signals of 40 content items with durations of approximately 30 s each were recorded with a dummy head, and then presented as stimuli to subjects via headphones. The corresponding visual signals were recorded using a video camera in the full-HD format, and reproduced on a 65-inch display. In the experiments, 20 subjects evaluated the instantaneous sense of presence of each item on a seven-point scale under two conditions: audio-only or audio-visual. At the end of the time-series, the subjects also evaluated the overall presence of the item by seven categories. Based on these results, the effects of visual information on the sense of presence were examined. The overall presence is highly correlated with the ten-percentile exceeded presence score, S 10 , which is the score that is exceeded for the 10% of the time during the responses. Based on the instantaneous presence data in this study, we are one step closer to our ultimate goal of developing a real-time operational presence meter. key words: content presence, method of continuous judgment by category, audio-visual information
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.