2011
DOI: 10.1121/1.3642604
|View full text |Cite
|
Sign up to set email alerts
|

The Timbre Toolbox: Extracting audio descriptors from musical signals

Abstract: The analysis of musical signals to extract audio descriptors that can potentially characterize their timbre has been disparate and often too focused on a particular small set of sounds. The Timbre Toolbox provides a comprehensive set of descriptors that can be useful in perceptual research, as well as in music information retrieval and machine-learning approaches to content-based retrieval in large sound databases. Sound events are first analyzed in terms of various input representations (short-term Fourier tr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
260
0
6

Year Published

2012
2012
2022
2022

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 287 publications
(286 citation statements)
references
References 29 publications
0
260
0
6
Order By: Relevance
“…Indeed, because timbral perception does not require formalised musical knowledge, individuals could be expected to vary in the information they can access for this task purely on the basis of what they have previously listened to, and to what extent. We will look at three other datasets-including shorter, 400ms clips-and explore other features, for example those provided by Peeters and colleagues' recently published toolbox [28], as well as standard MFCC coefficients and spectral centroid-based measures.…”
Section: Discussionmentioning
confidence: 99%
“…Indeed, because timbral perception does not require formalised musical knowledge, individuals could be expected to vary in the information they can access for this task purely on the basis of what they have previously listened to, and to what extent. We will look at three other datasets-including shorter, 400ms clips-and explore other features, for example those provided by Peeters and colleagues' recently published toolbox [28], as well as standard MFCC coefficients and spectral centroid-based measures.…”
Section: Discussionmentioning
confidence: 99%
“…In the area of music information retrieval a lot of research has been devoted to extract audio descriptors (or features) that could concisely represent sound and music [24]. Several software libraries are available to easily extract brightness, spectral flux, and other descriptors from a given soundfile and to collect statistical descriptors from them.…”
Section: Reducing Dimensionality: a Compact Description Of Soundsmentioning
confidence: 99%
“…We extract features presented in [7] and others typically used in the Music Information Retrieval field: the four spectral moments (Centroid, Spread, Skewness, Kurtosis), other spectral indicators (Brightness, Rolloff, Flux, Irregularity, Flatness), features related to the distribution of harmonics (Tristimulus coefficients, Odd-Even ratio), two vectorial features describing the spectrum shape (Mel-Frequency Cepstral Coefficients, MFCC, and Spectral Contrast [12]) and some temporal features (Attack time, Attack slope, RMS energy and Zero-crossing rate). We refer to [13], [14], [15], [16], [17] for a detailed explanation of these features.…”
Section: B Feature Analysismentioning
confidence: 99%
“…Low-level features are objective descriptors devoted to capture specific aspects of the sound. Since the timbre is the combination of many factors ranging from acoustics to perception, feature-based analysis resulted particularly suitable for musical instruments characterization [6] [7] [8]. In [9] the authors take advantage of feature-based analysis for a musical instruments recognition scenario.…”
Section: Introductionmentioning
confidence: 99%