2013
DOI: 10.1109/jproc.2013.2251591
|View full text |Cite
|
Sign up to set email alerts
|

An Overview on Perceptually Motivated Audio Indexing and Classification

Abstract: An audio indexing system aims at describing audio content by identifying, labeling or categorizing different acoustic events. Since the resulting audio classification and indexing is meant for direct human consumption, it is highly desirable that it produces perceptually relevant results. This can be obtained by integrating specific knowledge of the human auditory system in the design process to various extent. In this paper, we highlight some of the important concepts used in audio classification and indexing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
22
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 42 publications
(22 citation statements)
references
References 107 publications
0
22
0
Order By: Relevance
“…Although, providing the reader with a comprehensive view of specific machine hearing problems exceeds the goals of this work, the interested reader will find diverse examples of the machine hearing applications throughout the paper. Examples include speaker identification (like in Yuo et al [27]), music genre classification (Tzanetakis and Cook [28]), environmental sound recognition (e.g., the works by Ando [29] and Valero and Alías [30]), audio indexing and retrieval (Richard et al [31]), or CASA (as in the works by Peltonen et al [18], Chu et al [14], Valero and Alías [15]). …”
Section: Architecture Of Machine Hearing Systemsmentioning
confidence: 99%
See 3 more Smart Citations
“…Although, providing the reader with a comprehensive view of specific machine hearing problems exceeds the goals of this work, the interested reader will find diverse examples of the machine hearing applications throughout the paper. Examples include speaker identification (like in Yuo et al [27]), music genre classification (Tzanetakis and Cook [28]), environmental sound recognition (e.g., the works by Ando [29] and Valero and Alías [30]), audio indexing and retrieval (Richard et al [31]), or CASA (as in the works by Peltonen et al [18], Chu et al [14], Valero and Alías [15]). …”
Section: Architecture Of Machine Hearing Systemsmentioning
confidence: 99%
“…On the other hand, we can find those techniques that try to explicitly integrate perception in the parameterization process or derive it through the computation of signal features capable of extracting perceptually relevant aspects from the input audio, as described by Richard et al [31]. The former typically include in the parameterization process simplified audition models of the hearing system (e.g., by considering from Bark, Mel or Gammatone filter-banks to more complex models based on electroencephalograms).…”
Section: Audio Features Taxonomy and Review Of Extraction Techniquesmentioning
confidence: 99%
See 2 more Smart Citations
“…They constitute a set of scalar parameters related to the spectral description of the musical signal. They were the subject of an extensive literature; further studies of their extraction are presented in [23]. The choice of these features, said low level, are generally depending on the desired application and on extraction duration.…”
Section: ) Features Representing the Musical Signal Timbrementioning
confidence: 99%