2010 Fifth International Conference on Digital Information Management (ICDIM) 2010
DOI: 10.1109/icdim.2010.5664645
|View full text |Cite
|
Sign up to set email alerts
|

Environment sound recognition using zero crossing features and MPEG-7

Abstract: In this paper, we perform several experiments focusing on the problems of environment recognition from audio particularly for forensic application. We investigated the effect of temporal zero crossing fea ture and some selected MPEG-7 audio low level descriptors on environment sound recognition. The performance is evaluated against varying number of training sounds and samples per each training file. Experimental results show that higher recognition accuracy is achieved by increasing the number of training fil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2011
2011
2012
2012

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…Depending on the tasks of audio forensic practitioners, different sets of audio features could be employed. In [10], AlQahtani et al made use of MPEG-7 audio low level descriptors along with temporal zero crossing as features vector for automatic recognition of environment sounds. Recently, Sen et al proposed a new feature extraction technique coming from a new transformation which is based on the Nyquist filter bank and achieved significant result in speaker identification [11].…”
Section: B Audio Featuresmentioning
confidence: 99%
“…Depending on the tasks of audio forensic practitioners, different sets of audio features could be employed. In [10], AlQahtani et al made use of MPEG-7 audio low level descriptors along with temporal zero crossing as features vector for automatic recognition of environment sounds. Recently, Sen et al proposed a new feature extraction technique coming from a new transformation which is based on the Nyquist filter bank and achieved significant result in speaker identification [11].…”
Section: B Audio Featuresmentioning
confidence: 99%