2020
DOI: 10.1016/j.apacoust.2019.107041
|View full text |Cite
|
Sign up to set email alerts
|

Environmental sound monitoring using machine learning on mobile devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 17 publications
0
9
0
1
Order By: Relevance
“…The importance of these areas for health protection and promotion is quite obvious, as they are potentially well suited to the needs of citizens of metropolitan areas in relieving stress and improving feelings of well-being. Further interesting developments will deal with analysis of tags used in social networks to describe everyday acoustic environments and/or pictures of places, and machine learning techniques for sound categorization in terms of perception [93,94].…”
Section: Discussionmentioning
confidence: 99%
“…The importance of these areas for health protection and promotion is quite obvious, as they are potentially well suited to the needs of citizens of metropolitan areas in relieving stress and improving feelings of well-being. Further interesting developments will deal with analysis of tags used in social networks to describe everyday acoustic environments and/or pictures of places, and machine learning techniques for sound categorization in terms of perception [93,94].…”
Section: Discussionmentioning
confidence: 99%
“…Audio textures and its associated features, such as LBP and HOG, are used in ASA [106]. In many application MFCCs have been used for ASA [107]. Recently deep learning algorithms have taken the TFR of the textural audio as an image and performed various classification tasks [32,33,108,24,109,110].…”
Section: Applications Of Audio Texturesmentioning
confidence: 99%
“…Thus, the MFCC coefficients retrieves data related to the sound as a short-term power spectrum composed by a linear cosine transform of a log power spectrum on a nonlinear Mel-scale of frequency, which, as previously used in other studies, e.g. , [7 , 13] , it consists in the codification of audio data for the identification of the different frequencies. It allows the categorization of the different data collected, because it discretizes the data for the correct identification.…”
Section: Data Descriptionmentioning
confidence: 99%