2021
DOI: 10.1155/2021/9937383
|View full text |Cite
|
Sign up to set email alerts
|

Sound Classification Based on Multihead Attention and Support Vector Machine

Abstract: Sound classification is a broad area of research that has gained much attention in recent years. The sound classification systems based on recurrent neural networks (RNNs) and convolutional neural networks (CNNs) have undergone significant enhancements in the recognition capability of models. However, their computational complexity and inadequate exploration of global dependencies for long sequences restrict improvements in their classification results. In this paper, we show that there are still opportunities… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 31 publications
0
7
0
Order By: Relevance
“…The accuracy of the model can be defined by the formula given below: 𝑎𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = 𝑇𝑃+𝑇𝑁 𝑇𝑃+𝑇𝑁+𝐹𝑃+𝐹𝑁 ( 6 ) 3…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The accuracy of the model can be defined by the formula given below: 𝑎𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = 𝑇𝑃+𝑇𝑁 𝑇𝑃+𝑇𝑁+𝐹𝑃+𝐹𝑁 ( 6 ) 3…”
Section: Discussionmentioning
confidence: 99%
“…Previously, researchers used mathematical techniques like standard statistical pattern recognition (SPR), Gaussian classifier (GS), and Gaussian Mixture Model (GMM) to classify music genres. In the age of AI, researchers have been using various techniques of Machine learning like Multi-Class Support Vector Machine (SVM) [6], K-Nearest Neighbors (KNN) [7], Linear Kernel SVM, Polynomial Kernel SVM, Decision Tree, Random Forest, Ada Boost, Naïve Bayes, Linear Discriminant Analysis (LDA) classifier, Logistic Regression, and Sigmoid Kernel SVM [6], [8]- [10]. In modern days, deep learning is not only solving the problem of computer vision but also dealing with sequencing and time series problems [11][12].…”
Section: Introductionmentioning
confidence: 99%
“…Returning to the classification of sound/noise, which can be classically used as the base for the detection danger from the noise around an individual/population, [15] demonstrates that sound categorization performance can still be improved by swapping out the recurrent architecture for a parallel processing structure during feature extraction. The research processes the huge data and uses it to develop the model using Deep Learning Algorithms, namely CNN (Convolutional Neural Networks) and LSTM (Long Short-Term Memory).…”
Section: Technical Backgroundmentioning
confidence: 99%
“…In this research, the audio analysis technique adopted is the Fourier Transform and Mel-Spectrogram (similar to [31]), and the audio was sampled at 44.1kHz (just like in [32]) for further processing. Post-cleaning, the sound data is subjected to three different deep learning models (1D-CNN, 2D-CNN, and LSTM) for the classification of sound from a person's surroundings (the likes of which have been used in various pieces of research cited above, for example: [15]), and to detect a threat from it. If the threat is detected, then an automatic alert message is sent to the registered help or the emergency services.…”
Section: Technical Backgroundmentioning
confidence: 99%
“…In the aspect of audio recognition, Yang and Zhao [ 17 ] proposed an acoustic scene classification method based on the support vector machine (SVM), which enhanced the sound texture to improve the classification accuracy. Greco et al [ 18 ] proposed a voice recognition system based on the heuristic deep learning method.…”
Section: Related Workmentioning
confidence: 99%