Hybrid neural network based on novel audio feature for vehicle type identification

Chen, Haoze; Zhang, Zhijie

doi:10.1038/s41598-021-87399-1

Cited by 14 publications

(10 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Researchers [77] classify accelerating vehicles using audio and apply random noise and pitch shift to expand their dataset. Other work [14] focuses on vehicle type identification using MFCCs and other audio features, these authors apply city noise to augment captured signals. There have been additional applications of augmentation to acoustic vehicle DL related tasks [7,13,53].…”

Section: Audio Data Augmentationmentioning

confidence: 99%

The AI Mechanic: Acoustic Vehicle Characterization Neural Networks

Terwilliger¹,

Siegel²

2022

Preprint

View full text Add to dashboard Cite

In a world increasingly dependent on road-based transportation, it is essential to understand vehicles. We introduce the AI mechanic, an acoustic vehicle characterization deep learning system, as an integrated approach using sound captured from mobile devices to enhance transparency and understanding of vehicles and their condition for non-expert users. We develop and implement novel cascading architectures for vehicle understanding, which we define as sequential, conditional, multilevel networks that process raw audio to extract highly-granular insights. To showcase the viability of cascading architectures, we build a multi-task convolutional neural network that predicts and cascades vehicle attributes to enhance fault detection. We train and test these models on a synthesized dataset reflecting more than 40 hours of augmented audio and achieve > 92% validation set accuracy on attributes (fuel type, engine configuration, cylinder count and aspiration type). Our cascading architecture additionally achieved 93.6% validation and 86.8% test set accuracy on misfire fault prediction, demonstrating margins of 16.4% / 7.8% and 4.2% / 1.5% improvement over naïve and parallel baselines. We explore experimental studies focused on acoustic features, data augmentation, feature fusion, and data reliability. Finally, we conclude with a discussion of broader implications, future directions, and application areas for this work.

show abstract

Section: Audio Data Augmentationmentioning

confidence: 99%

The AI Mechanic: Acoustic Vehicle Characterization Neural Networks

Terwilliger¹,

Siegel²

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Piczak 4 first proposed the use of 2-D CNN to learn Log-Mel spectrogram features, which has significantly improved ESC performance compared with traditional machine learning algorithms such as KNN and SVM. Chen et al 5 accurately identified the audio signal of the vehicle by fusing the LSTM unit into the convolutional neural network. Boddapati et al 6 uses AlexNet 7 and GoogLeNet 8 to classify the environmental sound features extracted from the spectrum.…”

Section: Related Workmentioning

confidence: 99%

Fast environmental sound classification based on resource adaptive convolutional neural network

Zheng

Yin

et al. 2022

Sci Rep

View full text Add to dashboard Cite

Recently, with the construction of smart city, the research on environmental sound classification (ESC) has attracted the attention of academia and industry. The development of convolutional neural network (CNN) makes the accuracy of ESC reach a higher level, but the accuracy improvement brought by CNN is often accompanied by the deepening of network layers, which leads to the rapid growth of parameters and floating-point operations (FLOPs). Therefore, it is difficult to transplant CNN model to embedded devices, and the classification speed is also difficult to accept. In order to reduce the hardware requirements of running CNN and improve the speed of ESC, this paper proposes a resource adaptive convolutional neural network (RACNN). RACNN uses a novel resource adaptive convolutional (RAC) module, which can generate the same number of feature maps as conventional convolution operations more cheaply, and extract the time and frequency features of audio efficiently. The RAC block based on the RAC module is designed to build the lightweight RACNN model, and the RAC module can also be used to upgrade the existing CNN model. Experiments based on public datasets show that RACNN achieves higher performance than the state-of-the-art methods with lower computational complexity.

show abstract

“…Mel-frequency cepstral coefficients (MFCCs) are used in conjunction with ML or deep learning (DL) as features in a number of existing AVDI systems: in [8] they are used with a modified MLP, in [5] they are extracted from a specific high energy audio region and used with an ANN and knearest neighbors (KNN) classifier, and in [9] they are used in a feature set containing the pitch class profile (PCP) and short-term energy (STE) of vehicle audio signals in a hybrid convolutional neural network (CNN) containing a long shortterm memory (LSTM) layer.…”

Section: Related Workmentioning

confidence: 99%

“…[5] MFCC KNN /ANN [6] DFT SVM [7] STFT SVM [8] MFCC MLP [9] MFCC CNN /PCP/STE /LSTM [10] Mod-PCEN SNN [11] GCC RANSAC [12] GCC RANSAC [13] DWT LR architecture in which information acquired at sub-Nyquist rates from different frequency bands is used in a range of applications without prior signal reconstruction.…”

Section: Related Workmentioning

confidence: 99%

“…Various low-cost, low-complexity VDI techniques have been proposed, with acoustic vehicle detection and identification (AVDI) in particular being the subject of wide-ranging research [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15]. The low price and simple installation process of AVDI systems make them an attractive alternative to other more expensive and difficult-to-install VDI systems, such as video camera-, radar-, or induction loop coilbased systems.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

C-AVDI: Compressive Measurement-Based Acoustic Vehicle Detection and Identification

Dawton¹,

Ishida

Arakawa

2021

IEEE Access

View full text Add to dashboard Cite

As society grows ever more interconnected, the need for sophisticated signal processing and data analysis techniques becomes increasingly apparent. This is particularly true in the field of intelligent transportation systems (ITSs), where various sensing applications generate data at an exponential rate. In this paper, we present C-AVDI, a compressive measurement-based acoustic vehicle detection and identification architecture capable of extracting information from vehicle audio signals while sampling at sub-Nyquist rates. In addition, we further reduce the overall complexity by performing any necessary signal filtering during the acquisition process, removing the need for a separate filtering stage in the system's front-end. Our results obtained from data collected under a range of weather conditions present an accuracy of 80% with a back-end analog-to-digital converter (ADC) sample rate of 3 kHz, with initial results from a microcontroller (MCU) implementation of our proposed system presenting an accuracy of 72%.

show abstract

Hybrid neural network based on novel audio feature for vehicle type identification

Cited by 14 publications

References 17 publications

The AI Mechanic: Acoustic Vehicle Characterization Neural Networks

The AI Mechanic: Acoustic Vehicle Characterization Neural Networks

Fast environmental sound classification based on resource adaptive convolutional neural network

C-AVDI: Compressive Measurement-Based Acoustic Vehicle Detection and Identification

Contact Info

Product

Resources

About