ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9747712
|View full text |Cite
|
Sign up to set email alerts
|

Determining the best Acoustic Features for Smoker Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 29 publications
0
1
0
Order By: Relevance
“…Similarly, a parameterized convolutional neural network (CNN) was used for acoustic modeling from raw waveform for the dysarthria speech recognition tasks [17]. In [18], a SincNet-based speech feature learning method was proposed to achieve automatic smoker identification tasks. It can be found that investigating learnable frontends has drawn a lot of attention from researchers and made significant progress in the field of speech process-ing.…”
Section: Related Workmentioning
confidence: 99%
“…Similarly, a parameterized convolutional neural network (CNN) was used for acoustic modeling from raw waveform for the dysarthria speech recognition tasks [17]. In [18], a SincNet-based speech feature learning method was proposed to achieve automatic smoker identification tasks. It can be found that investigating learnable frontends has drawn a lot of attention from researchers and made significant progress in the field of speech process-ing.…”
Section: Related Workmentioning
confidence: 99%
“…They investigate the performance of four acoustic feature sets or representations extracted using three feature extraction or learning approaches: (i) handcrafted feature sets, including the extended Geneva Minimalistic Acoustic Parameter Set and the Computational Paralinguistics Challenge Set; (ii) the Bag-of-Audio-Words representations; (iii) the neural representations extracted from raw waveform signals by SincNet. Experimental results show that: (i) SincNet feature representations are the most effective for smoker identification and outperform the MFCC baseline features by 16% in absolute accuracy; (ii) the performance of hand-crafted feature sets and the Bag-of-Audio-Words representations rely on the scale of the dimensions of feature vectors [10].…”
Section: Previous Studiesmentioning
confidence: 99%