2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 2018
DOI: 10.1109/embc.2018.8512237
|View full text |Cite
|
Sign up to set email alerts
|

Crackle and Breathing Phase Detection in Lung Sounds with Deep Bidirectional Gated Recurrent Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
33
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 40 publications
(38 citation statements)
references
References 15 publications
2
33
0
Order By: Relevance
“…However, the dataset used was relatively small (489 15-s recordings) and the task was to detect inspiratory and expiratory sounds in a 0.1-s segment (time frame) instead of detecting the events of inhalations and exhalations. Messner et al [25] applied the BiGRU to two features, namely Mel-frequency cepstral coefficients (MFCCs) and short-time Fourier transform (STFT)-derived spectrograms, and achieved an F1 score of approximately 86% for breath phase detection based on 4,656 inhalations and 4,720 exhalations and an F1 score of approximately 72% for crackle detection based on 1,339 crackle events. Jácome et al [26] used a faster region-based CNN (Faster R-CNN) framework to obtain a sensitivity of 97.5% and specificity of 85% in inspiratory phase detection and a sensitivity of 95.5% and specificity of 82.5% in expiratory phase detection, which was based on a dataset comprising 3,212 inspiratory phases and 2,842 expiratory phases.…”
Section: Plos Onementioning
confidence: 99%
See 2 more Smart Citations
“…However, the dataset used was relatively small (489 15-s recordings) and the task was to detect inspiratory and expiratory sounds in a 0.1-s segment (time frame) instead of detecting the events of inhalations and exhalations. Messner et al [25] applied the BiGRU to two features, namely Mel-frequency cepstral coefficients (MFCCs) and short-time Fourier transform (STFT)-derived spectrograms, and achieved an F1 score of approximately 86% for breath phase detection based on 4,656 inhalations and 4,720 exhalations and an F1 score of approximately 72% for crackle detection based on 1,339 crackle events. Jácome et al [26] used a faster region-based CNN (Faster R-CNN) framework to obtain a sensitivity of 97.5% and specificity of 85% in inspiratory phase detection and a sensitivity of 95.5% and specificity of 82.5% in expiratory phase detection, which was based on a dataset comprising 3,212 inspiratory phases and 2,842 expiratory phases.…”
Section: Plos Onementioning
confidence: 99%
“…signals were then processed using STFT [25,54,68]. In the STFT, we set a Hanning window size of 256 and hop length of 64; no additional zero-padding was applied.…”
Section: Plos Onementioning
confidence: 99%
See 1 more Smart Citation
“…Feature extraction in state-of-the-art deep learning based systems typically involves generating twodimensional time-frequency spectrograms that are able to capture both fine grained temporal and spectral information as well as present a much wider time context than single frame analysis. While a variety of spectrogram transformations have been utilised, Mel-based methods such as log-Mel spectra [19], [20], [21] and stacked MFCC features [19], [22], [23], [24], [25], [26] are the most popular ones. Some researchers combined different types of spectrogram, e.g.…”
Section: Introductionmentioning
confidence: 99%
“…A more serious issue with this research field has been the difficulty of comparing between techniques due to the lack of standardised datasets for evaluation. Most publications evaluate on proprietary datasets that are unavailable to others [9], [10], [13], [19], [25].…”
Section: Introductionmentioning
confidence: 99%