2022
DOI: 10.1101/2022.02.07.22270598
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Can you hear me now? Clinical applications of audio recordings

Abstract: Audio and speech have several implicit characteristics that have the potential for the identification and quantification of clinical disorders. This PRISMA-guided review is designed to provide an overview of the landscape of automated clinical audio processing to build data-driven predictive models and infer phenotypes of a variety of neuropsychiatric, cardiac, respiratory and other disorders. We detail the important components of this processing workflow, specifically data acquisition and processing, algorith… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 92 publications
(147 reference statements)
0
6
0
Order By: Relevance
“…In addition, we plan to analyze voice and audio characteristics and extract-related features (such as pause rates, pitch, loudness, acoustic and spectral features, and multiple speakers, such as parent and child) [ 51 , 52 ]. Voice analytics will add a new dimension to PGHD analytics by investigating vocal and environmental audio features (markers) with patient notes and building a multimodal pipeline, such as improving transcription quality, improving sentiment analysis, identifying the environmental factors [ 53 , 54 ], and guiding future data collection protocols.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, we plan to analyze voice and audio characteristics and extract-related features (such as pause rates, pitch, loudness, acoustic and spectral features, and multiple speakers, such as parent and child) [ 51 , 52 ]. Voice analytics will add a new dimension to PGHD analytics by investigating vocal and environmental audio features (markers) with patient notes and building a multimodal pipeline, such as improving transcription quality, improving sentiment analysis, identifying the environmental factors [ 53 , 54 ], and guiding future data collection protocols.…”
Section: Discussionmentioning
confidence: 99%
“…It would be beneficial to comprehensively quantify the sensitivity of our and other such algorithms by establishing a "gold standard" of recordings labeled in terms of speakers, e.g., patients, clinicians and other speakers. Our algorithm also leveraged the placement of the lavalier microphone on the patient, as is commonplace in clinical research settings [8], [17]. Thus, it's performance may not translate for other recording settings, which may need their own specialized algorithms.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…To the best of our knowledge, this is one of the largest clinical audio datasets collected for research purposes [8].…”
Section: A Audio Datasetmentioning
confidence: 99%
See 2 more Smart Citations