Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1801
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the Utility of Multimodal Conversational Technology and Audiovisual Analytic Measures for the Assessment and Monitoring of Amyotrophic Lateral Sclerosis at Scale

Abstract: We propose a cloud-based multimodal dialog platform for the remote assessment and monitoring of Amyotrophic Lateral Sclerosis (ALS) at scale. This paper presents our vision, technology setup, and an initial investigation of the efficacy of the various acoustic and visual speech metrics automatically extracted by the platform. 82 healthy controls and 54 people with ALS (pALS) were instructed to interact with the platform and completed a battery of speaking tasks designed to probe the acoustic, articulatory, pho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 32 publications
0
11
0
Order By: Relevance
“…The prominent Audio/Visual Emotion Challenge and Workshop ( AVEC ) addressed this aspect: featured sub-challenges in which audio and video data or features from clinical interviews ( 92 ) and interviews with virtual agents ( 93 , 94 ) from the Distress Analysis Interview Corpus [DAIC, ( 32 )] are provided as well as data on bipolar disorder ( 95 ). In addition, setups in which data is collected from the smartphone's camera as additional video input within a commercial setup are nowadays easily conceivable ( 96 ). The number of smart devices with sensors is constantly growing and therefore this topic has also been increasingly reflected in more recent dataset publications in this review ( 40 , 48 , 65 , 72 ).…”
Section: Discussionmentioning
confidence: 99%
“…The prominent Audio/Visual Emotion Challenge and Workshop ( AVEC ) addressed this aspect: featured sub-challenges in which audio and video data or features from clinical interviews ( 92 ) and interviews with virtual agents ( 93 , 94 ) from the Distress Analysis Interview Corpus [DAIC, ( 32 )] are provided as well as data on bipolar disorder ( 95 ). In addition, setups in which data is collected from the smartphone's camera as additional video input within a commercial setup are nowadays easily conceivable ( 96 ). The number of smart devices with sensors is constantly growing and therefore this topic has also been increasingly reflected in more recent dataset publications in this review ( 40 , 48 , 65 , 72 ).…”
Section: Discussionmentioning
confidence: 99%
“…Remote longitudinal assessment tools for orofacial and speech applications have several clinical and research benefits. From a research perspective, these home-based assessments can be used to detect impairments associated with neurodegenerative disease that, with further development, could aid in clinical decision-making [ 15 ]. Home-based assessment also enables in-depth quantification of stability or change over time in individuals with neurological diseases due to disease progression or intervention [ 5 , 7 , 45 ].…”
Section: Discussionmentioning
confidence: 99%
“…Although these cameras are cumbersome to use and thus are not feasible for home-based use, high-quality 2D cameras in consumer electronics (e.g., smartphones and tablets) show promise for enabling high-quality orofacial assessment remotely from patients' homes. Recent research has demonstrated that that artificial intelligence-enabled remote assessment using 2D cameras can detect neurological impairment [ 15 ] and frequent at-home assessments of individuals with neurodegenerative disease can enhance disease monitoring [ 16 ]. There is an urgent need to explore the measurement properties of 2D camera systems in the context of remote assessment.…”
Section: Introductionmentioning
confidence: 99%
“…Traditional laboratory-based assessments of speech/orofacial kinematics, such as electromagnetic articulography (EMA) and 3D cameras, have a rich history of use in this research space; however, these technologies are expensive and require specialized training for operation and data analysis [16][17][18][19][20][21][22]. Consumer-grade technologies have started to show promise for disease detection/classification [4,[23][24][25]. For example, Neumann et al [24] showed that ALS could be detected across the disease severity range using laptopbased cameras and microphones in naturalistic settings via a web-based software tool.…”
Section: Introductionmentioning
confidence: 99%
“…Consumer-grade technologies have started to show promise for disease detection/classification [4,[23][24][25]. For example, Neumann et al [24] showed that ALS could be detected across the disease severity range using laptopbased cameras and microphones in naturalistic settings via a web-based software tool. These collective findings support further development of video-based approaches for digital health technologies.…”
Section: Introductionmentioning
confidence: 99%