Purpose The purpose of this research note is to provide a performance comparison of available algorithms for the automated evaluation of oral diadochokinesis using speech samples from patients with amyotrophic lateral sclerosis (ALS). Method Four different algorithms based on a wide range of signal processing approaches were tested on a sequential motion rate /pa/-/ta/-/ka/ syllable repetition paradigm collected from 18 patients with ALS and 18 age- and gender-matched healthy controls (HCs). Results The best temporal detection of syllable position for a 10-ms tolerance value was achieved for ALS patients using a traditional signal processing approach based on a combination of filtering in the spectrogram, Bayesian detection, and polynomial thresholding with an accuracy rate of 74.4%, and for HCs using a deep learning approach with an accuracy rate of 87.6%. Compared to HCs, a slow diadochokinetic rate ( p < .001) and diadochokinetic irregularity ( p < .01) were detected in ALS patients. Conclusions The approaches using deep learning or multiple-step combinations of advanced signal processing methods provided a more robust solution to the estimation of oral DDK variables than did simpler approaches based on the rough segmentation of the signal envelope. The automated acoustic assessment of oral diadochokinesis shows excellent potential for monitoring bulbar disease progression in individuals with ALS.
<p class="BodyTextNext"><em>Evaluation of precision of consonant articulation is commonly used metric in assessment of pathological speech. </em><em>However, up to date most of the research on consonant characteristics was performed on English while there are obvious language-specific differences. The aim of the current study was therefore to investigate the patterns of consonant articulation in Czech across 6 stop consonants with respect to age and gender. The database used consisted of 30 female and 30 male healthy participants. Four acoustic variables including voice onset time (VOT), VOT ratio and two spectral moments were analyzed. The Czech plosives /p/, /t/ and /k/ were found to be characterized by short voicing lag (average VOT ranged from 14 to 32 ms) while voiced plosives /b/, /d/ and /g/ by long voicing lead (average VOT ranged from -79 to -91 ms). </em><em>Furthermore, we observed significantly longer duration of both VOT </em><em>(p < 0.05) </em><em>and VOT ratio </em><em>(p < 0.01) </em><em>of voiceless plosives in female compared to male gender. Finally, we revealed a significant negative correlation between age and duration of voiceless </em><em>(</em><em>r = -0.36, p </em><em>< 0.05) </em><em>as well as voiced VOT </em><em>(</em><em>r = -0.45, p =</em><em> 0.01) </em><em>in female but not in male participants.</em></p>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.