In individuals with major depressive disorder, neurophysiological changes often alter motor control and thus affect the mechanisms controlling speech production and facial expression. These changes are typically associated with psychomotor retardation, a condition marked by slowed neuromotor output that is behaviorally manifested as altered coordination and timing across multiple motor-based properties. Changes in motor outputs can be inferred from vocal acoustics and facial movements as individuals speak. We derive novel multi-scale correlation structure and timing feature sets from audio-based vocal features and video-based facial action units from recordings provided by the 4th International Audio/Visual Emotion Challenge (AVEC). The feature sets enable detection of changes in coordination, movement, and timing of vocal and facial gestures that are potentially symptomatic of depression. Combining complementary features in Gaussian mixture model and extreme learning machine classifiers, our multivariate regression scheme predicts Beck Depression Inventory ratings on the AVEC test set with a root-mean-square error of 8.12 and a mean absolute error of 6.31. Future work calls for continued study into detection of neurological disorders based on altered coordination and timing across audio and video modalities.
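The multi-scale correlation structure the abstract refers to can be illustrated with a short sketch. The code below is a minimal, hypothetical implementation of one common realization of the idea (not the authors' exact pipeline): stack time-delayed copies of each feature channel (e.g., formant tracks or facial action-unit time series) at several delay spacings, form the channel-delay correlation matrix at each scale, and keep its eigenvalue spectrum as a coordination feature. The function name, delay values, and embedding size are illustrative assumptions.

```python
import numpy as np

def channel_delay_corr_features(X, delays=(1, 3, 7), num_delays=5):
    """Eigenvalue features from multi-scale channel-delay correlation matrices.

    X : (n_channels, n_frames) array of low-level feature tracks.
    For each delay spacing d, every channel is embedded with num_delays
    delayed copies spaced d frames apart; the eigenvalue spectrum of the
    resulting correlation matrix summarizes cross-channel coordination
    at that time scale. (Illustrative sketch, not the published system.)
    """
    n_ch, n_fr = X.shape
    feats = []
    for d in delays:
        span = d * (num_delays - 1)          # frames consumed by embedding
        rows = [X[c, k * d : n_fr - span + k * d]
                for c in range(n_ch) for k in range(num_delays)]
        R = np.corrcoef(np.vstack(rows))     # channel-delay correlation matrix
        feats.append(np.linalg.eigvalsh(R)[::-1])  # eigenvalues, descending
    return np.concatenate(feats)
```

Smaller delay spacings capture fast, fine-grained coupling between channels, while larger spacings capture slower coordination; concatenating the spectra gives a fixed-length, multi-scale feature vector per recording.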
In Major Depressive Disorder (MDD), neurophysiologic changes can alter motor control [1,2] and therefore alter speech production by influencing the characteristics of the vocal source, tract, and prosodics. Clinically, many of these characteristics are associated with psychomotor retardation, where a patient shows sluggishness and motor disorder in vocal articulation, affecting coordination across multiple aspects of production [3,4]. In this paper, we exploit such effects by selecting features that reflect changes in coordination of vocal tract motion associated with MDD. Specifically, we investigate changes in correlation that occur at different time scales across formant frequencies and also across channels of the delta-mel-cepstrum. Both feature domains provide measures of coordination in vocal tract articulation while reducing the effects of a slowly varying linear channel, which can be introduced by time-varying microphone placements. With these two complementary feature sets, using the AVEC 2013 depression dataset, we design a novel Gaussian mixture model (GMM)-based multivariate regression scheme, referred to as Gaussian Staircase Regression, that provides a root-mean-square error (RMSE) of 7.42 and a mean absolute error (MAE) of 5.75 on the standard Beck depression rating scale. We are currently exploring coordination measures of other aspects of speech production, derived from both audio and video signals.
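The "staircase" idea can be sketched as follows. This is a hedged reconstruction under stated assumptions, not the published system: the training labels are split at a series of thresholds along the depression scale, a low/high GMM pair is fit at each threshold, and each pair contributes a soft vote via its log-likelihood ratio; the summed votes are rescaled onto the label range. The function names, the sigmoid vote, and the sklearn `GaussianMixture` settings are illustrative choices.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def gaussian_staircase_fit(X, y, thresholds):
    """Fit one low/high GMM pair per threshold ('stair') on features X,
    splitting the training set by the regression label y."""
    stairs = []
    for t in thresholds:
        lo = GaussianMixture(n_components=2, covariance_type="diag",
                             random_state=0).fit(X[y < t])
        hi = GaussianMixture(n_components=2, covariance_type="diag",
                             random_state=0).fit(X[y >= t])
        stairs.append((lo, hi))
    return stairs

def gaussian_staircase_predict(stairs, X, y_min, y_max):
    """Each stair votes via a sigmoid of its high-vs-low log-likelihood
    ratio; mean vote is mapped linearly onto [y_min, y_max]."""
    votes = np.zeros(len(X))
    for lo, hi in stairs:
        llr = np.clip(hi.score_samples(X) - lo.score_samples(X), -50, 50)
        votes += 1.0 / (1.0 + np.exp(-llr))
    return y_min + (y_max - y_min) * votes / len(stairs)
```

Each binary split is an easier problem than direct regression, and combining many splits along the scale yields a graded estimate; RMSE and MAE are then computed between these predictions and the reference ratings.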