Rivka Levitan scite author profile

Automatic detection of depression has attracted increasing attention from researchers in psychology, computer science, linguistics, and related disciplines. As a result, promising depression detection systems have been reported. This paper surveys these efforts by presenting the first cross-modal review of depression detection systems and discusses best practices and most promising approaches to this task.

show abstract

Implementing Acoustic-Prosodic Entrainment in a Conversational Avatar

Levitan

Beňuš

Gálvez

et al. 2016

View full text Add to dashboard Cite

Entrainment, aka accommodation or alignment, is the phenomenon by which conversational partners become more similar to each other in behavior. While there has been much work on some behaviors there has been little on entrainment in speech and even less on how Spoken Dialogue Systems (SDS) which entrain to their users' speech can be created. We present an architecture and algorithm for implementing acoustic-prosodic entrainment in SDS and show that speech produced under this algorithm conforms to the feature targets, satisfying the properties of entrainment behavior observed in human-human conversations. We present results of an extrinsic evaluation of this method, comparing whether subjects are more likely to ask advice from a conversational avatar that entrains vs. one that does not, in English, Spanish and Slovak SDS.

show abstract

Speech vs. text: A comparative analysis of features for depression detection systems

Morales

Levitan

2016

View full text Add to dashboard Cite

Looking for Structure in Lexical and Acoustic-Prosodic Entrainment Behaviors

Weise

Levitan

2018

View full text Add to dashboard Cite

Entrainment has been shown to occur for various linguistic features individually. Motivated by cognitive theories regarding linguistic entrainment, we analyze speakers' overall entrainment behaviors and search for an underlying structure. We consider various measures of both acoustic-prosodic and lexical entrainment, measuring the latter with a novel application of two previously introduced methods in addition to a standard high-frequency word measure. We present a negative result of our search, finding no meaningful correlations, clusters, or principal components in various entrainment measures, and discuss practical and theoretical implications.

show abstract

OpenMM: An Open-Source Multimodal Feature Extraction Tool

Morales

Scherer

Levitan

2017

View full text Add to dashboard Cite

The primary use of speech is in face-to-face interactions and situational context and human behavior therefore intrinsically shape and affect communication. In order to usefully model situational awareness, machines must have access to the same streams of information humans have access to. In other words, we need to provide machines with features that represent each communicative modality: face and gesture, voice and speech, and language. This paper presents OpenMM: an open-source multimodal feature extraction tool. We build upon existing open-source repositories to present the first publicly available tool for multimodal feature extraction. The tool provides a pipeline for researchers to easily extract visual and acoustic features. In addition, the tool also performs automatic speech recognition (ASR) and then uses the transcripts to extract linguistic features. We evaluate the OpenMM's multimodal feature set on deception, depression and sentiment classification tasks and show its performance is very promising. This tool provides researchers with a simple way of extracting multimodal features and consequently a richer and more robust feature representation for machine learning tasks.

show abstract

A Linguistically-Informed Fusion Approach for Multimodal Depression Detection

Morales¹,

Scherer²,

Levitan³

2018

View full text Add to dashboard Cite

show abstract

Automatically Classifying Self-Rated Personality Scores from Speech

An¹,

Levitan²,

Levitan³

et al. 2016

View full text Add to dashboard Cite

Automatic personality recognition is useful for many computational applications, including recommendation systems, dating websites, and adaptive dialogue systems. There have been numerous successful approaches to classify the "Big Five" personality traits from a speaker's utterance, but these have largely relied on judgments of personality obtained from external raters listening to the utterances in isolation. This work instead classifies personality traits based on self-reported personality tests, which are more valid and more difficult to identify. Our approach, which uses lexical and acoustic-prosodic features, yields predictions that are between 6.4% and 19.2% more accurate than chance. This approach predicts Opennessto-Experience and Neuroticism most successfully, with less accurate recognition of Extroversion. We compare the performance of classification and regression techniques, and also explore predicting personality clusters.

show abstract

Identifying Individual Differences in Gender, Ethnicity, and Personality from Dialogue for Deception Detection

Levitan¹,

Levitan²,

An³

et al. 2016

View full text Add to dashboard Cite

When automatically detecting deception, it is important to model individual differences across speakers. We explore the automatic identification of individual traits such as gender, native language, and personality, using acoustic-prosodic and lexical features from an initial non-deceptive dialogue. We also explore predicting success at deception and at deception detection, using the same features.

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Rivka Levitan

A Cross-modal Review of Indicators for Depression Detection Systems

Implementing Acoustic-Prosodic Entrainment in a Conversational Avatar

Speech vs. text: A comparative analysis of features for depression detection systems

Looking for Structure in Lexical and Acoustic-Prosodic Entrainment Behaviors

OpenMM: An Open-Source Multimodal Feature Extraction Tool

A Linguistically-Informed Fusion Approach for Multimodal Depression Detection

Automatically Classifying Self-Rated Personality Scores from Speech

Identifying Individual Differences in Gender, Ethnicity, and Personality from Dialogue for Deception Detection

Contact Info

Product

Resources

About