Meishu Song scite author profile

The COVID-19 outbreak was announced as a global pandemic by the World Health Organisation in March 2020 and has affected a growing number of people in the past few weeks. In this context, advanced artificial intelligence techniques are brought to the fore in responding to fight against and reduce the impact of this global health crisis. In this study, we focus on developing some potential use-cases of intelligent speech analysis for COVID-19 diagnosed patients. In particular, by analysing speech recordings from these patients, we construct audio-onlybased models to automatically categorise the health state of patients from four aspects, including the severity of illness, sleep quality, fatigue, and anxiety. For this purpose, two established acoustic feature sets and support vector machines are utilised. Our experiments show that an average accuracy of .69 obtained estimating the severity of illness, which is derived from the number of days in hospitalisation. We hope that this study can foster an extremely fast, low-cost, and convenient way to automatically detect the COVID-19 disease.

show abstract

Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database

Song

Yang

Baird

et al. 2019

View full text Add to dashboard Cite

An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

Qian¹,

Song²,

Yang³

et al. 2020

Preprint

View full text Add to dashboard Cite

Computer Audition for Fighting the SARS-CoV-2 Corona Crisis—Introducing the Multitask Speech Corpus for COVID-19

Qian

Schmitt

Zheng

et al. 2021

IEEE Internet Things J.

View full text Add to dashboard Cite

Computer audition (CA) has experienced a fast development in the past decades by leveraging advanced signal processing and machine learning techniques. In particular, for its non-invasive and ubiquitous character by nature, CA based applications in healthcare have increasingly attracted attention in recent years. During the tough time of the global crisis caused by the COVID-19 (coronavirus disease 2019), scientists and engineers in data science have collaborated to think of novel ways in prevention, diagnosis, treatment, tracking, and management of this global pandemic. On one hand, we have witnessed the power of 5G, internet of things, big data, computer vision, and artificial intelligence in applications of epidemiology modelling, drug and/or vaccine finding and designing, fast CT screening, and quarantine management. On the other hand, relevant studies in exploring the capacity of CA are extremely lacking and underestimated. To this end, we propose a novel multi-task speech corpus for COVID-19 research usage. We collected 51 confirmed COVID-19 patients' in-the-wild speech data in Wuhan city, China. We define three main tasks in this corpus, i. e., three-category classification tasks for evaluating the physical and/or mental

show abstract

Adventitious Respiratory Classification Using Attentive Residual Neural Networks

Yang

Liu

Song

et al. 2020

View full text Add to dashboard Cite

Every year, respiratory diseases affect millions of people worldwide, becoming one of the main causes of death in nowadays society. Currently, the COVID-19-known as a novel respiratory illness-has triggered a global health crisis, which has been identified as the greatest challenge of our time since the Second World War. COVID-19 and many other respiratory diseases present often common symptoms, which impairs their early diagnosis; thus, restricting their prevention and treatment. In this regard, in order to encourage a faster and more accurate detection of these kinds of diseases, the automatic identification of respiratory illness through the application of machine learning methods is a very promising area of research aimed to support clinicians. With this in mind, we apply attention-based Convolutional Neural Networks for the recognition of adventitious respiratory cycles on the International Conference on Biomedical Health Informatics 2017 challenge database. Experimental results indicate that the architecture of residual networks with attention mechanism achieves a significant improvement w. r. t. the baseline models.

show abstract

Frustration recognition from speech during game interaction using wide residual networks

Song

Mallol-Ragolta

Parada-Cabaleiro

et al. 2021

Virtual Reality & Intelligent Hardware

View full text Add to dashboard Cite

Predicting Group Work Performance from Physical Handwriting Features in a Smart English Classroom

Song

Qian

Chen

et al. 2021

View full text Add to dashboard Cite

Supervised Contrastive Learning for Game-Play Frustration Detection from Speech

Song

Parada-Cabaleiro

Liu

et al. 2021

View full text Add to dashboard Cite

Frustration is a common response during game interactions, typically decreasing a user's engagement and leading to game failure. Artificially intelligent methods capable to automatically detect a user's level of frustration at an early stage are hence of great interest for game designers, since this would enable optimisation of a player's experience in real-time. Nevertheless, research in this context is still in its infancy, mainly relying on the use of pre-trained models and fine-tuning tailored to a specific dataset. Furthermore, this lack in research is due to the limited data available and to the ambiguous labelling of frustration, which leads to outcomes which are not generalisable in the real-world. Meanwhile, contrastive loss has been considered instead of the traditional cross-entropy loss in a variety of machine learning applications, showing to be more robust for system stability alternative in self-supervised learning. Following this trend, we hypothesise that using a supervised contrastive loss might overcome the limitations of the cross-entropy loss yielded by the labels' ambiguity. In fact, our experiments demonstrate that using the supervised contrastive method as a loss function, results improve for the automatic recognition (binary frustration vs nofrustration) of game-induced frustration from speech with an Unweighted Average Recall increase from 86.4 % to 89.9 %.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Meishu Song

An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database

An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety

Computer Audition for Fighting the SARS-CoV-2 Corona Crisis—Introducing the Multitask Speech Corpus for COVID-19

Adventitious Respiratory Classification Using Attentive Residual Neural Networks

Frustration recognition from speech during game interaction using wide residual networks

Predicting Group Work Performance from Physical Handwriting Features in a Smart English Classroom

Supervised Contrastive Learning for Game-Play Frustration Detection from Speech

Contact Info

Product

Resources

About