2021
DOI: 10.1186/s13636-020-00194-0
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices

Abstract: Over recent years, machine learning techniques have been employed to produce state-of-the-art results in several audio-related tasks. The success of these approaches has been largely due to access to large open-source datasets and to enhanced computational resources. However, a shortcoming of these methods is that they often fail to generalize well to tasks from real-life scenarios, owing to domain mismatch. One such task is foreground speech detection from wearable audio devices. Several inter…
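The abstract is truncated here, so the authors' exact model is not visible; in general, though, multiple instance learning for audio treats a recording as a "bag" of short segments that carries only a recording-level label, and segment-level localization falls out of the segment scores. A minimal sketch of that idea using max pooling over segment logits (a hypothetical illustration under stated assumptions, not the paper's implementation):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def bag_score(segment_logits):
    """MIL max pooling: a bag (recording) is scored positive if any
    single segment looks like foreground speech."""
    return sigmoid(np.max(np.asarray(segment_logits)))

def localize(segment_logits, threshold=0.5):
    """Segment-level localization from the same logits: return the
    indices of segments whose foreground probability exceeds the
    threshold."""
    probs = sigmoid(np.asarray(segment_logits))
    return [i for i, p in enumerate(probs) if p > threshold]

# Toy recording: five one-second segments, with logits from a
# hypothetical segment encoder. Segments 1 and 3 contain foreground
# speech in this made-up example.
logits = np.array([-3.0, 2.0, -1.5, 4.0, -2.0])
print(round(float(bag_score(logits)), 3))  # recording-level probability
print(localize(logits))                    # indices of foreground segments → [1, 3]
```

Max pooling is only one choice of MIL aggregator; attention-based pooling is a common alternative that additionally learns per-segment weights.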

Cited by 11 publications (5 citation statements)
References 24 publications
“…This might in part be due to technical limitations of the used MS algorithm: Although the algorithm achieved high accuracies of more than 85% in prior studies (Lane et al, 2012; Rabbi et al, 2011), the algorithm’s accuracy in less controlled environments is probably lower, as indicated by the size of agreement with DRM and ESM in the current study. In the future, researchers will likely have access to more sophisticated algorithms—for example, first evidence suggests that algorithms based on a distinction of foreground versus background sound might outperform more traditional voice-detection algorithms (Hebbar et al, 2021).…”
Section: Discussion
confidence: 99%
“…In general, we expected the agreement between DRM and MS to be lower than the agreement between ESM and MS because of the greater time delay and increased memory biases of DRM compared with ESM and MS. We further expected DRM and ESM to agree more on face-to-face interactions than DRM and MS or ESM and MS because of a closer alignment of operationalizations (e.g., social interactions assessed in DRM and MS may include periods without conversation) and because of technical challenges of MS, such as accurately identifying speakers (e.g., the participant or a surrounding group of people) and filtering out background noise (Hebbar et al, 2021). Accordingly, we derived the following hypotheses:…”
Section: The Present Study
confidence: 99%
“…It is not uncommon for an EAR study to accrue hundreds of hours of audio data. These audio data then, at least for the moment, need to be listened to and behaviorally coded by human coders (see Dubey et al, 2016; Hebbar et al, 2021; Schindler et al, 2022, for recent proof-of-concept attempts to automate aspects of the coding). In that, then, the EAR, as a naturalistic observation method, is ultimately subject to at least some of the same challenges that lab-based observation is.…”
Section: Mobile Sensing - The Why
confidence: 99%
“…However, it is only a matter of time before automated behavioral codings for some types of variables will become a possibility. First attempts are already being made (Dubey et al, 2016; Hebbar et al, 2021; Schindler et al, 2022), and if their accuracy and robustness can be increased, this would considerably reduce the time and financial resources currently required for coding speech data.…”
Section: Addressing Biases Inserted By Automation
confidence: 99%