2019
DOI: 10.1145/3328932
|View full text |Cite
|
Sign up to set email alerts
|

Multi-task Self-Supervised Learning for Human Activity Detection

Abstract: Deep learning methods are successfully used in applications pertaining to ubiquitous computing, pervasive intelligence, health, and well-being. Speci cally, the area of human activity recognition (HAR) is primarily transformed by the convolutional and recurrent neural networks, thanks to their ability to learn semantic representations directly from raw input. However, in order to extract generalizable features massive amounts of well-curated data are required, which is a notoriously challenging task; hindered … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
199
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 230 publications
(226 citation statements)
references
References 66 publications
1
199
0
Order By: Relevance
“…A possible solution is to use self-supervised learning architecture that can be used to increase the generalizability of a classification model with a small number of labeled samples while utilizing unlabeled images that are more easily available. Self-supervised learning in the form of rotation augmentation has shown to improve the model generalizability in a number of computer vision [3,4,8,11] and human activity recognition [2,9] tasks. Inspired by our success with AugToAct [2], where we modeled the self-supervision task as regression (reconstruction of a rotated/scaled input feature space), instead of classification (predicting whether rotation was done) with higher performance gain, we propose to augment self-supervision with a reinforcement learning loop [1,6] to learn the optimum augmentation policy for self-supervision.…”
Section: Label Desert Problemmentioning
confidence: 99%
“…A possible solution is to use self-supervised learning architecture that can be used to increase the generalizability of a classification model with a small number of labeled samples while utilizing unlabeled images that are more easily available. Self-supervised learning in the form of rotation augmentation has shown to improve the model generalizability in a number of computer vision [3,4,8,11] and human activity recognition [2,9] tasks. Inspired by our success with AugToAct [2], where we modeled the self-supervision task as regression (reconstruction of a rotated/scaled input feature space), instead of classification (predicting whether rotation was done) with higher performance gain, we propose to augment self-supervision with a reinforcement learning loop [1,6] to learn the optimum augmentation policy for self-supervision.…”
Section: Label Desert Problemmentioning
confidence: 99%
“…L EARNING representations with deep neural networks have made tremendous improvements in the last few years on challenging real-world tasks [1]- [4], thanks to the emergence of massive datasets. In particular, the wealth of sensory data from the Internet of Things (IoT) devices are only recently being leveraged for tackling important problems in understanding context, user monitoring, health, and other predictive analytics tasks, e.g., for emotional well-being [5], [6], sleep tracking [7], and physical activity detection [8]. The success is mainly attributed to the supervised methods that utilize labeled datasets for training models in a central environment.…”
Section: Introductionmentioning
confidence: 99%
“…Traditionally, HAR through wearable and mobile sensing relied on sliding window segmentation and manual feature extraction, followed by a variety of supervised learning techniques to recognize simple and complex activities like walking, running, cycling and cooking. While in simple scenarios hand-crafted features may suffice, deep learning methods have proven to be more effective in complex HAR tasks [12,21,29,37,39]. Indeed, deep learning has shown promise in HAR by automatically extracting useful features for the target task [42,61].…”
Section: Introductionmentioning
confidence: 99%
“…It is particularly important to consider limitations due to their size, diversity and ability to capture and represent the richness, noise and complexity that can be found in free-living, unconstrained data [25,48,49]. These limitations on mobile sensing datasets are only exacerbated by the difficulty in collecting labeled data outside of laboratory settings [39].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation