2018
DOI: 10.1145/3161174
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Deep Learning for Activity and Context Recognition

Abstract: Wearables and mobile devices see the world through the lens of half a dozen low-power sensors, such as, barometers, accelerometers, microphones and proximity detectors. But differences between sensors ranging from sampling rates, discrete and continuous data or even the data type itself make principled approaches to integrating these streams challenging. How, for example, is barometric pressure best combined with an audio sample to infer if a user is in a car, plane or bike? Critically for applications, how su… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
123
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 214 publications
(138 citation statements)
references
References 53 publications
1
123
0
1
Order By: Relevance
“…Deep neural networks are composed of multiple (parameterized) non-linear transformations that are trained through a supervised or unsupervised objective function with the aim of yielding useful representations. ese techniques have achieved indisputable empirical success across a broad spectrum of problems [4,21,28,35,43,56,65,66] thanks to the increasing dataset sizes and computing power availability. Nevertheless, representation learning still stands as a fundamental problem in machine intelligence and is an active area of research (see [8] for a detailed survey).…”
Section: Representation Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…Deep neural networks are composed of multiple (parameterized) non-linear transformations that are trained through a supervised or unsupervised objective function with the aim of yielding useful representations. ese techniques have achieved indisputable empirical success across a broad spectrum of problems [4,21,28,35,43,56,65,66] thanks to the increasing dataset sizes and computing power availability. Nevertheless, representation learning still stands as a fundamental problem in machine intelligence and is an active area of research (see [8] for a detailed survey).…”
Section: Representation Learningmentioning
confidence: 99%
“…Over the last years, deep neural networks have been widely adopted for time-series and sensory data processing; achieving impressive performance in several application areas pertaining to pervasive sensing, ubiquitous computing, industries, health and well-being [17,21,38,56,60,73]. In particular, for smartphone-based human Fig.…”
Section: Introductionmentioning
confidence: 99%
“…Their goals are similar, while their learning processes are different. Both can be exploited to extract patterns from unlabeled mobile data, which may be subsequently employed for various supervised learning tasks, e.g., routing [186], mobile activity recognition [187], [188], periocular verification [189] and base station user number prediction [190].…”
Section: Auto-encodersmentioning
confidence: 99%
“…MagTrack also detects aggressive steering motions that can indicate fatigue driving. Simultaneous Tracking and Classification: STC was initially proposed to solve the target tracking and classification in the radar monitoring scenario [26,46,47,53]. The basic idea is to use the special feature metrics to infer the target type.…”
Section: State Of the Artmentioning
confidence: 99%