2021
DOI: 10.1007/s00138-021-01249-8
|View full text |Cite
|
Sign up to set email alerts
|

Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 72 publications
(47 citation statements)
references
References 40 publications
0
29
0
Order By: Relevance
“…Our multimodal strategy is decoupled from any particular artificial neural network architecture, and thus, it can be easily integrated into an existing multimodal pipeline. Practically multimodal learning is done utilizing different data fusion strategies, i.e., early fusion, middle fusion, and late fusion [47][48][49].…”
Section: Deeptlf and Multimodal Datamentioning
confidence: 99%
“…Our multimodal strategy is decoupled from any particular artificial neural network architecture, and thus, it can be easily integrated into an existing multimodal pipeline. Practically multimodal learning is done utilizing different data fusion strategies, i.e., early fusion, middle fusion, and late fusion [47][48][49].…”
Section: Deeptlf and Multimodal Datamentioning
confidence: 99%
“…After extracting multimodal features, it is important to aggregate them together. The more typical fusion methods [25,26] are divided into early fusion and late fusion. Early fusion refers to directly splicing the multimodal input signals and then sending them into a unified network structure for training.…”
Section: Modality Aggregationmentioning
confidence: 99%
“…Different sensors can provide different perception information on the same context. Studies [31,32] have confirmed that the feature fusion strategy of different convolutional layer feature maps can effectively solve these problems. Multimodal fusion uses different models to extract different perception features and combine them to improve the performances obtained by using only one modality.…”
Section: Multi-feature Fusion Cnnsmentioning
confidence: 99%
“…There are three types of multimodal fusion architectures: early fusion, late fusion, and intermediate fusion [31,32]. An early fusion architecture [33,34] integrates multiple data before conducting the analysis.…”
Section: Multi-feature Fusion Cnnsmentioning
confidence: 99%