2021
DOI: 10.1109/TPAMI.2021.3054719

Multi-Task Learning for Dense Prediction Tasks: A Survey

Abstract: With the advent of deep learning, many dense prediction tasks, i.e., tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate neural network is trained for each individual task. Yet, recent multi-task learning (MTL) techniques have shown promising results w.r.t. performance, computations and/or memory footprint, by jointly tackling multiple tasks through a learned shared representation. In this survey…

Cited by 391 publications (258 citation statements)
References 77 publications
“…During the training process, the decreasing rates of L_c, L_d, L_s, L_p, and L_r may be inconsistent, causing the model to be dominated by a certain module [14]. Hence, weight coefficients are needed; for the entire network, the final loss function is given by…”
Section: Classification Loss
confidence: 99%
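
The citing paper's exact loss terms and weight values are not reproduced above, but the weighted-sum scheme it describes is standard. A minimal PyTorch-style sketch, with hypothetical task names and coefficients, might look like this:

import torch

def weighted_multitask_loss(losses, weights):
    # Combine per-task losses into one scalar objective; the weight
    # coefficients keep a fast-decreasing loss from dominating training.
    return sum(weights[name] * loss for name, loss in losses.items())

# Illustrative values only: L_c, L_d, L_s, L_p, L_r from the statement above.
losses = {
    "c": torch.tensor(1.2),
    "d": torch.tensor(0.8),
    "s": torch.tensor(0.5),
    "p": torch.tensor(0.3),
    "r": torch.tensor(0.9),
}
weights = {"c": 1.0, "d": 0.5, "s": 0.5, "p": 0.25, "r": 0.25}
total_loss = weighted_multitask_loss(losses, weights)

In practice the weights are either tuned by hand or learned (e.g., via uncertainty weighting); the statement above does not say which variant the authors use.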
“…It achieves this by utilizing encoder-level interactions to generate a shared representation [22,9,17], by using decoder-level interactions to improve single-task results through multi-modal distillation [35,38], or by a combination of both. [32] shows that in an MTL setting, performance varies strongly with a wide range of parameters (e.g., task type, label source), and thus architecture and optimization strategies must be selected on a per-case basis. In general, it is observed that encoder-level interactions perform well for multiple classification problems, while decoder-level interactions have an advantage in dense prediction tasks.…”
Section: Related Work
confidence: 99%
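
To make the encoder-level versus decoder-level distinction concrete, here is a minimal sketch of an encoder-focused architecture: one shared encoder with task-specific heads. All layer sizes and task names are illustrative, not taken from the cited works.

import torch
import torch.nn as nn

class SharedEncoderMTL(nn.Module):
    # Encoder-focused MTL: tasks interact only through the shared encoder.
    def __init__(self, in_ch=3, feat_ch=64, num_classes=21):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(),
        )
        # Task-specific decoders branch off the shared representation.
        self.seg_head = nn.Conv2d(feat_ch, num_classes, 1)  # semantic segmentation
        self.depth_head = nn.Conv2d(feat_ch, 1, 1)          # monocular depth

    def forward(self, x):
        feats = self.encoder(x)
        return {"seg": self.seg_head(feats), "depth": self.depth_head(feats)}

model = SharedEncoderMTL()
out = model(torch.randn(2, 3, 128, 128))  # out["seg"]: (2, 21, 128, 128)

Decoder-focused designs such as multi-modal distillation instead exchange information between the task heads after initial predictions are made.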
“…The surrounding area of each point is discretized into a set number of bins. This allows us to restate the bounding-box center localization on the transverse plane (x, z) as a classification problem, which is shown to be a better fit for encoder-focused architectures [32]. To achieve finer detail, we allow a residual to be regressed for each bin.…”
Section: Joint Proposal Generation and Point Cloud Semantic Segmentation
confidence: 99%
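
A small sketch of the bin-plus-residual encoding described above, assuming hypothetical bin parameters (the cited paper's bin count and size are not given here):

import numpy as np

def encode_center(x, z, bin_size=0.5, num_bins=12):
    # Discretize each axis around the point into num_bins bins of width
    # bin_size; the target is a bin class plus a residual inside the bin.
    half = num_bins * bin_size / 2.0
    targets = {}
    for name, v in (("x", x), ("z", z)):
        shifted = v + half                       # move origin to bin 0
        bin_idx = int(np.clip(shifted // bin_size, 0, num_bins - 1))
        residual = shifted - (bin_idx + 0.5) * bin_size  # offset from bin center
        targets[name] = (bin_idx, residual)
    return targets

print(encode_center(0.7, -1.3))  # ≈ {'x': (7, -0.05), 'z': (3, -0.05)}

A classification head predicts the bin index while a regression head predicts the residual; this is the restatement into a classification problem that the statement says suits encoder-focused architectures.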
“…Multi-task learning (MTL) deep neural networks are widely used in many fields [26,33–36]. It is an intuitive and promising idea to solve the AMR and MPE tasks concurrently in an MTL framework.…”
Section: Introduction
confidence: 99%