2019 IEEE/CVF International Conference on Computer Vision (ICCV)
DOI: 10.1109/iccv.2019.00147

Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels

Abstract: The performance of multi-task learning in Convolutional Neural Networks (CNNs) hinges on the design of feature sharing between tasks within the architecture. The number of possible sharing patterns is combinatorial in the depth of the network and the number of tasks, so hand-crafting an architecture purely from human intuition about task relationships can be time-consuming and suboptimal. In this paper, we present a probabilistic approach to learning task-specific and shared representations in CNNs…
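
To make the abstract's idea concrete, below is a minimal sketch of a stochastic-filter-group style convolution layer for two tasks: each output filter is assigned to a task-1-specific, shared, or task-2-specific group by a learned per-filter categorical distribution, and each task only sees its own filters plus the shared ones. The class name, layer sizes, and the Gumbel-Softmax relaxation used for sampling are our assumptions for illustration, not the authors' released implementation.

```python
# Minimal sketch (not the authors' code) of the stochastic filter group idea.
import torch
import torch.nn as nn
import torch.nn.functional as F

class StochasticFilterGroupConv(nn.Module):
    def __init__(self, in_ch, out_ch, tau=1.0):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)
        # one 3-way logit vector per output filter: [task-1, shared, task-2]
        self.group_logits = nn.Parameter(torch.zeros(out_ch, 3))
        self.tau = tau  # Gumbel-Softmax temperature (assumed hyper-parameter)

    def forward(self, x):
        feats = self.conv(x)  # (B, out_ch, H, W)
        # soft one-hot group assignment per output filter, sampled stochastically
        assign = F.gumbel_softmax(self.group_logits, tau=self.tau, hard=False)  # (out_ch, 3)
        p_t1, p_shared, p_t2 = (assign[:, i].view(1, -1, 1, 1) for i in range(3))
        # each task sees its own ("specialist") filters plus the shared ("generalist") ones
        return feats * (p_t1 + p_shared), feats * (p_t2 + p_shared)

# toy usage: two task-specific feature maps from one shared conv layer
layer = StochasticFilterGroupConv(3, 16)
f1, f2 = layer(torch.randn(2, 3, 32, 32))
print(f1.shape, f2.shape)  # torch.Size([2, 16, 32, 32]) twice
```

In a full model, the two outputs would feed task-specific branches, and the grouping probabilities would be learned jointly with the task losses, so each filter gradually specialises or stays shared.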

Cited by 49 publications (21 citation statements)
References 28 publications

“…To alleviate this issue, several recent works [9], [10], [11], [12] proposed efficient design procedures that automatically decide where to share or branch within the network. Similarly, stochastic filter groups [56] re-purposed the convolution kernels in each layer to support shared or task-specific behaviour. Soft Parameter Sharing.…”
Section: Soft and Hard Parameter Sharing in Deep Learning
confidence: 99%
“…Huang et al [67] introduced a method rooted in Neural Architecture Search (NAS) for the automated construction of a tree-based multi-attribute learning network. Stochastic filter groups [56] re-purposed the convolution kernels in each layer of the network to support shared or task-specific behaviour. In a similar vein, feature partitioning [68] presented partitioning strategies to assign the convolution kernels in each layer of the network into different tasks.…”
Section: Other Approaches
confidence: 99%
“…Encoder-focused approaches primarily emphasize architectures that can encode multi-purpose feature representations through supervision from multiple tasks. Such encoding is typically achieved, for example, via feature fusion [41], branching [25,43,36,61], self-supervision [10], attention [33], or filter grouping [1]. Decoder-focused approaches start from the feature representations learned at the encoding stage, and further refine them at the decoding stage by distilling information across tasks in a one-off [63], sequential [65], recursive [66], or even multi-scale [62] manner.…”
Section: Related Work
confidence: 99%
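
For readers unfamiliar with the encoder-focused pattern that these snippets contrast with decoder-focused methods, the sketch below shows its simplest hard-parameter-sharing variant: one shared encoder trained by all task losses, with small task-specific heads. The module names, layer sizes, and the two example tasks (segmentation and depth) are hypothetical illustrations, not code from any of the cited works.

```python
# Minimal sketch (our illustration) of encoder-focused hard parameter sharing:
# one shared encoder feeds several lightweight task-specific heads.
import torch
import torch.nn as nn

class SharedEncoderMTL(nn.Module):
    def __init__(self, num_seg_classes=21, num_depth_ch=1):
        super().__init__()
        # shared encoder: every task back-propagates into these weights
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
        )
        # task-specific heads: parameters are not shared across tasks
        self.seg_head = nn.Conv2d(64, num_seg_classes, 1)
        self.depth_head = nn.Conv2d(64, num_depth_ch, 1)

    def forward(self, x):
        z = self.encoder(x)  # multi-purpose representation shared by all tasks
        return self.seg_head(z), self.depth_head(z)

model = SharedEncoderMTL()
seg, depth = model(torch.randn(1, 3, 64, 64))
print(seg.shape, depth.shape)  # (1, 21, 64, 64) and (1, 1, 64, 64)
```

Branching, attention, and filter-grouping methods (including stochastic filter groups) can be read as relaxations of this pattern that let the network decide, per layer or per filter, how much of the encoder is actually shared.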
“…When it comes to learning multiple tasks under a single model, multi-task learning (MTL) techniques [2,54] have been employed in the literature. On the one hand, encoder-focused approaches [41,25,36,10,43,33,1,61] emphasize learning feature representations from multi-task supervisory signals by employing architectures that encode shared and task-specific information. On the other hand, decoder-focused approaches [63,65,66,62] utilize the multi-task feature representations learned at the encoding stage to distill cross-task information at the decoding stage, thus refining the original feature representations.…”
Section: Introduction
confidence: 99%
“…Attention-based LSTM was then used for feature embedding, but the outliers were not handled effectively, which could affect the model performance [43]. According to the needs of each task, stochastic filter groups for CNNs grouped the convolution kernels of each convolutional layer [44]. There are some other networks, such as branched multi-task networks [45], sluice networks [46] and learning sparse sharing [47], that address multiple task sharing issues, but they were difficult to train due to the high complexity of the models.…”
Section: Related Work
confidence: 99%