Representation Compensation Networks for Continual Semantic Segmentation

Zhang, Changbin; Xiao, Jia-Wen; Liu, Xialei; Chen, Ying-Cong; Cheng, Ming–Ming

doi:10.1109/cvpr52688.2022.00692

Cited by 53 publications

(43 citation statements)

References 113 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…CSWKD [41] weights the distillation loss based on the old and new class similarity. Other than knowledge distillation, RCIL [63] designs a two-branch module to decouple the representation learning of old and new classes. In multiorgan segmentation, only one study [31] applies CSS, based…”

Section: Related Workmentioning

confidence: 99%

“…Federated learning is a related solution [43], but it may not always be viable or easily accessible considering the requirement for sophisticated and expensive software/hardware computing infrastructures. Alternatively, we achieve this clinically preferred goal via continual semantic segmentation (CSS), which is emerging very recently in the natural image domain [5,11,36,37,63] but has been only scarcely studied for medical imaging [31,39].…”

Section: Introductionmentioning

confidence: 99%

“…There are several recent CSS work in computer vision [5,11,36,37,63]. MiB loss is often applied to handle the background-label conflicting issue [5,11].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans

Ji¹,

Guo²,

Wang³

et al. 2023

Preprint

View full text Add to dashboard Cite

Deep learning empowers the mainstream medical image segmentation methods. Nevertheless current deep segmentation approaches are not capable of efficiently and effectively adapting and updating the trained models when new incremental segmentation classes (along with new training datasets or not) are required to be added. In real clinical environment, it can be preferred that segmentation models could be dynamically extended to segment new organs/tumors without the (re-)access to previous training datasets due to obstacles of patient privacy and data storage. This process can be viewed as a continual semantic segmentation (CSS) problem, being understudied for multiorgan segmentation. In this work, we propose a new architectural CSS learning framework to learn a single deep segmentation model for segmenting a total of 143 whole-body organs. Using the encoder/decoder network structure, we demonstrate that a continually-trained then frozen encoder coupled with incrementally-added decoders can extract and preserve sufficiently representative image features for new classes to be subsequently and validly segmented. To maintain a single network model complexity, we trim each decoder progressively using neural architecture search and teacher-student based knowledge distillation. To incorporate with both healthy and pathological organs appearing in different datasets, a novel anomaly-aware and confidence learning module is proposed to merge the overlapped organ predictions, originated from different decoders. Trained and validated on 3D CT scans of 2500+ patients from four datasets, our single network can segment total 143 wholebody organs with very high accuracy, closely reaching the upper bound performance level by training four separate segmentation models (i.e., one model per dataset/task).

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans

Ji¹,

Guo²,

Wang³

et al. 2023

Preprint

View full text Add to dashboard Cite

show abstract

“…Other techniques including mBCE and DKD in Sec. 3.2 can be applied to the off-the-shelf models, which brings significant performance gains, compared to current CISS methods [5,9,22,29] (See Tables 1 and 3).…”

Section: Numerical Values Of Zmentioning

confidence: 99%

“…Class-incremental semantic segmentation (CISS) adopts a CIL paradigm for the task of semantic segmentation. CISS methods [5,9,21,29] typically exploit a softmax cross-entropy (CE) term along with knowledge distillation (KD) [14]. Although the CE term helps to learn novel classes, applying the softmax function to all classes, including both old and novel ones, lowers class probabilities of old ones.…”

Section: Introductionmentioning

confidence: 99%

Decomposed Knowledge Distillation for Class-Incremental Semantic Segmentation

Baek¹,

Oh²,

Lee³

et al. 2022

Preprint

View full text Add to dashboard Cite

Class-incremental semantic segmentation (CISS) labels each pixel of an image with a corresponding object/stuff class continually. To this end, it is crucial to learn novel classes incrementally without forgetting previously learned knowledge. Current CISS methods typically use a knowledge distillation (KD) technique for preserving classifier logits, or freeze a feature extractor, to avoid the forgetting problem. The strong constraints, however, prevent learning discriminative features for novel classes. We introduce a CISS framework that alleviates the forgetting problem and facilitates learning novel classes effectively. We have found that a logit can be decomposed into two terms. They quantify how likely an input belongs to a particular class or not, providing a clue for a reasoning process of a model. The KD technique, in this context, preserves the sum of two terms (i.e., a class logit), suggesting that each could be changed and thus the KD does not imitate the reasoning process. To impose constraints on each term explicitly, we propose a new decomposed knowledge distillation (DKD) technique, improving the rigidity of a model and addressing the forgetting problem more effectively. We also introduce a novel initialization method to train new classifiers for novel classes. In CISS, the number of negative training samples for novel classes is not sufficient to discriminate old classes. To mitigate this, we propose to transfer knowledge of negatives to the classifiers successively using an auxiliary classifier, boosting the performance significantly. Experimental results on standard CISS benchmarks demonstrate the effectiveness of our framework.

show abstract