Enhancing U-Net with Spatial-Channel Attention Gate for Abnormal Tissue Segmentation in Medical Imaging

Khanh, Trinh Le Ba; Dao, Duy-Phuong; Ho, Ngoc-Huynh; Yang, Hyung-Jeong; Baek, Eu-Tteum; Lee, Guee-Sang; Kim, Soo-Hyung; Yoo, Seok Bong

doi:10.3390/app10175729

Cited by 66 publications

(43 citation statements)

References 30 publications

“…In addition, the t-score method performed worse on our testing dataset (Dice about 0.37) than what is described by the developers 14 (Dice about 0.5). Therefore, the classical t-score method was considered insufficiently efficient to segment lesions in our large and heterogeneous clinical dataset. The architecture of our proposed 3D DAGMNet, depicted in Figure 2, is equipped with intra skip connections as UNet3+ 24 , fused multi-scale contextual information block, deep supervision, L1-regularization on final predicts, Dual Attention Gate (DAG) [25][26][27] , self-normalized activation (SeLU) 42 , and batch normalization. The details of the important components and training techniques/parameters are outlined in the following subsections.…”

Section: Modified C-fuzzy Methods 15 (mentioning)

confidence: 99%
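
The Dice values quoted in the statement above (about 0.37 versus about 0.5) measure overlap between a predicted lesion mask and a reference mask. As a minimal sketch of how such a score is commonly computed, assuming flattened binary masks: the function name `dice_coefficient`, the toy masks, and the empty-mask convention are illustrative assumptions, not code from the cited work.

```python
def dice_coefficient(pred, target):
    """Dice = 2 * |A ∩ B| / (|A| + |B|) for two flattened binary masks."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of elements")
    pred_bits = [bool(p) for p in pred]
    target_bits = [bool(t) for t in target]
    # Count elements marked as lesion in both masks.
    intersection = sum(p and t for p, t in zip(pred_bits, target_bits))
    total = sum(pred_bits) + sum(target_bits)
    # Assumed convention: two empty masks count as perfect agreement.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Hypothetical toy masks: 1 marks a lesion voxel, 0 marks background.
predicted = [0, 1, 1, 0, 1, 0]
reference = [0, 1, 0, 0, 1, 1]
score = dice_coefficient(predicted, reference)
print(f"Dice: {score:.3f}")
```

A score of 1 means the masks coincide exactly and 0 means they share no lesion voxels, so a higher value indicates closer agreement with the reference delineation.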

“…Further developments of UNet variants, such as Mnet, DenseUnet, Unet++, and Unet3+ [22][23][24] optimized the features utilization. The emergence of attention-gate techniques [25][26][27] conditioned networks to focus on local semantical features. Recent studies applied "attention UNets", for example, to predict final ischemic lesions from baseline MRIs 28,29 .…”

Section: Introduction (mentioning)

confidence: 99%

Deep learning-based detection and segmentation of diffusion abnormalities in acute ischemic stroke

Liu, Hsu, et al. 2021. Preprint.

Background: Accessible tools to efficiently detect and segment diffusion abnormalities in acute strokes are highly anticipated by the clinical and research communities. Methods: We developed a tool with deep learning networks trained and tested on a large dataset of 2,348 clinical diffusion weighted MRIs of patients with acute and sub-acute ischemic strokes, and further tested for generalization on 280 MRIs of an external dataset (STIR). Results: Our proposed model outperforms generic networks and DeepMedic, particularly in small lesions, with lower false positive rate, balanced precision and sensitivity, and robustness to data perturbs (e.g., artefacts, low resolution, technical heterogeneity). The agreement with human delineation rivals the inter-evaluator agreement; the automated lesion quantification of volume and contrast has virtually total agreement with human quantification. Conclusion: Our tool is fast, public, accessible to non-experts, with minimal computational requirements, to detect and segment lesions via a single command line. Therefore, it fulfills the conditions to perform large scale, reliable and reproducible clinical and translational research.

“…However, over the past few years supervised learning for biomedical images segmentation has managed to achieve a human-level performance that is very promising [101]. In particular, the U-Net architecture and training strategy [103] which was originally proposed to deal with the lower number of samples commonly found in biomedical domains has been successfully applied to many segmentation problems, including abnormal tissue segmentation [104], organ segmentation in CT images [105], and tumour segmentation in brain MRI [106].…”

Section: Segmentation (mentioning)

confidence: 99%

“…The use of attention mechanisms and saliency masks have gained some traction in this area, as they provide a way to visualize what region of an image was attended to that led to the predicted outcome. They were employed recently in models used to screen chest X-rays of COVID-19 patients [119], predict lung nodule malignancy from longitudinal CT [120], perform abnormal tissue segmentation in natural, CT and MRI images [104] and quantification of knee osteoarthritis in X-ray images [121], while the MDNet model of Zhang et al [109] used attention mechanisms to indicate which area of the image corresponded to the text in the generated diagnostic report.…”

Section: Model Interpretability (mentioning)

confidence: 99%

Enterprise imaging and big data: A review from a medical physics perspective

et al. 2021

In recent years enterprise imaging (EI) solutions have become a core component of healthcare initiatives, while a simultaneous rise in big data has opened up a number of possibilities in how we can analyze and derive insights from large amounts of medical data. Together they afford us a range of opportunities that can transform healthcare in many fields. This paper provides a review of recent developments in EI and big data in the context of medical physics. It summarizes the key aspects of EI and big data in practice, with discussion and consideration of the steps necessary to implement an EI strategy. It examines the benefits that a healthcare service can achieve through the implementation of an EI solution by looking at it through the lenses of: compliance, improving patient care, maximizing revenue, optimizing workflows, and applications of artificial intelligence that support enterprise imaging. It also addresses some of the key challenges in enterprise imaging, with discussion and examples presented for those in systems integration, governance, and data security and privacy.

“…The self-attention focused on the response of positions in a sequence while CCBAM focuses on cross-channel and spatial information of feature maps. Another related work is the spatial-channel gating scheme proposed for the U-Net structure [18], which addressed medical image segmentation problem. [19] also applied a concurrent space-channel-wise attention to the redundant convolutional encoder-decoder (RCED) for speech enhancement.…”

Section: Introduction (mentioning)

confidence: 99%

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses

Zhao, Nguyen. 2021. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Deep complex U-Net structure and convolutional recurrent network (CRN) structure achieve state-of-the-art performance for monaural speech enhancement. Both deep complex U-Net and CRN are encoder and decoder structures with skip connections, which heavily rely on the representation power of the complex-valued convolutional layers. In this paper, we propose a complex convolutional block attention module (CCBAM) to boost the representation power of the complex-valued convolutional layers by constructing more informative features. The CCBAM is a lightweight and general module which can be easily integrated into any complex-valued convolutional layers. We integrate CCBAM with the deep complex U-Net and CRN to enhance their performance for speech enhancement. We further propose a mixed loss function to jointly optimize the complex models in both time-frequency (TF) domain and time domain. By integrating CCBAM and the mixed loss, we form a new end-to-end (E2E) complex speech enhancement framework. Ablation experiments and objective evaluations show the superior performance of the proposed approaches.
