Search citation statements
Paper Sections
Citation Types
Year Published
Publication Types
Relationship
Authors
Journals
In this paper, we design and evaluate the performance of the Multi-resolution Twinned Residual Auto-Encoders (MR-TRAE) model, a deep learning (DL)-based architecture specifically designed for achieving multi-resolution super-resolved images from low-resolution (LR) inputs at various scaling factors. For this purpose, we expand on the recently introduced Twinned Residual Auto-Encoders (TRAE) paradigm for single-image super-resolution (SISR) to extend it to the multi-resolution (MR) domain. The main contributions of this work include (i) the architecture of the MR-TRAE model, which utilizes cascaded trainable up-sampling modules for progressively increasing the spatial resolution of low-resolution (LR) input images at multiple scaling factors; (ii) a novel loss function designed for the joint and semi-blind training of all MR-TRAE model components; and (iii) a comprehensive analysis of the MR-TRAE trade-off between model complexity and performance. Furthermore, we thoroughly explore the connections between the MR-TRAE architecture and broader cognitive paradigms, including knowledge distillation, the teacher-student learning model, and hierarchical cognition. Performance evaluations of the MR-TRAE benchmarked against state-of-the-art models (such as U-Net, generative adversarial network (GAN)-based, and single-resolution baselines) were conducted using publicly available datasets. These datasets consist of LR computer tomography (CT) scans from patients with COVID-19. Our tests, which explored multi-resolutions at scaling factors $$\times (2,4,8)$$ × ( 2 , 4 , 8 ) , showed a significant finding: the MR-TRAE model can reduce training times by up to $$60\%$$ 60 % compared to those of the baselines, without a noticeable impact on achieved performance.
In this paper, we design and evaluate the performance of the Multi-resolution Twinned Residual Auto-Encoders (MR-TRAE) model, a deep learning (DL)-based architecture specifically designed for achieving multi-resolution super-resolved images from low-resolution (LR) inputs at various scaling factors. For this purpose, we expand on the recently introduced Twinned Residual Auto-Encoders (TRAE) paradigm for single-image super-resolution (SISR) to extend it to the multi-resolution (MR) domain. The main contributions of this work include (i) the architecture of the MR-TRAE model, which utilizes cascaded trainable up-sampling modules for progressively increasing the spatial resolution of low-resolution (LR) input images at multiple scaling factors; (ii) a novel loss function designed for the joint and semi-blind training of all MR-TRAE model components; and (iii) a comprehensive analysis of the MR-TRAE trade-off between model complexity and performance. Furthermore, we thoroughly explore the connections between the MR-TRAE architecture and broader cognitive paradigms, including knowledge distillation, the teacher-student learning model, and hierarchical cognition. Performance evaluations of the MR-TRAE benchmarked against state-of-the-art models (such as U-Net, generative adversarial network (GAN)-based, and single-resolution baselines) were conducted using publicly available datasets. These datasets consist of LR computer tomography (CT) scans from patients with COVID-19. Our tests, which explored multi-resolutions at scaling factors $$\times (2,4,8)$$ × ( 2 , 4 , 8 ) , showed a significant finding: the MR-TRAE model can reduce training times by up to $$60\%$$ 60 % compared to those of the baselines, without a noticeable impact on achieved performance.
Objective. Given the different noise distribution information of global and local magnetic resonance (MR) images, this study aims to extend the current work on convolutional neural networks that preserve global structure and local details in MR image denoising tasks. Approach. This study proposed a parallel and serial network for denoising 3D MR images, called 3D-PSNet. We use the residual depthwise separable convolution block to learn the local information of the feature map, reduce the network parameters, and thus improve the training speed and parameter efficiency. In addition, we consider the feature extraction of the global image and utilize residual dilated convolution to process the feature map to expand the receptive field of the network and avoid the loss of global information. Finally, we combine both of them to form a parallel network. What’s more, we integrate reinforced residual convolution blocks with dense connections to form serial network branches, which can remove redundant information and refine features to further obtain accurate noise information. The peak signal-to-noise ratio, structural similarity index measure, and root mean square error metrics of 3D-PSNet are as high as 47.79%, 99.81%, and 0.40%, respectively, achieving competitive denoising effect on three public datasets. The ablation experiments demonstrated the effectiveness of all the designed modules regarding all the evaluated metrics in both datasets. Significance. The proposed 3D-PSNet takes advantage of multi-scale receptive fields, local feature extraction and residual dense connections to more effectively restore the global structure and local fine features in MR images, and is expected to help doctors quickly and accurately diagnose patients' conditions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.