2017
DOI: 10.1007/978-3-319-66179-7_37

TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

Abstract: In this paper, we introduce the semantic knowledge of medical images from their diagnostic reports to provide an inspirational network training and an interpretable prediction mechanism with our proposed novel multimodal neural network, namely TandemNet. Inside TandemNet, a language model is used to represent report text, which cooperates with the image model in a tandem scheme. We propose a novel dual-attention model that facilitates high-level interactions between visual and semantic information and effectiv…
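The dual-attention idea sketched in the abstract, attending over visual and semantic features separately and then fusing them for prediction, can be illustrated with a minimal toy example. This is a sketch only, not the paper's exact formulation: the scoring vectors `w_v` and `w_s` are hypothetical stand-ins for the learned dual-attention parameters, and fusion is reduced to simple concatenation.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - x.max())
    return e / e.sum()

def dual_attention(V, S, w_v, w_s):
    """Toy dual attention: weight image regions V (N x d) and report
    tokens S (M x d) separately, then fuse the attended features.

    w_v and w_s are hypothetical learned scoring vectors of shape (d,).
    """
    a_v = softmax(V @ w_v)             # attention weights over N image regions
    a_s = softmax(S @ w_s)             # attention weights over M report tokens
    z_v = a_v @ V                      # (d,) attended visual feature
    z_s = a_s @ S                      # (d,) attended semantic feature
    return np.concatenate([z_v, z_s])  # fused (2d,) representation

rng = np.random.default_rng(0)
V = rng.standard_normal((4, 6))        # 4 image regions, 6-dim features
S = rng.standard_normal((3, 6))        # 3 report tokens, 6-dim features
z = dual_attention(V, S, rng.standard_normal(6), rng.standard_normal(6))
print(z.shape)  # (12,)
```

Consistent with the "optional semantic references" of the title, the report branch can be dropped at test time, leaving only the visual attention path.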

Cited by 49 publications (28 citation statements); references 11 publications.
“…In the testing stage, the input is a fundus image only, and the output is a probabilistic map of the lesion types in the image. Zhang et al. proposed a multimodal network that jointly learns from medical images and their diagnostic reports, in which semantic information interacts with visual information to improve the image understanding ability by teaching the network to distill informative features.…”
Section: Expanding Datasets For Deep Learning
confidence: 99%
“…In the testing stage, the input is a fundus image only, and the output is a probabilistic map of the lesion types in the image. Zhang et al. [334] proposed a multimodal network that jointly learns from medical images and their diagnostic reports, in which semantic information interacts with visual information to improve the image understanding ability by teaching the network to distill informative features. Applied to bladder cancer images and the corresponding diagnostic reports, the network demonstrated improved performance compared to a baseline CNN that uses only image information for training.…”
Section: Data Annotation Via Mining Text Reports
confidence: 99%
“…The attention mechanism has been explored for image captioning [17], voice activity detection [18], speech emotion recognition [19], and question answering [20]. In biomedical imaging, attention has been used for report generation [21], disease classification [22], [23], organ segmentation [24], and localization [25]. In [26], the authors introduced an attention mechanism for macular OCT classification; the proposed deep network requires a large number of model parameters, but its performance evaluation is limited.…”
Section: Introduction
confidence: 99%