2022
DOI: 10.3390/diagnostics13010072
|View full text |Cite
|
Sign up to set email alerts
|

A Deep CNN Transformer Hybrid Model for Skin Lesion Classification of Dermoscopic Images Using Focal Loss

Abstract: Skin cancers are the most cancers diagnosed worldwide, with an estimated > 1.5 million new cases in 2020. Use of computer-aided diagnosis (CAD) systems for early detection and classification of skin lesions helps reduce skin cancer mortality rates. Inspired by the success of the transformer network in natural language processing (NLP) and the deep convolutional neural network (DCNN) in computer vision, we propose an end-to-end CNN transformer hybrid model with a focal loss (FL) function to classify skin les… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 50 publications
0
5
0
Order By: Relevance
“…The CNN model cannot extract features with dryness, brightness, and shining images. The deep CNN with transformer model [ 38 ] uses all parts of the input image by dividing the input image into tokens and applying the transformers directly to the sequence of input images. The deep CNN with transformer model does not highlight the localized information, is unable to capture the contextual information, and is applicable for large datasets.…”
Section: Resultsmentioning
confidence: 99%
“…The CNN model cannot extract features with dryness, brightness, and shining images. The deep CNN with transformer model [ 38 ] uses all parts of the input image by dividing the input image into tokens and applying the transformers directly to the sequence of input images. The deep CNN with transformer model does not highlight the localized information, is unable to capture the contextual information, and is applicable for large datasets.…”
Section: Resultsmentioning
confidence: 99%
“…Due to its ability to identify complex patterns in medical imaging data, DL has emerged as a promising tool for medical diagnosis, especially in the classification of skin diseases [10]. Utilizing various network architectures, such as Convolutional Neural Networks (CNN) [11], [12], an increasing body of research has been dedicated to applying DL techniques to classify skin diseases [13].…”
Section: Literature Reviewmentioning
confidence: 99%
“…However, CNN-based methods generally show limitations in modeling long-distance feature dependencies due to the local convolution operations and small receptive fields. To overcome this limitation, Vision Transformers have been developed and shown to improve feature extraction by building long-range feature interactions and capturing the global context of the features [6,7]. Particularly, Swin Transformer (SwinT) constructs a hierarchical representation of features by starting from small-sized patches and gradually merging neighboring patches in deeper Transformer layers [8].…”
Section: Introductionmentioning
confidence: 99%