2022
DOI: 10.3390/math10152597
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Deeplabv3+ Model with Attention Mechanisms Applied to Eye Detection and Segmentation

Abstract: Research on eye detection and segmentation is even more important with mask-wearing measures implemented during the COVID-19 pandemic. Thus, it is necessary to build an eye image detection and segmentation dataset (EIMDSD), including labels for detecting and segmenting. In this study, we established a dataset to reduce elaboration for chipping eye images and denoting labels. An improved DeepLabv3+ network architecture (IDLN) was also proposed for applying it to the benchmark segmentation datasets. The IDLN was… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 43 publications
(62 reference statements)
0
6
0
Order By: Relevance
“…Cross-entropy loss [ 28 ]) (CE loss): It quantifies the disparity between the predicted value and the actual value on a per-pixel basis, considering all pixels within the image equally. It belongs to global loss.…”
Section: Methodsmentioning
confidence: 99%
“…Cross-entropy loss [ 28 ]) (CE loss): It quantifies the disparity between the predicted value and the actual value on a per-pixel basis, considering all pixels within the image equally. It belongs to global loss.…”
Section: Methodsmentioning
confidence: 99%
“…These techniques enable the model to effectively capture both global and local context. DeepLabv3+ integrates, which allows it to efficiently gather contextual information at several scales [22,23]. ASPP improves the model's capacity to detect items of different sizes and scales in an image by employing atrous (dilated) convolutions at many levels.…”
Section: Segmentationmentioning
confidence: 99%
“…The DeepLabv3+ model [43,44] is a variant of a typical fully convolutional neural network that has achieved good performance in using contextual information for semantic segmentation. In this paper, we propose an improved DeepLabv3+ network architecture, called IDLN [8], which is shown in Figure 4. The IDLN uses the Atrous Spatial Pyramid Pooling (ASPP) module [45] to capture contextual semantic features at different scales by using parallel hole convolution techniques with different expansion rates and retains the DeepLabv3+ model encoding-decoding structure.…”
Section: Eye Semantic Segmentation Modelmentioning
confidence: 99%
“…The performance of the eye segmentation model affects the effectiveness of our overall face occlusion automatic fatigue-driving detection method. We use the IDLN model to segment the eye region, which is trained on EIMDSD [8]. The experimental results are shown in Table 1.…”
Section: Eye Semantic Segmentationmentioning
confidence: 99%
See 1 more Smart Citation