2022
DOI: 10.3390/machines10050340
|View full text |Cite
|
Sign up to set email alerts
|

Object Detection via Gradient-Based Mask R-CNN Using Machine Learning Algorithms

Abstract: Object detection has received a lot of research attention in recent years because of its close association with video analysis and image interpretation. Detecting objects in images and videos is a fundamental task and considered as one of the most difficult problems in computer vision. Many machine learning and deep learning models have been proposed in the past to solve this issue. In the current scenario, the detection algorithm must calculate from beginning to end in the shortest amount of time possible. Th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…This loss function is computed as a weighted total sum of various losses during the training at every phase of the model on each proposal RoI, which is shown by Equation (2). This weighted loss is defined as [ 25 ]: where , , and represent the classification loss, bounding-box loss, and the average binary cross-entropy loss, respectively. The shows the convergence of the predictions to the true class.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…This loss function is computed as a weighted total sum of various losses during the training at every phase of the model on each proposal RoI, which is shown by Equation (2). This weighted loss is defined as [ 25 ]: where , , and represent the classification loss, bounding-box loss, and the average binary cross-entropy loss, respectively. The shows the convergence of the predictions to the true class.…”
Section: Methodsmentioning
confidence: 99%
“…This loss function is computed as a weighted total sum of various losses during the training at every phase of the model on each proposal RoI, which is shown by Equation ( 2). This weighted loss is defined as [25]: 3) and ( 4):…”
Section: Tilapia Detectionmentioning
confidence: 99%
“…However, in the multi-label problem, the annotations are related to the ingredients contained in the image, but there is no information about the locations. In our proposal, to estimate the ingredient locations, we decided to use an explainability algorithm like Grad-CAM++ because, as was observed in [39,40], Grad-CAM++ is a good alternative to estimate the object location and segmentation in an unsupervised way and can be used at the top of a neural network without retraining. Subsequently, a group-to-attend (GA) module forms groups of a predefined size K = 2 randomly.…”
Section: Rationalementioning
confidence: 99%
“…Grad-CAM++ was developed with the purpose of getting a visual explanation of the model prediction. However, they have also been successfully used as unsupervised object location [40] and segmentation [46]. Segmenting food images is a hard problem, even for supervised methods.…”
mentioning
confidence: 99%
“…In the future, there will be new potential applications of Grad-CAM: (i) Object localization. The Grad-CAM can localize objects [32] by highlighting the regions of the objects that contribute to the predictions of objects' presence. (ii) Image classification.…”
mentioning
confidence: 99%