2019
DOI: 10.1016/j.ifacol.2019.12.499
|View full text |Cite
|
Sign up to set email alerts
|

Deep Orange: Mask R-CNN based Orange Detection and Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
56
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 112 publications
(57 citation statements)
references
References 34 publications
0
56
0
1
Order By: Relevance
“…The impact of target dataset diversity and choosing a backbone model that is suitable for the target classes is much more important than the number of sample images in the dataset itself (as shown in Table 1 [ 111 , 113 , 114 ]).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The impact of target dataset diversity and choosing a backbone model that is suitable for the target classes is much more important than the number of sample images in the dataset itself (as shown in Table 1 [ 111 , 113 , 114 ]).…”
Section: Discussionmentioning
confidence: 99%
“…Most of the recently presented models, including YOLO-v3 and R-CNN family architectures, are supported by the FPN model, which enables small object detection and enhances semantic segmentation and multi-object detection (as shown in Table 1 [ 107 , 108 , 110 , 111 , 113 , 114 , 117 , 118 ]).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Mask R-CNN improves the loss function of segmentation, from the polynomial cross-entropy based on single-pixel softmax to the binary cross-entropy based on single-pixel sigmoid. 28 If the candidate frame is detected as a category, the binary cross-entropy will make the cross-entropy of this category to be calculated as the error value, only contribute to the ground truth (GT) of the specific K class, while the other classes do not contribute to the loss, 29 and in the back propagation, the loss function L only calculates and back propagates the GT classes, which decouples the segmentation and classification and effectively avoids the competition between classes. Another improvement is that Mask R-CNN adds the RoI align layer, Mask R-CNN uses bilinear interpolation to make the pooling result of the region of interest closer to the features before the non-pooling, thus reducing the error.…”
Section: Related Workmentioning
confidence: 99%
“…However, several challenges have also been identified which include 1) the lack of a large scale dataset; 2) the increasing need to integrate more disciplines and agricultural requirements; and 3) ensuring the robustness and accuracy in various complex situations. In another closely related work, Ganesh et al 2019 [15] use mask R-CNN to detect individual fruits and obtain pixel-wise mask for each detected fruit in the image.…”
mentioning
confidence: 99%