Abstract:Due to the sharp increase in household waste, its separate collection is essential in order to reduce the huge amount of household waste, since it is difficult to recycle trash without separate collection. However, since it is costly and time-consuming to separate trash manually, it is crucial to develop an automatic system for separate collection using deep learning and computer vision. In this paper, we propose two Anchor-free-based Recyclable Trash Detection Networks (ARTD-Net) which can recognize overlappe… Show more
“…The smaller YOLOv6 models (nano, tiny, and small) harness reparameterized VGG networks (RepBlock) with skip connections for training, which transition to simple 3x3 convolutional (RepConv) blocks during inference [22]. Meanwhile, the medium and large YOLOv6 models deploy reparameterized versions of the CSP backbone, termed CSP-StackRep, culminating in the EfficientRep backbone [21], [23].…”
This paper presents a comprehensive evaluation of various YOLO architectures for smoke and wildfire detection, including YOLOv5, YOLOv6, YOLOv7, YOLOv8, and YOLO-NAS. The study aims to assess their effectiveness in early detection of wildfires using the Foggia dataset, comprising 8,974 images specifically designed for this purpose. Performance evaluation employs metrics such as Recall, Precision, F1-score, and mean Average Precision to provide a holistic assessment of the models' performance. The study follows a rigorous methodology involving fixed epochs, continuous performance tracking, and unbiased testing. Results show that YOLOv5, YOLOv7, and YOLOv8 exhibit a balanced performance across all metrics in both validation and testing. YOLOv6 performs slightly lower in recall during validation but achieves a good balance on testing. YOLO-NAS variants excel in recall, making them suitable for critical applications. However, precision performance is lower for YOLO-NAS models. Visual results demonstrate that the top-performing models accurately identify most instances in the test set. However, they struggle with distant scenes and poor lighting conditions, occasionally detecting false positives. In favorable conditions, the models perform well in identifying relevant instances. We conclude that no single model excels in all aspects of smoke and wildfire detection. The choice of model depends on specific application requirements, considering accuracy, recall, and inference time. This research contributes to the field of computer vision in smoke and wildfire detection, providing a foundation for improving detection systems and mitigating the impact of wildfires. Researchers can build upon these findings to propose modifications and enhance the effectiveness of wildfire detection systems.
“…The smaller YOLOv6 models (nano, tiny, and small) harness reparameterized VGG networks (RepBlock) with skip connections for training, which transition to simple 3x3 convolutional (RepConv) blocks during inference [22]. Meanwhile, the medium and large YOLOv6 models deploy reparameterized versions of the CSP backbone, termed CSP-StackRep, culminating in the EfficientRep backbone [21], [23].…”
This paper presents a comprehensive evaluation of various YOLO architectures for smoke and wildfire detection, including YOLOv5, YOLOv6, YOLOv7, YOLOv8, and YOLO-NAS. The study aims to assess their effectiveness in early detection of wildfires using the Foggia dataset, comprising 8,974 images specifically designed for this purpose. Performance evaluation employs metrics such as Recall, Precision, F1-score, and mean Average Precision to provide a holistic assessment of the models' performance. The study follows a rigorous methodology involving fixed epochs, continuous performance tracking, and unbiased testing. Results show that YOLOv5, YOLOv7, and YOLOv8 exhibit a balanced performance across all metrics in both validation and testing. YOLOv6 performs slightly lower in recall during validation but achieves a good balance on testing. YOLO-NAS variants excel in recall, making them suitable for critical applications. However, precision performance is lower for YOLO-NAS models. Visual results demonstrate that the top-performing models accurately identify most instances in the test set. However, they struggle with distant scenes and poor lighting conditions, occasionally detecting false positives. In favorable conditions, the models perform well in identifying relevant instances. We conclude that no single model excels in all aspects of smoke and wildfire detection. The choice of model depends on specific application requirements, considering accuracy, recall, and inference time. This research contributes to the field of computer vision in smoke and wildfire detection, providing a foundation for improving detection systems and mitigating the impact of wildfires. Researchers can build upon these findings to propose modifications and enhance the effectiveness of wildfire detection systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.