To enable the apple picking robot to quickly and accurately detect apples under the complex background in orchards, we propose an improved You Only Look Once version 4 (YOLOv4) model and data augmentation methods. Firstly, the crawler technology is utilized to collect pertinent apple images from the Internet for labeling. For the problem of insufficient image data caused by the random occlusion between leaves, in addition to traditional data augmentation techniques, a leaf illustration data augmentation method is proposed in this paper to accomplish data augmentation. Secondly, due to the large size and calculation of the YOLOv4 model, the backbone network Cross Stage Partial Darknet53 (CSPDarknet53) of the YOLOv4 model is replaced by EfficientNet, and convolution layer (Conv2D) is added to the three outputs to further adjust and extract the features, which make the model lighter and reduce the computational complexity. Finally, the apple detection experiment is performed on 2670 expanded samples. The test results show that the EfficientNet-B0-YOLOv4 model proposed in this paper has better detection performance than YOLOv3, YOLOv4, and Faster R-CNN with ResNet, which are state-of-the-art apple detection model. The average values of Recall, Precision, and F1 reach 97.43%, 95.52%, and 96.54% respectively, the average detection time per frame of the model is 0.338 s, which proves that the proposed method can be well applied in the vision system of picking robots in the apple industry.
Abstract:Although optical remote sensing can intuitively detect algal bloom, it is limited by the weather conditions. Synthetic aperture radar (SAR) is not affected by inadequate weather conditions. According to visual interpretation of SAR images and comparisons of quasi-synchronized optical images, the gathering areas of algal bloom present as "dark regions" on SAR images. It is shown that using SAR to monitor the water surface is workable. However, dark regions may also be caused by other factors, such as low wind speeds. This challenges with SAR monitoring of algal bloom on the water surface. In this study, an improved K-means algorithm, combined with multi-Otsu thresholding algorithm, was proposed to segment the dark regions. After feature analysis and extraction of Sentinel-1A images, an algal bloom recognition model with a support vector machine (SVM) was applied to discriminate the algal bloom dark regions from the low wind dark regions. According the experimental results, the overall accuracy achieved 74.00% in Taihu Lake. Additionally, this method was also validated in Chaohu Lake and Danjiangkou Reservoir. Therefore, it can be concluded that SAR can provide a new technical means for monitoring algal bloom of inland lakes, particularly when it is cloudy and unsuitable for optical remote sensing. To obtain more information about algal bloom, multi-band and multi-polarization SAR images can be considered for future.
With the popular use of geotagging images, more and more research efforts have been placed on geographical scene classification. In geographical scene classification, valid spatial feature selection can significantly boost the final performance. Bag of visual words (BoVW) can do well in selecting feature in geographical scene classification; nevertheless, it works effectively only if the provided feature extractor is well-matched. In this paper, we use convolutional neural networks (CNNs) for optimizing proposed feature extractor, so that it can learn more suitable visual vocabularies from the geotagging images. Our approach achieves better performance than BoVW as a tool for geographical scene classification, respectively, in three datasets which contain a variety of scene categories.
The online detection of the three-dimensional (3D) shape of a weld pool is a key unsolved question for weld shape control. In this paper, a method to reconstruct the 3D shape of a weld pool boundary from a two-dimensional (2D) visual image is proposed. Firstly, a new 3D model was developed to describe the weld pool geometry in pulsed gas metal arc welding (P-GMAW). In this model, four parameters could be used to determine the weld pool shape: the maximum width W, the length of the pool tail L, the height of the rear of the pool H and the projection of the rear angle onto the work piece plane θL. Then, the clear weld pool images from the P-GMAW were captured from the upper side, and a series of algorithms were developed to extract the model geometrical parameters from these images. Finally, the 3D shape of the weld pool was reconstructed from the 2D images. The results were then validated with metallograph observations. The measurement error of this method was investigated, and methods for improvement were discussed.
Synthetic Aperture Radar (SAR), as a microwave sensor that can sense a target all day or night under all-weather conditions, is of great significance for detecting water resources, such as coastlines, lakes and rivers. This paper reviews literature published in the past 30 years in the field of water body extraction in SAR images, and makes some proposals that the community working with SAR image waterbody extraction should consider. Firstly, this review focuses on the main ideas and characteristics of traditional water body extraction on SAR images, mainly focusing on traditional Machine Learning (ML) methods. Secondly, how Deep Learning (DL) methods are applied and optimized in the task of water-body segmentation for SAR images is summarized from the two levels of pixel and image. We also pay more attention to the most popular networks, such as U-Net and its modified models, and novel networks, such as the Cascaded Fully-Convolutional Network (CFCN) and River-Net. In the end, an in-depth discussion is presented, along with conclusions and future trends, on the limitations and challenges of DL for water-body segmentation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.