In recent years, ship detection in satellite remote sensing images has become an important research topic. Most existing methods detect ships by using a rectangular bounding box but do not perform segmentation down to the pixel level. This paper proposes a ship detection and segmentation method based on an improved Mask R-CNN model. Our proposed method can accurately detect and segment ships at the pixel level. By adding a bottom-up structure to the FPN structure of Mask R-CNN, the path between the lower layers and the topmost layer is shortened, allowing the lower layer features to be more effectively utilized at the top layer. In the bottom-up structure, we use channel-wise attention to assign weights in each channel and use the spatial attention mechanism to assign a corresponding weight at each pixel in the feature maps. This allows the feature maps to respond better to the target's features. Using our method, the detection and segmentation mAPs increased from 70.6% and 62.0% to 76.1% and 65.8%, respectively.
Plant disease detection has an inestimable effect on plant cultivation. Accurate detection of plant disease can control the spread of disease early and prevent unnecessary loss. Strawberry verticillium wilt is a soil-borne, multi-symptomatic disease. To detect strawberry verticillium wilt accurately, we first propose a disease detection network based on Faster R-CNN and multi-task learning to detect strawberry verticillium wilt. Then, the strawberry verticillium wilt detection network (SVWDN), which uses attention mechanisms in the feature extraction of the disease detection network, is proposed. SVWDN detects verticillium wilt according to the symptoms of detected plant components (i.e.,young leaves and petioles). Compared with other existing methods for detecting disease from the whole plant appearance, the SVWDN automatically classifies the petioles and young leaves while determining whether the strawberry has verticillium wilt. To provide a dataset for evaluating and testing our method, we construct a large dataset that contains 3, 531 images with 4 categories (Healthy_leaf, Healthy_petiole, Verticillium_leaf and Verticillium_petiole). Each image also has a label to indicate whether the strawberry is suffering from verticillium wilt. With the proposed strawberry verticillium wilt detection network, we achieved a mAP of 77.54% on object detection of 4 categories and 99.95% accuracy for strawberry verticillium wilt detection.
Fine-Grained Visual Categorization (FGVC) aims to distinguish between extremely similar subordinate-level categories within the same basic-level category. Existing research has proven the great importance of the discriminative features in FGVC but ignored the contributions for correct classification from other features, and the extracted features always contain more information about the obvious regions but less about subtle regions. In this paper, firstly, a novel module named forcing module is proposed to force the network to extract more diverse features for FGVC, which generates a suppression mask based on the class activation maps to suppress the most distinguishable regions, so as to force the network to extract other secondary distinguishable features as the final features. The forcing module consists of the original branch and the forcing branch. The original branch focuses on the primary discriminative regions while the forcing branch focuses on secondary discriminative regions. Secondly, in order to solve the problem that information of small-scale distinguishable features is lost seriously after multi-layer down-sampling, according to the class activation maps of the first prediction, the object is cropped and scaled as the second input. To reduce the prediction error, the first and second prediction probabilities are fused as the final prediction result. Experimental results indicate that the proposed method not only outperforms the baseline model by a large margin (3.7%, 5.9%, 3.1% respectively) on CUB-200-2011, Stanford-Cars, and FGVC-Aircraft, but also achieves state-of-the-art performance on FGVC-Aircraft.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.