In recent years, impressive performance of deep learning technology has been recognized in Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR). Since a large amount of annotated data is required in this technique, it poses a trenchant challenge to the issue of obtaining a high recognition rate through less labeled data. To overcome this problem, inspired by the contrastive learning, we proposed a novel framework named Batch Instance Discrimination and Feature Clustering (BIDFC). In this framework, different from that of the objective of general contrastive learning methods, embedding distance between samples should be moderate because of the high similarity between samples in the SAR images. Consequently, our flexible framework is equipped with adjustable distance between embedding, which we term as weakly contrastive learning. Technically, instance labels are assigned to the unlabeled data in per batch and random augmentation and training are performed few times on these augmented data. Meanwhile, a novel Dynamic-Weighted Variance loss (DWV loss) function is also posed to cluster the embedding of enhanced versions for each sample. Experimental results on the moving and stationary target acquisition and recognition (MSTAR) database indicate a 91.25% classification accuracy of our method fine-tuned on only 3.13% training data. Even though a linear evaluation is performed on the same training data, the accuracy can still reach 90.13%. We also verified the effectiveness of BIDFC in OpenSarShip database, indicating that our method can be generalized to other datasets. Our code is avaliable at: https://github.com/Wenlve-Zhou/BIDFCmaster.
Facial beauty prediction (FBP), as a frontier topic in the domain of artificial intelligence regarding anthropology, has witnessed some good results as deep learning technology progressively develops. However, it is still limited by the complexity of the deep structure network in need of a large number of parameters and high dimensions, easily leading to a great consumption of time. To solve this problem, this paper proposes a fast training FBP method based on local feature fusion and broad learning system (BLS). Firstly, two-dimensional principal component analysis (2DPCA) is employed to reduce the dimension of the local texture image so as to lessen its redundancy. Secondly, local feature fusion method is adopted to extract more advanced features through avoiding the effects from unstable illumination, individual differences, and various postures. Finally, extensional feature eigenvectors are input to the broad learning network to train an efficient FBP model, which effectively shortens operational time and improve its preciseness. Extensive experiments with the proposed method on large scale Asian female beauty database (LSAFBD) can be conducted within 13.33s while sustaining an accuracy of 58.97%, impressively outstripping other state-of-the-art methods in training speed. INDEX TERMS Facial beauty prediction (FBP); local feature fusion; broad learning system (BLS);
Traditional base station antenna measurement methods conducted with professional worker climbing towers tend to raise safety and inefficiency concerns in practical application. Designed to address the above problems, this paper proposes an intelligent and fully automatic antenna measurement unmanned aerial vehicle (UAV) system for mobile communication base station. Firstly, an antenna database, containing 19,715 images, named UAV-Antenna is constructed by image capturing with the help of UAVs flying around various base stations. Secondly, Mask R-CNN is adopted to train an optimal instance segmentation model on UAV-Antenna. Then, pixel coordinates and threshold are utilized for measuring antenna quantity and separate all antenna data for further measuring. Finally, a least squares method is employed for measuring antenna parameters. Experimental results show that the proposed method can not only satisfy the industry application standards, but also guarantee safety of labors and efficiency of performance.
Though Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR) via Convolutional Neural Networks (CNNs) has made huge progress toward deep learning, some key issues still remain unsolved due to the lack of sufficient samples and robust model. In this paper, we proposed an efficient transferred Max-Slice CNN (MS-CNN) with L2-Regularization for SAR ATR, which could enrich the features and recognize the targets with superior performance. Firstly, the data amplification method is presented to reduce the computational time and enrich the raw features of SAR targets. Secondly, the proposed MS-CNN framework with L2-Regularization is trained to extract robust features, in which the L2-Regularization is incorporated to avoid the overfitting phenomenon and further optimizing our proposed model. Thirdly, transfer learning is introduced to enhance the feature representation and discrimination, which could boost the performance and robustness of the proposed model on small samples. Finally, various activation functions and dropout strategies are evaluated for further improving recognition performance. Extensive experiments demonstrated that our proposed method could not only outperform other state-of-the-art methods on the public and extended MSTAR dataset but also obtain good performance on the random small datasets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.