Recent advances in sensor based human activity recognition (HAR) have exploited deep hybrid networks to improve the performance. These hybrid models combine Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) to leverage their complementary advantages, and achieve impressive results. However, the roles and associations of different sensors in HAR are not fully considered by these models, leading to insufficient multi-modal fusion. Besides, the commonly used RNNs in HAR suffer from the 'forgetting' defect, which raises difficulties in capturing long-term information. To tackle these problems, an HAR framework composed of an Inertial Measurement Unit (IMU) fusion block and an applied ConvTransformer subnet is proposed in this paper. Inspired by the complementary filter, our IMU fusion block performs multi-modal fusion of commonly used sensors according to their physical relationships. Consequently, the features of different modalities can be aggregated more effectively. Then, the extracted features are fed into the applied ConvTransformer subnet for classification. Thanks to its convolutional subnet and self-attention layers, ConvTransformer can better capture local features and construct long-term dependencies. Extensive experiments on eight benchmark datasets demonstrate the superior performance of our framework. The source code will be published soon.
The task of the SHL recognition challenge 2021 is to recognize eight modes of locomotion-transportation based on radio sequential data collected by smartphones. These data includes GPS reception, GPS location, WiFi reception and GSM cell tower scans. In this challenge, our team (Transformers) presents a recognition scheme. First, a deep model (ConvTransformer) composed of convolutional and Transformer subnets is proposed. The convolutional subnet captures local features, and the Transformer subnet constructs longterm dependencies. Then, the ConvTransformer network is used to recognize different locomotion and transportation modes through location data. Finally, since the still and subway categories are easily confused, our team uses another ConvTransformer network to further classify them through cells and location data. Through cross validation techniques on the training dataset, our proposed scheme achieved a F1 score of 0.6838 on the validation dataset. CCS CONCEPTS• Human-centered computing -> Ubiquitous and mobile computing; • Computing methodologies -> Neural networks;
The fusion of multiple monitoring sensors is crucial to improve the accuracy and robustness of machinery fault diagnosis. However, existing fault diagnosis methods may underestimate the interference of noise in the multi-sensor fusion process, leading to unsatisfied performance. To handle this problem, this paper proposes a deep model based on the frequency adaptive wavelet pyramid. First, an adaptive frequency selection strategy is designed to prune the seriously polluted frequencies and only retain some key frequencies. Then, the self-attention mechanism is used to perform information fusion on the selected frequency bands of different sensors. Finally, a wavelet fusion pyramid is adopted by repeating the fusion process at multiple wavelet decomposition levels. In this way, different sensors can be fused in a more fine-grained manner. The experimental results on two multi-sensor-based fault diagnosis datasets demonstrate the anti-noise capability of our proposed method.
The fusion of multiple monitoring sensors is crucial to improve the accuracy and robustness of machinery fault diagnosis. However, existing fault diagnosis methods may underestimate the interference of noise in the multi‐sensor fusion process, leading to unsatisfied performance. To handle this problem, this paper proposes a deep model based on the frequency adaptive wavelet pyramid. First, an adaptive frequency selection strategy is designed to prune the seriously polluted frequencies and only retain some key frequencies. Then, the self‐attention mechanism is used to perform information fusion on the selected frequency bands of different sensors. Finally, a wavelet fusion pyramid is adopted by repeating the fusion process at multiple wavelet decomposition levels. In this way, different sensors can be fused in a more fine‐grained manner. The experimental results on two multi‐sensor‐based fault diagnosis datasets demonstrate the anti‐noise capability of the proposed method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.