Jie Hu scite author profile

The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets. We further demonstrate that SE blocks bring significant improvements in performance for existing state-of-the-art CNNs at slight additional computational cost. Squeeze-and-Excitation Networks formed the foundation of our ILSVRC 2017 classification submission which won first place and reduced the top-5 error to 2.251%, surpassing the winning entry of 2016 by a relative improvement of ∼25%. Models and code are available at https://github.com/hujie-frank/SENet.

show abstract

Squeeze-and-Excitation Networks

Shen

Albanie

et al. 2018

14,238

1,786

View full text Add to dashboard Cite

Acoustic scene classification (ASC) is one of the most popular problems in the field of machine listening. The objective of this problem is to classify an audio clip into one of the predefined scenes using only the audio data. This problem has considerably progressed over the years in the different editions of DCASE. It usually has several subtasks that allow to tackle this problem with different approaches. The subtask presented in this report corresponds to a ASC problem that is constrained by the complexity of the model as well as having audio recorded from different devices, known as mismatch devices (real and simulated). The work presented in this report follows the research line carried out by the team in previous years. Specifically, a system based on two steps is proposed: a two-dimensional representation of the audio using the Gamamtone filter bank and a convolutional neural network using squeeze-excitation techniques. The presented system outperforms the baseline by about 17 percentage points.

show abstract

An integrated AHP and VIKOR for design concept evaluation based on rough number

Zhu

et al. 2015

Advanced Engineering Informatics

225

131

View full text Add to dashboard Cite

2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS

et al. 2020

View full text Add to dashboard Cite

Thyroid nodules are very common all over the world, and China is no exception. Ultrasound plays an important role in determining the risk stratification of thyroid nodules, which is critical for clinical management of thyroid nodules. For the past few years, many versions of TIRADS (Thyroid Imaging Reporting and Data System) have been put forward by several institutions with the aim to identify whether nodules require fine-needle biopsy or ultrasound follow-up. However, no version of TIRADS has been widely adopted worldwide till date. In China, as many as ten versions of TIRADS have been used in different hospitals nationwide, causing a lot of confusion. With the support of the Superficial Organ and Vascular Ultrasound Group of the Society of Ultrasound in Medicine of the Chinese Medical Association, the Chinese-TIRADS that is in line with China's national conditions and medical status was established based on literature review, expert consensus, and multicenter data provided by the Chinese Artificial Intelligence Alliance for Thyroid and Breast Ultrasound.

show abstract

A Key Volume Mining Deep Framework for Action Recognition

et al. 2016

View full text Add to dashboard Cite

EMG‐Based Estimation of Limb Movement Using Deep Learning With Recurrent Convolutional Neural Networks

2017

View full text Add to dashboard Cite

A novel model based on deep learning is proposed to estimate kinematic information for myoelectric control from multi-channel electromyogram (EMG) signals. The neural information of limb movement is embedded in EMG signals that are influenced by all kinds of factors. In order to overcome the negative effects of variability in signals, the proposed model employs the deep architecture combining convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The EMG signals are transformed to time-frequency frames as the input to the model. The limb movement is estimated by the model that is trained with the gradient descent and backpropagation procedure. We tested the model for simultaneous and proportional estimation of limb movement in eight healthy subjects and compared it with support vector regression (SVR) and CNNs on the same data set. The experimental studies show that the proposed model has higher estimation accuracy and better robustness with respect to time. The combination of CNNs and RNNs can improve the model performance compared with using CNNs alone. The model of deep architecture is promising in EMG decoding and optimization of network structures can increase the accuracy and robustness.

show abstract

Involution: Inverting the Inherence of Convolution for Visual Recognition

Wang

et al. 2021

214

View full text Add to dashboard Cite

Smooth and time-optimal S-curve trajectory planning for automated robots and machines

Fang

Liu

et al. 2019

Mechanism and Machine Theory

135

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jie Hu

Squeeze-and-Excitation Networks

Squeeze-and-Excitation Networks

An integrated AHP and VIKOR for design concept evaluation based on rough number

2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS

A Key Volume Mining Deep Framework for Action Recognition

EMG‐Based Estimation of Limb Movement Using Deep Learning With Recurrent Convolutional Neural Networks

Involution: Inverting the Inherence of Convolution for Visual Recognition

Smooth and time-optimal S-curve trajectory planning for automated robots and machines

Contact Info

Product

Resources

About