2020
DOI: 10.48550/arxiv.2009.14385
Preprint

AttendNets: Tiny Deep Image Recognition Neural Networks for the Edge via Visual Attention Condensers

Abstract: While significant advances in deep learning have resulted in state-of-the-art performance across a large number of complex visual perception tasks, the widespread deployment of deep neural networks for TinyML applications involving on-device, low-power image recognition remains a major challenge given the complexity of deep neural networks. In this study, we introduce AttendNets, low-precision, highly compact deep neural networks tailored for on-device image recognition. More specifically, AttendNets possess deep…

Cited by 10 publications (14 citation statements). References 27 publications.
“…Third, it can be observed that a majority of the layers of the TB-Net network architecture design are composed of visual attention condensers (Wong et al., 2020b), a variant of the highly efficient attention condenser self-attention mechanisms recently introduced in Wong et al. (2020a). More specifically, visual attention condensers produce condensed embeddings characterizing joint spatial and cross-channel activation relationships and achieve selective attention accordingly, improving representational capability while maintaining very low architectural and computational complexity.…”
Section: Methods (mentioning)
confidence: 99%
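The condensed-embedding mechanism described in this excerpt can be illustrated with a minimal sketch, assuming PyTorch; the class name, layer choices, and parameters below are illustrative assumptions, not the authors' actual implementation (see Wong et al., 2020a, 2020b for that):

```python
# Minimal sketch of a visual attention condenser (VAC), assuming PyTorch.
# Layer choices are illustrative, not the design from Wong et al.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VisualAttentionCondenser(nn.Module):
    def __init__(self, channels: int, down_ch: int, embed_ch: int):
        super().__init__()
        # Down-mixing: reduce channel dimensionality before embedding.
        self.down_mix = nn.Conv2d(channels, down_ch, kernel_size=1)
        # Condensed embedding: pool spatially, then convolve to capture
        # joint spatial and cross-channel activation relationships.
        self.pool = nn.MaxPool2d(kernel_size=2)
        self.embed = nn.Conv2d(down_ch, embed_ch, kernel_size=3, padding=1)
        # Up-mixing: restore the original channel dimensionality.
        self.up_mix = nn.Conv2d(embed_ch, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        a = self.down_mix(x)              # down-mixing: channel reduction
        a = self.embed(self.pool(a))      # condensed embedding
        # Expand the condensed embedding back to the input resolution.
        a = F.interpolate(a, size=x.shape[-2:], mode="nearest")
        attn = torch.sigmoid(self.up_mix(a))
        # Selective attention: scale the input by the learned attention map.
        return x * attn
```

The down-mix/pool/embed path is what condenses the embedding; the sigmoid-gated up-mixing restores the channel width and applies selective attention as an element-wise scaling of the input features.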
“…We take into account several computational and best-practice constraints, formulated via the indicator function 1_g(·): i) the macroarchitecture design uses several parallel columns to significantly reduce the architectural and computational complexity with much greater disentanglement of learned features; ii) to reduce the considerable information loss caused by the pointwise strided convolutions used in residual networks [6] and the RegNet architecture [10], we exclude them from the optimization; iii) antialiasing downsampling (AADS) [24] modules are to be used in the early network stage to improve network stability and robustness; iv) FLOPs must be within 20% of 100M FLOPs for edge compute scenarios. In the machine-driven design exploration process, visual attention condensers (VACs) [20], [21] are used as a highly efficient self-attention module to learn and produce condensed embeddings characterizing the joint local and cross-channel activation relationships. The machine-driven design exploration process then automatically determines the best way to satisfy the defined constraints in designing the CellDefectNet architecture.…”
Section: Methods (mentioning)
confidence: 99%
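As a rough illustration of how the indicator function 1_g(·) in this excerpt could gate candidate designs, here is a minimal Python sketch; the argument names and exact constraint checks are assumptions inferred from the excerpt, not the cited paper's code:

```python
# Hedged sketch of the constraint indicator 1_g(.) from the excerpt above:
# returns 1 only if a candidate design satisfies all stated constraints.
# Argument names and checks are assumptions, not the paper's implementation.
def indicator_g(flops: float,
                has_pointwise_strided_conv: bool,
                aads_in_early_stage: bool) -> int:
    target, tol = 100e6, 0.20  # iv) FLOPs within 20% of 100M
    flops_ok = abs(flops - target) <= tol * target
    return int(flops_ok
               and not has_pointwise_strided_conv   # ii) excluded
               and aads_in_early_stage)             # iii) required

# Example: a 108M-FLOP design with early AADS and no strided pointwise convs.
print(indicator_g(1.08e8, False, True))  # -> 1
```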
“…Early-stage self-attention: VACs are leveraged heavily within the initial modules of the network architecture. The VAC was first introduced by Wong et al. [21] for image classification. VACs help to better model activation relationships and improve selective attention.…”
(mentioning)
confidence: 99%
“…The number in each convolution module represents the number of channels. The numbers in each visual attention condenser represent the number of channels for the down-mixing layer, the embedding structure, and the up-mixing layer, respectively (details can be found in [15]). It can be observed that all Cancer-Net SCa architectures exhibit both great macroarchitecture and microarchitecture design diversity, with certain models exhibiting specific lightweight macroarchitecture design characteristics such as attention condensers and projection-expansion-projection-expansion (PEPE) design patterns comprised of channel dimensionality reduction, depthwise convolutions, and pointwise convolutions. The microarchitecture designs are highly diverse and heterogeneous, with a mix of spatial, pointwise, and depthwise convolutions.…”
Section: Diverse Heterogeneous Designs (mentioning)
confidence: 99%
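To make the three-number convention in this excerpt concrete, here is a hypothetical instantiation, assuming the VisualAttentionCondenser sketch given earlier; the specific channel counts are made up for illustration and do not come from [15]:

```python
# Hypothetical instantiation: the three per-condenser numbers map to the
# down-mixing channels, the embedding channels, and the channel width
# restored by the up-mixing layer. The counts here are illustrative only.
vac = VisualAttentionCondenser(channels=32, down_ch=8, embed_ch=8)
x = torch.randn(1, 32, 56, 56)   # dummy feature map (N, C, H, W)
y = vac(x)                       # attention-scaled features, same shape as x
```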
“…The use of computer vision and machine learning for the diagnosis of pigmented skin lesions has been shown to be accurate and practical [5][6][7][9][10][11][12][13]; it can improve biopsy decision making [10] and act as a pre-screening tool to reduce the amount of time a professional spends on each case. Motivated by the challenge of skin cancer detection, and inspired by the open source and open access efforts of the research community, in this study we introduce Cancer-Net SCa, a suite of deep neural network designs tailored for the detection of skin cancer from dermoscopy images, one of which possesses a self-attention architecture design with attention condensers [14,15]. To construct Cancer-Net SCa, we leveraged a machine-driven design strategy that combines human experience and ingenuity with the meticulousness and raw speed of machines.…”
Section: Introduction (mentioning)
confidence: 99%