2021
DOI: 10.3390/info12070264

Combine-Net: An Improved Filter Pruning Algorithm

Abstract: The powerful performance of deep learning is evident to all. As research has deepened, neural networks have become more complex and do not generalize easily to resource-constrained devices. The emergence of a series of model-compression algorithms has made artificial intelligence on the edge possible. Among them, structured model pruning is widely utilized because of its versatility. Structured pruning prunes the neural network itself, discarding relatively unimportant structures to compress the model’s siz…

Cited by 2 publications (3 citation statements)
References 13 publications (31 reference statements)
“…Some of these parameters play a feeble role in the target-detection process: their cumulative impact on the feature map is negligible, and removing them has little effect on detection accuracy; therefore, the parameters between model layers need to be further compressed and optimized. Model pruning is a widely used model-compression technique. From the perspective of pruning granularity, pruning methods can be classified as structured or unstructured (Wang et al., 2021). Filter Pruning via Geometric Median (FPGM) (He et al., 2019) is a structured weight-pruning method. The essence of the algorithm is to identify the filters that lie close to the geometric median of the network’s filters, and to accelerate inference by eliminating these redundant filters along with their associated input-output relations.…”
Section: Methods
confidence: 99%
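The FPGM criterion quoted above can be sketched in a few lines: rank each filter by its total Euclidean distance to all other filters, and remove those with the smallest totals, since they sit nearest the geometric median and carry the most redundant information. The sketch below is only an illustration on toy 2-D "filters"; the function name `fpgm_prune` and the pruning ratio are ours, not from the cited paper, which operates on real convolutional filter tensors.

```python
import math

def fpgm_prune(filters, prune_ratio=0.5):
    """FPGM-style criterion (sketch): filters whose total Euclidean
    distance to all other filters is smallest lie nearest the geometric
    median, so they are treated as redundant and removed."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    # Total distance from each filter to every other filter.
    totals = [sum(dist(f, g) for g in filters) for f in filters]
    n_prune = int(len(filters) * prune_ratio)
    # Indices nearest the geometric median are pruned first.
    nearest = set(sorted(range(len(filters)), key=lambda i: totals[i])[:n_prune])
    return [f for i, f in enumerate(filters) if i not in nearest]

# Toy example: four 2-D "filters"; two are near-duplicates around (1, 1),
# so pruning 25% removes one of that redundant pair.
filters = [[1.0, 1.0], [1.1, 0.9], [5.0, 5.0], [-4.0, 3.0]]
kept = fpgm_prune(filters, prune_ratio=0.25)
```

In a real network, removing a filter also removes the corresponding channel in the next layer's input, which is the "associated input-output relations" the excerpt refers to.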
“…In unstructured pruning [45], unimportant or least-important connections and neurons in the pretrained model are removed depending on the value of the weight magnitudes. Weight pruning optimizes the DL model by removing the unimportant weights from the neural network.…”
Section: Model Compression Strategies for Edge Computing
confidence: 99%
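The magnitude-based unstructured pruning described in this excerpt can be illustrated as follows. This is a minimal sketch on a flat weight list; the name `magnitude_prune` and the example values are ours, and real implementations operate on whole tensors with a mask.

```python
def magnitude_prune(weights, sparsity=0.5):
    """Unstructured pruning sketch: zero out the fraction `sparsity`
    of weights with the smallest absolute magnitudes."""
    k = int(len(weights) * sparsity)
    # Indices ordered by |w|, smallest first; the first k get zeroed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    to_zero = set(order[:k])
    return [0.0 if i in to_zero else w for i, w in enumerate(weights)]

weights = [0.01, -1.2, 0.003, 0.8, -0.05, 2.1]
sparse = magnitude_prune(weights, sparsity=0.5)
# The three smallest-magnitude weights (0.003, 0.01, -0.05) are zeroed.
```

Unlike structured filter pruning, the result is a sparse tensor of the original shape, so the speedup depends on hardware or library support for sparsity.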
“…Representing the model weights with lower bit-width parameters reduces inference latency and storage. A few quantization approaches [45] are k-means clustering combined with Huffman encoding, binary quantization, and 1-bit quantization, which represents a 32-bit number as a 1-bit integer.…”
Section: Model Compression Strategies for Edge Computing
confidence: 99%
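The 1-bit quantization mentioned above can be sketched with a sign-plus-scale scheme: each weight is stored as its sign (1 bit) and the whole tensor shares a single float scale. This particular scale choice (alpha = mean of |w|, as in XNOR-Net-style binarization) is our assumption for illustration; the cited survey may describe a different variant.

```python
def binary_quantize(weights):
    """1-bit quantization sketch: store sign(w) per weight plus one
    shared scale alpha = mean(|w|) (XNOR-Net-style; an assumption,
    not necessarily the exact scheme in the cited survey)."""
    alpha = sum(abs(w) for w in weights) / len(weights)
    signs = [1 if w >= 0 else -1 for w in weights]
    # Dequantized values actually used at inference time.
    approx = [alpha * s for s in signs]
    return signs, alpha, approx

signs, alpha, approx = binary_quantize([0.5, -1.5, 1.0])
```

Storage drops from 32 bits per weight to 1 bit per weight plus a single 32-bit scale, at the cost of a coarser approximation of the original values.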