2021
DOI: 10.1109/jiot.2021.3063147

On-Device Learning Systems for Edge Intelligence: A Software and Hardware Synergy Perspective

Abstract: Modern machine learning (ML) applications are often deployed in the cloud environment to exploit the computational power of clusters. However, this in-cloud computing scheme cannot satisfy the demands of emerging edge intelligence scenarios, including providing personalized models, protecting user privacy, adapting to real-time tasks, and saving resource costs. To overcome the limitations of conventional in-cloud computing, on-device learning has emerged, which makes the end-to-end ML procedure t…

Cited by 26 publications (10 citation statements)
References 60 publications
“…Information storage is not typically an issue with cloud systems, but every query involves delays, both in the cloud and on-device [15,57,74]. To reduce the delay, we analyzed several lossless text compression algorithms, including run-length encoding, Shannon-Fano encoding, arithmetic encoding, Huffman encoding, and LZW compression [61].…”
Section: Huffman Encoding and Decoding (mentioning)
confidence: 99%
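Of the lossless schemes the quoted study compares, run-length encoding is the simplest to illustrate. A minimal sketch (illustrative only, not the cited implementation): each run of identical characters collapses into a (character, count) pair, and decoding expands the pairs back.

```python
from itertools import groupby

def rle_encode(text: str) -> list[tuple[str, int]]:
    # Collapse each run of identical characters into a (char, count) pair.
    return [(ch, len(list(group))) for ch, group in groupby(text)]

def rle_decode(pairs: list[tuple[str, int]]) -> str:
    # Expand each (char, count) pair back into its run of characters.
    return "".join(ch * n for ch, n in pairs)

encoded = rle_encode("aaabbbbcc")  # [('a', 3), ('b', 4), ('c', 2)]
assert rle_decode(encoded) == "aaabbbbcc"
```

RLE only pays off on data with long runs; the other algorithms in the list (Huffman, arithmetic, LZW) exploit symbol frequencies or repeated substrings instead, which is why the study compares several.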
“…Moreover, it is also necessary to develop on-device learning solutions to support self-learning aerial computing systems. For example, improved network architectures, training optimization, and hardware design were used to accelerate on-device data training [159]. A streamlined slimming framework was developed and combined with a consecutive tensor layer to improve the training rate.…”
Section: B. Resource Management (mentioning)
confidence: 99%
“…Fixed-point quantization could effectively reduce this high energy consumption and optimize the effect of neural network training [29]. There have been many recent studies focusing on fixed-point quantization [22, 30–35], which are summarized in Table 2:

[31] Integer-only quantization scheme: 8-bit quantization, quantized inference framework, quantized training framework
[32] INT8 training method (Octo): quantization error, INT8 training
[33] Relaxed Quantization (RQ): network discretization, "smooth" quantization procedure
[34] Quantization-interval-learning (QIL): quantization in low bit-width networks, trainable quantization interval
[35] Data-free quantization method (DFQ): algorithm…”
Section: Introduction (mentioning)
confidence: 99%
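The fixed-point schemes surveyed above share a common core: mapping floating-point values to low-bit integers through a scale factor. A minimal sketch of symmetric INT8 quantization and the round-trip error it introduces (illustrative only; not the actual Octo, RQ, QIL, or DFQ algorithms):

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    # Symmetric per-tensor quantization: the scale maps max |x| to 127.
    m = float(np.max(np.abs(x)))
    scale = m / 127.0 if m > 0 else 1.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    # Recover an approximation of the original floats.
    return q.astype(np.float32) * scale

w = np.array([-0.5, 0.0, 0.25, 0.5], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# Round-trip error per element is bounded by half the scale step.
```

The per-element rounding error is at most scale/2, which is the "quantization error" that schemes like Octo [32] explicitly model and compensate for during INT8 training.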
“…Jacob et al. proposed a quantized inference framework that quantizes both the weights and activations in the neural network [31]. Building on the first two works, a lightweight INT8 training method was proposed in which both the forward and backward passes are optimized by fixed-point quantization [32]. These studies discuss in depth the problems encountered when fixed-point quantization is applied to neural network training and provide effective solutions.…”
Section: Introduction (mentioning)
confidence: 99%
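Quantizing both weights and activations, as in the framework of [31], lets the multiply-accumulate run entirely in integer arithmetic, with a single rescale at the end. A hedged sketch of that pattern (the function names and per-tensor scaling here are illustrative assumptions, not the cited framework's API):

```python
import numpy as np

def quantize(t: np.ndarray):
    # Symmetric per-tensor quantization: the scale maps max |t| to 127.
    m = float(np.max(np.abs(t)))
    scale = m / 127.0 if m > 0 else 1.0
    q = np.clip(np.round(t / scale), -127, 127).astype(np.int32)
    return q, scale

def quantized_matmul(x: np.ndarray, w: np.ndarray) -> np.ndarray:
    # Quantize activations and weights, accumulate in integer arithmetic,
    # then rescale the int32 accumulator back to floating point.
    qx, sx = quantize(x)
    qw, sw = quantize(w)
    acc = qx @ qw  # integer accumulation, as in INT8 inference kernels
    return acc.astype(np.float32) * np.float32(sx * sw)

x = np.array([[1.0, -2.0], [0.5, 3.0]], dtype=np.float32)
w = np.array([[0.25, -1.0], [2.0, 0.5]], dtype=np.float32)
y = quantized_matmul(x, w)  # approximates x @ w within quantization error
```

The INT8 training method of [32] extends this idea from inference to the backward pass, where gradients are also kept in fixed point.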