2022
DOI: 10.48550/arxiv.2204.04705
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems

Abstract: We design deep neural networks (DNNs) and corresponding networks' splittings to distribute DNNs' workload to camera sensors and a centralized aggregator on head mounted devices to meet system performance targets in inference accuracy and latency under the given hardware resource constraints. To achieve an optimal balance among computation, communication, and performance, a splitaware neural architecture search framework, SplitNets, is introduced to conduct model designing, splitting, and communication reductio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 41 publications
(82 reference statements)
0
0
0
Order By: Relevance
“…Finally, in the case of splitting NNs, trends from offloading can carry over to single-device multi-processor architectures. An emerging approach is the use of a nearsensor unit with pre-processing and compression layers (the encoder) and a central unit processing the bulk of computation (the decoder) (GOMEZ et al, 2022;DONG et al, 2022). According to Abrash (2021), this is pointed as key HW/SW evolution for enabling AR devices because the communication between sensor and compute elements can be the most significant factor in energy consumption.…”
Section: Offloading For Augmented Realitymentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, in the case of splitting NNs, trends from offloading can carry over to single-device multi-processor architectures. An emerging approach is the use of a nearsensor unit with pre-processing and compression layers (the encoder) and a central unit processing the bulk of computation (the decoder) (GOMEZ et al, 2022;DONG et al, 2022). According to Abrash (2021), this is pointed as key HW/SW evolution for enabling AR devices because the communication between sensor and compute elements can be the most significant factor in energy consumption.…”
Section: Offloading For Augmented Realitymentioning
confidence: 99%
“…Compression is usually achieved via the addition of an extraneous reduction component, which invalidates the assumption of a fixed architecture. One proposed solution is to use the ratio between encoder and decoder as a metric (DONG et al, 2022), as the tradeoff between computation and precision in the whole model should be monotonic.…”
Section: The Direct-inverse Tradeoffmentioning
confidence: 99%