ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414416
|View full text |Cite
|
Sign up to set email alerts
|

Compressing Local Descriptor Models for Mobile Applications

Abstract: Feature-based image matching has been significantly improved through the use of deep learning and new large datasets. However, there has been little work addressing the computational cost, model size, and matching accuracy tradeoffs for the state of the art models. In this paper, we consider these practical aspects and improve the state-of-theart HardNet model through the use of depthwise separable layers and an efficient tensor decomposition. We propose the Convolution-Depthwise-Pointwise (CDP) layer, which p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…This has led to the development of optimized GPU kernels that bridge the gap between the theoretical FLOP improvements and the practical on-device latency. Both CPdecomposition [17] and Tucker decomposition [45] have also been used to construct or compress pre-trained models [21,23,35]. Another line of work has explored the use of tensor networks as a mathematical framework for generalising tensor decomposition in the context of deep learning [14,46].…”
Section: A Related Workmentioning
confidence: 99%
“…This has led to the development of optimized GPU kernels that bridge the gap between the theoretical FLOP improvements and the practical on-device latency. Both CPdecomposition [17] and Tucker decomposition [45] have also been used to construct or compress pre-trained models [21,23,35]. Another line of work has explored the use of tensor networks as a mathematical framework for generalising tensor decomposition in the context of deep learning [14,46].…”
Section: A Related Workmentioning
confidence: 99%
“…Moreover, we show that knowledge of the object of interest allow ours and existing methods to be built with far fewer parameters without any degradation in performance. The result is a method that can be deployed in resource-limited systems, a line of research that is actively pursued by the scientific community [5,6,7]. To the best of our knowledge, our method is the first that uses an input from a single timestep to directly forecast the key pose the instant the pick or place action is performed, and the time taken to get to the key pose (Figure 1).…”
Section: Introductionmentioning
confidence: 99%