2023
DOI: 10.1109/tpami.2022.3162528
|View full text |Cite
|
Sign up to set email alerts
|

SegBlocks: Block-Based Dynamic Resolution Networks for Real-Time Segmentation

Abstract: SegBlocks reduces the computational cost of existing neural networks, by dynamically adjusting the processing resolution of image regions based on their complexity. Our method splits an image into blocks and downsamples blocks of low complexity, reducing the number of operations and memory consumption. A lightweight policy network, selecting the complex regions, is trained using reinforcement learning. In addition, we introduce several modules implemented in CUDA to process images in blocks. Most important, ou… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 65 publications
0
1
0
Order By: Relevance
“…As a result, structured pruning frameworks are the preferred option when aiming to accelerate inference on general purpose hardware. A body of work across structured and unstructured pruning methods, attempts to induce structure in otherwise randomly sparse networks S. Gray & Kingma (2017); Ren et al (2018); Wen et al (2020); Verelst & Tuytelaars (2020). This is often referred to as block sparsity and consists in subdividing the matrix representations of inputs or weights into tiles (e.g.…”
Section: Related Workmentioning
confidence: 99%
“…As a result, structured pruning frameworks are the preferred option when aiming to accelerate inference on general purpose hardware. A body of work across structured and unstructured pruning methods, attempts to induce structure in otherwise randomly sparse networks S. Gray & Kingma (2017); Ren et al (2018); Wen et al (2020); Verelst & Tuytelaars (2020). This is often referred to as block sparsity and consists in subdividing the matrix representations of inputs or weights into tiles (e.g.…”
Section: Related Workmentioning
confidence: 99%