2020
DOI: 10.1016/j.dsp.2019.102591
|View full text |Cite
|
Sign up to set email alerts
|

DeepBinaryMask: Learning a binary mask for video compressive sensing

Abstract: In this paper, we propose a novel encoder-decoder neural network model referred to as DeepBinaryMask for video compressive sensing. In video compressive sensing one frame is acquired using a set of coded masks (sensing matrix) from which a number of video frames is reconstructed, equal to the number of coded masks. The proposed framework is an end-to-end model where the sensing matrix is trained along with the video reconstruction. The encoder learns the binary elements of the sensing matrix and the decoder is… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
38
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(46 citation statements)
references
References 34 publications
(61 reference statements)
2
38
0
Order By: Relevance
“…Our approach is motivated by recent advances in hardware, autodifferentiation tools, and optimization algorithms for deep learning. While some recent work investigates joint optimization of either binary masks or color filter arrays with neural network post-processing for video compressed sensing or demosaicking [Chakrabarti 2016;Iliadis et al 2016], they do not consider the optimization of phase-modulating optical elements such as lenses and do not consider diffraction in their forward model. With this work, we thus take first steps towards utilizing the full potential of end-to-end optimization for computational camera design.…”
Section: Related Workmentioning
confidence: 99%
“…Our approach is motivated by recent advances in hardware, autodifferentiation tools, and optimization algorithms for deep learning. While some recent work investigates joint optimization of either binary masks or color filter arrays with neural network post-processing for video compressed sensing or demosaicking [Chakrabarti 2016;Iliadis et al 2016], they do not consider the optimization of phase-modulating optical elements such as lenses and do not consider diffraction in their forward model. With this work, we thus take first steps towards utilizing the full potential of end-to-end optimization for computational camera design.…”
Section: Related Workmentioning
confidence: 99%
“…For compressive video recovery, Iliadis etal. [23], [24] propose Deep Fully-Connected Network, where the encoder learns binary sensing mask and the decoder determines the reconstruction of the video. These approaches effectively avoid the expensive computation in traditional approaches and have achieved promising image/video reconstruction performance.…”
Section: Related Workmentioning
confidence: 99%
“…It permits the removal of small regions that are disjoint forms the larger objects without distorting the small features within the large objects. Neural networks have been proposed and a variety of works in binary reconstruction of grayscale images [45,46]. This technique is mainly used in X-ray images and MRI images and helps in the detection of cancer.…”
Section: Binary Reconstruction Of Gray Scale and Nonlinear Processingmentioning
confidence: 99%