2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00817
|View full text |Cite
|
Sign up to set email alerts
|

Beyond Image to Depth: Improving Depth Prediction using Echoes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
17
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 33 publications
(18 citation statements)
references
References 32 publications
1
17
0
Order By: Relevance
“…Further, recently Liu et al [46] showed that, the local aggregation operators used in various point cloud processing techniques, if carefully tuned, provide similar performances. Parida et al [47] showed that using region/point based properties from echoes or type of material can help in learning more robust representations. Therefore, the proposed modifications, with explicit prior on geometry, can be extended to other point cloud based deep networks and potentially motivate future works with simpler and more efficient networks for processing point clouds.…”
Section: Discussionmentioning
confidence: 99%
“…Further, recently Liu et al [46] showed that, the local aggregation operators used in various point cloud processing techniques, if carefully tuned, provide similar performances. Parida et al [47] showed that using region/point based properties from echoes or type of material can help in learning more robust representations. Therefore, the proposed modifications, with explicit prior on geometry, can be extended to other point cloud based deep networks and potentially motivate future works with simpler and more efficient networks for processing point clouds.…”
Section: Discussionmentioning
confidence: 99%
“…On the same line of thought researchers has unified VAE with variant of transformers for various other applications such as story generation [28], response generation [55], sentiment analysis [29], and 3D human pose generation [56]. Recently, audio and visual modalities have been used jointly to improve various tasks such as zero-shot learning [57], depth estimation [58] etc.…”
Section: Previous Workmentioning
confidence: 99%
“…Audio-Visual Learning: Recent research bridges the audio and vision for various cross-model learning tasks. Some have achieved remarkable performance in audio-visual action recognition [52,40,58], audio-visual correspondence [8,6,7], audio-visual synchronization [68,55,102], visual sound separation [31,93,92,99,39,100,79,101], visual to auditory [98,36,96,71,86], audio spatialisation [23,88,67,96,38,71,86,69], and audio-visual navigation [16,17,18,37,27,15,63]. In this work, we leverage audio-visual learning for better perceiving the geometrical structure of environment.…”
Section: Related Workmentioning
confidence: 99%
“…Several animal species, such as bats, dolphins, and some nocturnal birds, perceive spatial layout and locate objects through echolocation [73,38,69]. By using two ears to receive spatial sound, one can determine the objects' location by the Interaural Time Difference (ITD) and Interaural Level Difference (ILD).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation