ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683142
|View full text |Cite
|
Sign up to set email alerts
|

Seeing through Sounds: Predicting Visual Semantic Segmentation Results from Multichannel Audio Signals

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
19
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(20 citation statements)
references
References 11 publications
0
19
0
Order By: Relevance
“…Furthermore, Binaural sounds are different from general stereo sounds. Another similar work to ours is done by Irie et al [52]. While their goal is also to predict semantic segmentation results from multichannel audio signals, our method differs significantly: 1) our method considers multiple semantic classes, 2) our method works in unconstrained real outdoor environment instead of a controlled lab setting, and 3) our method addresses multiple tasks.…”
Section: Related Workmentioning
confidence: 98%
See 1 more Smart Citation
“…Furthermore, Binaural sounds are different from general stereo sounds. Another similar work to ours is done by Irie et al [52]. While their goal is also to predict semantic segmentation results from multichannel audio signals, our method differs significantly: 1) our method considers multiple semantic classes, 2) our method works in unconstrained real outdoor environment instead of a controlled lab setting, and 3) our method addresses multiple tasks.…”
Section: Related Workmentioning
confidence: 98%
“…Therefore, human class is not considered in this work. We would like to point out that due to these challenges, many of the recent works [6], [52] only deals with one class in real world environment. Another potential future direction is to investigate the usefulness of Harmonic Convolutional Networks (HCN) [82] for our auditory tasks.…”
Section: Limitations and Future Workmentioning
confidence: 99%
“…This is because some classes are very rare in the middle-sized dataset, which already takes a great deal of effort to create. In comparison, the recent works [19,28] only deals with one class in real world environment. To our best knowledge, we are the first to work with multiple classes in an unconstrained real environment.…”
Section: Limitations and Future Workmentioning
confidence: 99%
“…This is because some classes are very rare in the middle-sized dataset. Due to this challenge, the recent works [19,29] only deals with one class in real world environment.…”
Section: Limitations and Future Workmentioning
confidence: 99%
See 1 more Smart Citation