2019
DOI: 10.48550/arxiv.1904.07933
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Audio-Visual Model Distillation Using Acoustic Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…These may imply that, by the definition of learnability [12], the task is not a fully learnable problem only with unsupervised data in our setting, which is static-image based single-channel audio source localization, but can be fixed with even a small amount of relevant prior knowledge. Although the sound localization task is not effectively addressed with our unsupervised learning approach with static images and mono audios, other methods that use spatial microphones [25], [53], [54], [55] or temporal information, motion [8] and synchronization [18], with multiple frames have been shown to perform well on this task with unsupervised algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…These may imply that, by the definition of learnability [12], the task is not a fully learnable problem only with unsupervised data in our setting, which is static-image based single-channel audio source localization, but can be fixed with even a small amount of relevant prior knowledge. Although the sound localization task is not effectively addressed with our unsupervised learning approach with static images and mono audios, other methods that use spatial microphones [25], [53], [54], [55] or temporal information, motion [8] and synchronization [18], with multiple frames have been shown to perform well on this task with unsupervised algorithms.…”
Section: Discussionmentioning
confidence: 99%