Static Visual Spatial Priors for DoA Estimation

Swietojanski, Paweł; Mikšík, Ondřej

doi:10.1109/icassp40776.2020.9053825

Search citation statements

Order By: Relevance

Paper Sections

Select...

Software1

Semantic Scene Understanding1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2020

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(2 citation statements)

References 43 publications

(49 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…5 B). When presence of a user is detected and the device has some update ready, or a voicetrigger is spotted, the device wakes up and faces the user [47], starting to process at the same time the audio-visual data. This, depending on the compute requirements can happen either on device or in the cloud (Fig.…”

Section: Softwarementioning

confidence: 99%

“…Additionally, we estimate direction of arrival (DOA) θ s for each of the detected sounds ′ s ′ using a set of DOA estimates from the raw signal (as many as detected acoustic events at each given time step), which are then mapped to x, y, z coordinates 3 . This process can leverage an additional semantic information from vision stream, as shown in [47]. The most likely pairs {acoustic_event, θ s } for co-occurring events are estimated in the spatial model using visual data.…”

Section: Semantic Scene Understandingmentioning

confidence: 99%

See 1 more Smart Citation