2022
DOI: 10.1167/jov.22.14.3098
|View full text |Cite
|
Sign up to set email alerts
|

How do early blind individuals experience auditory motion?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 0 publications
0
12
0
Order By: Relevance
“…Finally, as described in [22], the previous processing is considered as a type of low-pass filter that gradually restores input image's information at lowfrequency signals through the self-attention mechanism at different spatial dimensions. Since it tends to reduce high-frequency signals, a 7×7 convolutional layer is used as a high-pass filter to complement the high-frequency signals.…”
Section: Network Architecturementioning
confidence: 99%
“…Finally, as described in [22], the previous processing is considered as a type of low-pass filter that gradually restores input image's information at lowfrequency signals through the self-attention mechanism at different spatial dimensions. Since it tends to reduce high-frequency signals, a 7×7 convolutional layer is used as a high-pass filter to complement the high-frequency signals.…”
Section: Network Architecturementioning
confidence: 99%
“…We notice that the vertical hybrid layout design leads to unsatisfactory performance in detail recovery. A possible reason is that MSAs at the end of each stage act as spatial smoothing and aggregation [20], thus neglecting details unavoidably. To this end, we propose the I-PLDE module, a parallel branch emphasizing local detail on the top of the vertical hybrid design, inspired by the "divide-and-conquer" idea in [30,32].I-PLDE consists of a 1x1 convolution to match hidden dimension with its parallel branch, three stacked depth-wise convolution layers and an window embedding operation.…”
Section: Dudornext: Towards Hybridizing Cnns and Vitsmentioning
confidence: 99%
“…Finally, DuDoRNet generates (sometimes wrong) details of higher frequency, which are seen especially in soft tissues; while Dual-SwinIR generates a smoother image. In recent works in decomposing Transformer from the basic theory [20] to empirical network design [18,28], a potential direction for modernizing deep learning models arises: hybridizing CNNs and ViTs. While several work [30,32,20] in computer vision show the effectiveness of hybrid structures, there is no research systematically studying a hybrid model for MRI reconstruction.…”
Section: Introductionmentioning
confidence: 99%
“…23 The visual transformer backbone was chosen over a convolutional backbone since convolutional models are considered more vulnerable to high-frequency noise. 24 This is motivated by earlier empirical findings and the concept that convolutional filters are mainly sensitive to textures (high-pass filter), while the multi-head self-attentions in visual transformers capture distant semantic relevances in an image effectively (low-pass filter). We hypothesize that the gap between our real and synthetic data is predominantly found in the textures of objects and less in their shape.…”
Section: Modelmentioning
confidence: 99%