2022
DOI: 10.37965/jait.2022.0145
|View full text |Cite
|
Sign up to set email alerts
|

HSCA-Net: A Hybrid Spatial-Channel Attention Network in Multi-Scale Feature Pyramid for Document Layout Analysis

Abstract: Document images often contain various page components and complex logical structures, which makes document layout analysis task challenging. For most deep learning based document layout analysis methods, convolutional neural networks (CNNs) are adopted as the image feature extraction networks. In this paper, a hybrid spatial-channel attention network (HSCA-Net) is proposed to improve feature extraction capability by exerting attention mechanism to explore more salient properties within document pages. The HSCA… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 31 publications
0
1
0
Order By: Relevance
“…Thereby, some end-to-end algorithms [17][18][19][20][21] were proposed to convert inputs of text line images into outputs of label sequences without the pre-extraction features. The convolutional recurrent neural network (CRNN) [19] succeeded in English text sequence labeling.…”
Section: Introductionmentioning
confidence: 99%
“…Thereby, some end-to-end algorithms [17][18][19][20][21] were proposed to convert inputs of text line images into outputs of label sequences without the pre-extraction features. The convolutional recurrent neural network (CRNN) [19] succeeded in English text sequence labeling.…”
Section: Introductionmentioning
confidence: 99%