Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication 2013
DOI: 10.1145/2448556.2448616
|View full text |Cite
|
Sign up to set email alerts
|

Document page retrieval based on geometric layout features

Abstract: Today, the keyword retrieval method is most standard and popular, and has been widely used in many applications. However, even the keyword retrieval method cannot always satisfy various types of information search subjects, because various kinds of information resources such as image data, graphics data, etc. must be managed in multi-media society, in addition to the worddependent information. Of course, the methods which are more or less applicable to the characteristics of data resources such as structure, d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…This can affect the results because the implementation of the library used deviates from the environment of web browsers used by users. The importance of layout information of websites in information retrieval can be seen in papers [32]- [34]. The paper [32] uses the properties of div and table DOM elements.…”
Section: Information Retrievalmentioning
confidence: 99%
See 1 more Smart Citation
“…This can affect the results because the implementation of the library used deviates from the environment of web browsers used by users. The importance of layout information of websites in information retrieval can be seen in papers [32]- [34]. The paper [32] uses the properties of div and table DOM elements.…”
Section: Information Retrievalmentioning
confidence: 99%
“…The main conclusions in the paper are that people have consistent decisions about which blocks are essential and that, in addition to spatial features, better results can be obtained by integrating the content properties of the pages. Finally, the paper [34] presents a prototype tool that uses information about the layout of elements on web pages in the form of images. The disadvantage of the last paper is that the tool has not been adequately tested on actual data.…”
Section: Information Retrievalmentioning
confidence: 99%
“…Of course, the search method that we should investigate here does not replace the ordinary keyword search method, but is desirable to be complimentarily used by some persons who cannot operate the keyword search method effectually or is useful when some persons cannot make use of keyword search successfully in case that they forgot suddenly the appropriate technical-terms or they are unfamiliar to the keywords. Our first idea is to focus on the geometric layout structure in 2-dimensional document page [22,23]. Then, our second idea is to use a relevant/irrelevant feedback control mechanism with a view to selecting a more suitable page from a set of ones which were retrieved in the first step [11,20].…”
Section: Figure 1: Example Of Pages In Japanese Bookmentioning
confidence: 99%
“…Namely, we consider how to attain our objective through the above 4 technical issues successively: indexing focused on position; similarity based on physical measurement; automatic extraction of technicalterms/keywords; and relevant/irrelevant feedback control. These issues have already been investigated individually as our research projects: [22] in 1), [13,22] in 2), [15] in 3), and [11,20] in 4) are ours.…”
Section: Frameworkmentioning
confidence: 99%
See 1 more Smart Citation