Structural extraction from visual layout of documents

Rosenfeld, Binyamin; Feldman, Ronen; Aumann, Yonatan

doi:10.1145/584792.584828

Cited by 16 publications

(10 citation statements)

References 4 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…al. [19] are based on templates that characterize each part of the document. These templates are either extracted manually or semi-automatically.…”

Section: Related Workmentioning

confidence: 99%

“…al. [19] devised a learning algorithm to extract information (author, title, date, etc) that relies on a general procedure for structural extraction. Their proposed technique enables the automatic extraction of entities from the document based on their visual characteristics and relative position in the document layout.…”

Section: Related Workmentioning

confidence: 99%

“…Zhuang [25] present a method for cross-media retrieval, in which cross-media features are integrated with multimedia data via a cross-reference graph model so as to improve retrieval accuracy progressively by learning associations between objects present in the model. We note that these two works have improved on the work in [19] by identifying the relations between different media objects . But whereas these approaches need to be trained using supervised structure learning algorithms, our canonical representation of documents is completely unsupervised and generic enough to support a wide range of multimedia data types.…”

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Web news categorization using a cross-media document graph

Iria

Ciravegna

Magalhães

2009

Proceedings of the ACM International Conference on Image and Video Retrieval

View full text Add to dashboard Cite

In this paper we propose a multimedia categorization framework that is able to exploit information across different parts of a multimedia document (e.g., a Web page, a PDF, a Microsoft Office document). For example, a Web news page is composed by text describing some event (e.g., a car accident) and a picture containing additional information regarding the real extent of the event (e.g., how damaged the car is) or providing evidence corroborating the text part. The framework handles multimedia information by considering not only the document's text and images data but also the layout structure which determines how a given text block is related to a particular image. The novelties and contributions of the proposed framework are: (1) support of heterogeneous types of multimedia documents; (2) a documentgraph representation method; and (3) the computation of crossmedia correlations. Moreover, we applied the framework to the tasks of categorising Web news feed data, and our results show a significant improvement over a single-medium based framework.

show abstract

“…al. [19] are based on templates that characterize each part of the document. These templates are either extracted manually or semi-automatically.…”

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Web news categorization using a cross-media document graph

Iria

Ciravegna

Magalhães

2009

Proceedings of the ACM International Conference on Image and Video Retrieval

View full text Add to dashboard Cite

show abstract

“…The document layout and extracted cross-references (e.g., captions) may suggest how each text segment relates to each image, examples include (Arasu and Garcia-Molina 2003;Crescenzi et al 2001;Rosenfeld et al 2002). Arasu and Garcia-Molina (2003), Crescenzi et al (2001) and Rosenfeld et al (2002) approaches are based on (manually or semi-automatically extracted) templates that characterise each part of the document. Rosenfeld et al (2002) implement a learning algorithm to extract information such as the author, title and date.…”

Section: Semi-automated Knowledge Acquisitionmentioning

confidence: 99%

Applying semantic web technologies to knowledge sharing in aerospace engineering

et al. 2008

View full text Add to dashboard Cite

This paper details an integrated methodology to optimise knowledge reuse and sharing, illustrated with a use case in the aeronautics domain. It uses ontologies as a central modelling strategy for the capture of knowledge from legacy documents via automated means, or directly in systems interfacing with knowledge workers, via user-defined, webbased forms. The domain ontologies used for knowledge capture also guide the retrieval of the knowledge extracted from the data using a semantic search system that provides support for multiple modalities during search. This approach has been applied and evaluated successfully within the aero-

show abstract

“…Rosenfeld et al [6] and Zhai et al [9] suggested a structure extraction method for PDF and Web documents using probabilistic approaches, such as the machine-learning and tree-graph-matching algorithms, respectively. These approaches need to prepare a large amount of annotated data, and the models made from the data are dependent on the data.…”

Section: Introductionmentioning

confidence: 99%

Structure Extraction from Presentation Slide Information

Hayama

Nanba

Kunifuji

2008

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. Electronic presentations are used in numerous scenarios, such as lectures and meetings. In recent years, the widespread use of electronic presentations means that presentation slide data is increasing as one of industry's most important information resources. Therefore, it is necessary to develop a practical usage method for the reutilisation of the data on slides. An approach to achieve this is to focus on visual structure information within a slide, because visual structure information is one of the most valuable, easy to understand methods for humans. However, since visual structure information is not explicitly defined in the slide data itself, computers have difficulty comprehending structure information directly. In this paper, we propose a method of extracting structure information from slide information. The proposed method is composed of two steps: organising objects within the slide as units, such as title, body text, figure and table, and structuring the units as a hierarchy tree based on a top-down approach.

show abstract

Structural extraction from visual layout of documents

Cited by 16 publications

References 4 publications

Web news categorization using a cross-media document graph

Web news categorization using a cross-media document graph

Applying semantic web technologies to knowledge sharing in aerospace engineering

Structure Extraction from Presentation Slide Information

Contact Info

Product

Resources

About