2019
DOI: 10.11591/ijece.v9i6.pp5463-5470
|View full text |Cite
|
Sign up to set email alerts
|

A performance of comparative study for semi-structured web data extraction model

Abstract: <span lang="EN-US">The extraction of information from multi-sources of web is an essential yet complicated step for data analysis in multiple domains. In this paper, we present a data extraction model based on visual segmentation, DOM tree and JSON approach which is known as Wrapper Extraction of Image using DOM and JSON (WEIDJ) for extracting semi-structured data from biodiversity web. The large number of information from multiple sources of web which is image’s information will be extracted using three… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…This method is useful in handling the structure of data, whether it is structured, semi-structured or unstructured. The second part is related to the knowledge based 1 shows general models for three web data extraction models; DOM [23], WHDJ [24] and WEIDJ [25]. In addition to the basic capabilities of WEIDJ, our extractor also provides several other useful and user's friendly features.…”
Section: Methodsmentioning
confidence: 99%
“…This method is useful in handling the structure of data, whether it is structured, semi-structured or unstructured. The second part is related to the knowledge based 1 shows general models for three web data extraction models; DOM [23], WHDJ [24] and WEIDJ [25]. In addition to the basic capabilities of WEIDJ, our extractor also provides several other useful and user's friendly features.…”
Section: Methodsmentioning
confidence: 99%
“…To support data-sharing to people (user), metadata can help designing powerful features for information search, such as query by title and query by author and so forth [33]- [36]. Then, to enable data-sharing among systems, metadata elements with similar values are connected to provide complete information about biodiversity [37]. The workflow of proposed data-sharing is shown in Figure 6 below.…”
Section: A Data Managementmentioning
confidence: 99%
“…The Document Object Model (DOM) is a programming API for HTML and XML documents. People can create and build documents using DOM [15]. Besides that this model can be used to manipulate elements and contents of HTML and XML documents such as add, modify or delete [13,16].…”
Section: A Dom Treementioning
confidence: 99%