Proceedings of the Fifth ACM International Workshop on Web Information and Data Management - WIDM '03 2003
DOI: 10.1145/956718.956719
|View full text |Cite
|
Sign up to set email alerts
|

Finding similar identities among objects from multiple web sources

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2004
2004
2015
2015

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…And also check for its typographical errors when comparing two XML elements. The concern with accuracy was later approached by Carvalho and da Silva, in [12]. Although not specifically focused on XML, their work proposes a solution to the problem of integrating treestructured data extracted from the web.…”
Section: IIImentioning
confidence: 99%
“…And also check for its typographical errors when comparing two XML elements. The concern with accuracy was later approached by Carvalho and da Silva, in [12]. Although not specifically focused on XML, their work proposes a solution to the problem of integrating treestructured data extracted from the web.…”
Section: IIImentioning
confidence: 99%
“…These differences arise because data is affected by many factors, such as external noise, human subjectivity, and un-calibrated measuring tools. While some systems attempt to match features by introspecting their properties [18], we avoid exhaustive attribute comparisons as they tend to increase computational complexity when many geographic features are present. To establish an efficient and effective representation of semantic relationships, we define semantic footprints and their components in the subsections below.…”
Section: Concept Of Semantic Footprintsmentioning
confidence: 99%
“…They also focus on efficiency and effectiveness of their filtering techniques, but not on the effectiveness of the actual duplicate detection. The only approach we are aware of that considers recall and precision of their XML similarity joins is [4]. They present four different strategies to define the similarity function using the vector space model.…”
Section: Object Identification For Hierarchical and Xml Datamentioning
confidence: 99%