Proceedings 18th International Conference on Data Engineering
DOI: 10.1109/icde.2002.994725
|View full text |Cite
|
Sign up to set email alerts
|

Attribute classification using feature analysis

Abstract: The basis of many systems that integrate data from multiple sources is a set of correspondences between source schemata and a target schema. Correspondences express a relationship between sets of source attributes, possibly from multiple sources, and a set of target attributes. Clio is an integration tool that assists users in dening value correspondences between attributes [1].In real life scenarios there may be many sources and the source relations may have many attributes. Users can get lost and might miss … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 1 publication
0
14
0
Order By: Relevance
“…In their scheme, the data is effectively treated as categorical. Other related works in this area include the work of He, Chang and Han [11] on schema matching for the deep web and work on using distributional signatures for value mapping by Kang et al [12] and Naumann et al [13]. For related work in the Al community, we refer the reader to the survey by Doan and Halevy [2].…”
Section: G Our Contributionsmentioning
confidence: 99%
“…In their scheme, the data is effectively treated as categorical. Other related works in this area include the work of He, Chang and Han [11] on schema matching for the deep web and work on using distributional signatures for value mapping by Kang et al [12] and Naumann et al [13]. For related work in the Al community, we refer the reader to the survey by Doan and Halevy [2].…”
Section: G Our Contributionsmentioning
confidence: 99%
“…Previous generic tools include Cupid, SimilarityFlooding, and Clio [34][35][36]. As discussed, COMA++ is also generic and supports several schema languages, including XSD, OWL, and relational schemas, and it is able to deal with complex distributed XML schemas.…”
Section: Previous Solutions Vs Coma++mentioning
confidence: 99%
“…Many schema matching systems perform data profiling to create attribute features, such as data type, average value length, and patterns, to compare feature vectors and align those attributes with the best matching ones [98,109].…”
Section: Use-cases For Data Profilingmentioning
confidence: 99%