2015
DOI: 10.1016/j.is.2015.01.011
|View full text |Cite
|
Sign up to set email alerts
|

Similarity sets: A new concept of sets to seamlessly handle similarity in database management systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
3

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 31 publications
0
6
0
3
Order By: Relevance
“…Many researchers have been proposing strategies to support similarity comparison in Relational Database Management Systems -RDBMS [Silva et al 2015, Pola et al 2015, Budíková et al 2012, Belohlavek and Vychodil 2010, commonly by extending Relational Operators. The vast majority of them focuses on the Selection [Silva et al 2013] in which similarity awareness is achieved by means of range queries, nearest neighbors queries, and their many variants.…”
Section: Basic Concepts and Related Workmentioning
confidence: 99%
“…Many researchers have been proposing strategies to support similarity comparison in Relational Database Management Systems -RDBMS [Silva et al 2015, Pola et al 2015, Budíková et al 2012, Belohlavek and Vychodil 2010, commonly by extending Relational Operators. The vast majority of them focuses on the Selection [Silva et al 2013] in which similarity awareness is achieved by means of range queries, nearest neighbors queries, and their many variants.…”
Section: Basic Concepts and Related Workmentioning
confidence: 99%
“…For example, range and nearest queries are unary operators that retrieve similar elements based on a list of parameters, such as the query center, radius or the number of nearest neighbors to find. Another class of unary operator can be employed to data set extraction, such as the SimSet [1] extraction technique that filters a data set by eliminating near-duplicates based on a given radius threshold.…”
Section: Introductionmentioning
confidence: 99%
“…Today, many real data sets include, besides the traditional numeric values and small texts, more complex data objects such as images, audio files, videos, time series, genetic data elements, large graphs, long texts, fingerprints, and many others (POLA et al, 2013;ZEZULA et al, 2006;SILVA et al, 2010). One central distinction between traditional and complex data is that the latter must be compared by similarity, since comparisons by identity (=) are in most cases senseless and/or unfeasible for data of a more complex nature (MARRI et al, 2014;MARRI et al, 2016;JACOX;SAMET, 2008;KALASHNIKOV, 2013;POLA et al, 2015;SILVA et al, 2013;SILVA et al, 2010;TANG et al, 2016a). To illustrate this fact, let us consider again the division query about cities and crops, but now using a more realistic variation of our toy dataset in which we do not have cities carefully partitioned into regions and neither textual tags ready to be used to describe each region.…”
Section: Problem and Motivationmentioning
confidence: 99%
“…Many researchers have been proposing strategies to support similarity comparison in Relational Database Management Systems -RDBMS (SILVA et al, 2010;POLA et al, 2013;BUDÍKOVÁ;ZEZULA, 2012;BARIONI et al, 2009;BELOHLAVEK;VYCHODIL, 2010), commonly by means of extending operators of the Relational Algebra. For example, recent works focus on the Join (SILVA et al, 2015; KALASHNIKOV, 2013; SILVA; PEARSON, 2012; SILVA; AREF; ALI, 2010), Selection (SILVA et al, 2013;SANTOS et al, 2013), Grouping and Aggregation (TANG et al, 2016a;TANG et al, 2016b;ALI, 2009), Union (POLA et al, 2015;MARRI et al, 2016), Intersection (POLA et al, 2015MARRI et al, 2014;MARRI et al, 2016) and Difference (POLA et al, 2015;MARRI et al, 2016). However, to the best of our knowledge, no one focuses on the Division.…”
Section: Problem and Motivationmentioning
confidence: 99%
See 1 more Smart Citation