2015
DOI: 10.2498/cit.1002509
|View full text |Cite
|
Sign up to set email alerts
|

Crowdsourcing for Query Processing onWeb Data: A Case Study on the Skyline Operator

Abstract: In recent years, crowdsourcing has become a powerful tool to bring human intelligence into information processing. This is especially important for Web data which in contrast to well-maintained databases is almost always incomplete and may be distributed over a variety of sources. Crowdsourcing allows to tackle many problems which are not yet attainable using machine-based algorithms alone: in particular, it allows to perform database operators on incomplete data as human workers can be used to provide values … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(16 citation statements)
references
References 19 publications
0
16
0
Order By: Relevance
“…The crowd-sourcing strategy is based on incorporating human workers to attain improved results, while the advanced heuristic offers an alternative offline solution for times when crowd-sourcing may not be a feasible option, for example when the missing data are not easily available for the crowd or the costs of crowd-sourcing are prohibitive. We conclude that the approach introduced in [4] has failed to generate an accurate value when the missing rate is very high. Besides, the approach incurred high time latency and monetary cost when estimating the missing values from the crowd.…”
Section: Introductionmentioning
confidence: 91%
See 3 more Smart Citations
“…The crowd-sourcing strategy is based on incorporating human workers to attain improved results, while the advanced heuristic offers an alternative offline solution for times when crowd-sourcing may not be a feasible option, for example when the missing data are not easily available for the crowd or the costs of crowd-sourcing are prohibitive. We conclude that the approach introduced in [4] has failed to generate an accurate value when the missing rate is very high. Besides, the approach incurred high time latency and monetary cost when estimating the missing values from the crowd.…”
Section: Introductionmentioning
confidence: 91%
“…However, the correctness of the skyline might be deteriorated when relying on heuristics rules to identify the skylines. It may be that certain dominated tuples are included in the skyline results (false positive) and/or certain tuples which should be included in the skyline results are omitted (false negative) [4]. Furthermore, it is most likely that the relative error between the actual and the estimated values become very high if heuristics rules cannot capture the semantic relationship between the attributes; also if the missing rate in the database is high as it impacts the quality of the estimated values.…”
Section: B Skyline Queries On Crowd-sourced-enabled Incomplete Databmentioning
confidence: 99%
See 2 more Smart Citations
“…For Pareto models, PDP and PCP are not mutually expressive. While Pareto orders are widely studied in fields like voting theory [10] (unanimity), allocation problems [1] (Pareto optimality), decision making, database queries [2,7] (skyline operator) and economics (Pareto efficiency), there exists no general study of PDP or PCP based on Pareto orders so far. Pareto orders give a natural way of comparing alternatives; one alternative is better than another if it is better on all relevant evaluation functions (different criteria by which the alternatives can be evaluated).…”
Section: Introductionmentioning
confidence: 99%