2012
DOI: 10.14778/2367502.2367533
Blink and it's done

Abstract: In this demonstration, we present BlinkDB, a massively parallel, sampling-based approximate query processing framework for running interactive queries on large volumes of data. The key observation in BlinkDB is that one can make reasonable decisions in the absence of perfect answers. BlinkDB extends the Hive/HDFS stack and can handle the same set of SPJA (selection, projection, join and aggregate) queries as supported by these systems. BlinkDB provides real-time answers along with statistical error guarantees,…
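The "real-time answers along with statistical error guarantees" that the abstract describes can be sketched with a uniform sample and a CLT-based confidence interval. This is a minimal illustration of the general technique, not BlinkDB's implementation; the function name and parameters are assumptions.

```python
import random
import statistics

def approx_avg(data, sample_frac=0.1, z=1.96, seed=0):
    """Estimate AVG(data) from a uniform sample, returning the
    estimate and a ~95% confidence half-width (CLT-based)."""
    rng = random.Random(seed)
    n = max(2, int(len(data) * sample_frac))
    sample = rng.sample(data, n)
    est = statistics.fmean(sample)
    # standard error of the sample mean, scaled by the z-score
    err = z * statistics.stdev(sample) / n ** 0.5
    return est, err

data = list(range(1_000_000))      # true mean = 499999.5
est, err = approx_avg(data)
print(f"AVG \u2248 {est:.1f} \u00b1 {err:.1f}")
```

Reading only a fraction of the data trades a small, quantified error for a large reduction in work, which is the core bargain of sampling-based approximate query processing.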

Cited by 70 publications (4 citation statements) · References 4 publications
“…Beyond these technique-specific solutions, another way to remedy the effects of tailored and, therefore, potentially skewed samples is to combine the chunks of tailorable sampling with a "baseline sampling" that remains constant throughout the analysis. A similar idea is used in BlinkDB [1], where both a uniform sample and a set of stratified samples are maintained. Here, combining multiple samples provides "tighter approximation errors" and "significantly reduces [...] the subset error".…”
Section: Impact After Sampling
confidence: 99%
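The quoted passage describes BlinkDB [1] maintaining both a uniform sample and a set of stratified samples. A minimal sketch of that idea follows; the function, its parameters, and the cap-per-group policy are illustrative assumptions, not BlinkDB's actual sample-creation algorithm.

```python
import random
from collections import defaultdict

def build_samples(rows, key, uniform_frac=0.1, cap_per_group=50, seed=0):
    """Maintain two samples over the same data: a uniform sample that
    preserves the overall distribution, and a stratified sample that
    caps each group so rare groups are guaranteed representation."""
    rng = random.Random(seed)
    uniform = [r for r in rows if rng.random() < uniform_frac]
    strata = defaultdict(list)
    for r in rows:
        strata[key(r)].append(r)
    stratified = {g: rng.sample(rs, min(cap_per_group, len(rs)))
                  for g, rs in strata.items()}
    return uniform, stratified

rows = [("common", i) for i in range(10_000)] + [("rare", i) for i in range(20)]
uniform, stratified = build_samples(rows, key=lambda r: r[0])
# every "rare" row survives in the stratified sample, while the
# uniform sample keeps the overall common/rare proportions
print(len(stratified["rare"]), len(stratified["common"]))
```

A query over a rare group can then be answered from the stratified sample, while the uniform sample serves as the constant "baseline" the passage mentions.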
“…Depending on what task an analyst wants to perform on that data, there are different ways for how to make the sampling of that data most useful: For example, to gain an initial overview of the data, it makes sense to draw a uniform sample that helps depict the distribution of all three attributes. In the sample depicted in subfigure (1), the densely populated region in the center of the plot stands out. On the other hand, to analyze the local distribution of the Boolean attribute, it is more useful to sample the data along a regular grid, such that the density in each grid cell is even throughout the sample, which puts the focus on the Boolean attribute.…”
Section: Introduction
confidence: 99%
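The grid-based sampling the passage contrasts with uniform sampling can be sketched as follows: cap the number of points kept per grid cell so density is evened out across the sample. This is an illustrative sketch only; the function name, cell size, and per-cell cap are assumptions.

```python
import random

def grid_sample(points, cell=1.0, per_cell=2, seed=0):
    """Keep at most `per_cell` points from each grid cell, evening
    out density so sparse regions are not drowned out by dense ones."""
    rng = random.Random(seed)
    cells = {}
    for p in points:
        cells.setdefault((int(p[0] // cell), int(p[1] // cell)), []).append(p)
    return [p for ps in cells.values()
            for p in rng.sample(ps, min(per_cell, len(ps)))]

rng = random.Random(42)
pts = [(rng.random(), rng.random()) for _ in range(500)]  # dense cell (0, 0)
pts += [(5.5, 5.5), (9.1, 2.3)]                           # two isolated points
sampled = grid_sample(pts)
print(len(sampled))  # 2 capped from the dense cell + the 2 isolated points
```

A uniform sample of the same data would almost never include the two isolated points, which is exactly the trade-off the passage describes.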
“…Prior to forwarding the query to the list of particular resources, the system implemented a semantic optimization to minimize the overall execution cost (Arens et al, 1994). A query processing algorithm must provide real-time answers along with statistical error guarantees, and must scale to petabytes of data and thousands of resources in a fault-tolerant manner (Agarwal et al, 2012). However, we assume that not all queries are associated with explicit evidence of user feedback, and queries are often semantically associated with similar implicit feedback.…”
Section: Query Processing
confidence: 99%
“…Caching intermediate results is one of the most widely used query optimization techniques [22], extended by Safaeei A et al [23] with multiple sliding windows to improve the execution of overlapping queries with common subexpressions. Laptev et al [24] presented the EARL system and Agarwal et al [25] proposed BlinkDB; both iteratively collect larger samples until the desired accuracy is reached. Shark, presented in [26], caches inter-query data using a shared-memory approach.…”
Section: Related Work
confidence: 99%
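The iterative loop the passage attributes to EARL and BlinkDB, collecting progressively larger samples until the desired accuracy is reached, can be sketched as follows. The function, its parameters, and the doubling schedule are assumptions for illustration, not either system's actual algorithm.

```python
import random
import statistics

def iterative_avg(data, target_err=0.1, start=100, z=1.96, seed=0):
    """Grow the sample until the confidence-interval half-width
    drops below target_err (or the whole dataset is consumed)."""
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)          # a prefix of a shuffle is a uniform sample
    n = start
    while True:
        sample = shuffled[:n]
        err = z * statistics.stdev(sample) / len(sample) ** 0.5
        if err <= target_err or n >= len(shuffled):
            return statistics.fmean(sample), err, len(sample)
        n = min(2 * n, len(shuffled))  # double the sample and retry

data = [random.Random(i).gauss(10, 2) for i in range(50_000)]
est, err, used = iterative_avg(data)
print(f"AVG \u2248 {est:.2f} \u00b1 {err:.2f} using {used} rows")
```

Because each iteration reuses the previous sample as a prefix, the total work stays close to the size of the final sample rather than the full dataset.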