In today's world, business transactional data has become the critical part of all business-related decisions. For this purpose, complex analytical queries have been run on transactional data to get the relevant information, from therein, for decision making. These complex queries consume a lot of time to execute as data is spread across multiple disparate locations. Materializing views in the data warehouse can be used to speed up processing of these complex analytical queries. Materializing all possible views is infeasible due to storage space constraint and view maintenance cost. Hence, a subset of relevant views needs to be selected for materialization that reduces the response time of analytical queries. Optimal selection of subset of views is shown to be an NP-Complete problem. In this article, a non-Pareto based genetic algorithm, is proposed, that selects Top-K views for materialization from a multidimensional lattice. An experiments-based comparison of the proposed algorithm with the most fundamental view selection algorithm, HRUA, shows that the former performs comparatively better than the latter. Thus, materializing views selected by using the proposed algorithm would improve the query response time of analytical queries and thereby facilitate in decision making.
A data warehouse system uses materialized views extensively in order to speedily tackle analytical queries. Considering that all possible views cannot be materialized due to maintenance cost and storage constraints, the selection of an appropriate set of views to materialize that achieve an optimal trade-off among query response time, maintenance cost, and the storage constraint becomes an essential necessity. The selection of such an appropriate set of views for materialization is referred to as the materialized views selection problem, which is an NP-Complete problem. In the last two decades, several new selection approaches, based on heuristics, have been proposed. Most of these have used a single objective or weighted sum approach to address the various constraints. In this article, an attempt has been made to address the bi-objective materialized view selection problem, where the objective is to minimize the view evaluation cost of materialized views and the view evaluation cost of the non-materialized views, using the Improved Strength Pareto Evolutionary Algorithm. The experimental results show that the proposed multi-objective view selection algorithm is able to select the Top-K views that achieves a reasonable trade-off between the two objectives. Materializing these selected views would reduce the query response times for analytical queries and thereby facilitates the decision-making process.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.