|
RouPar: Routinely and Mixed Query-Driven Approach for Data Partitioning
Ladjel Bellatreche1, Amira Kerkad1, Sebastian Breß2, and Dominique Geniet1
1LIAS/ENSMA, Poitiers University Futuroscope, France
bellatreche@ensma.fr
amira.kerkad@ensma.fr
dgeniet@ensma.fr
2University of Magdeburg D-39016, Germany
bress@iti.cs.uni-magdeburg.de
Abstract. With the big data era and the cloud, several applications are designed around analytical aspects, where the data warehousing technology is in the heart of their construction chain. The interaction between queries in such environments represents a big challenge due to three dimensions: (i) the routinely aspects of queries, (ii) their large number, and (iii) the high operation sharing between queries. In the context of very large databases, these operations are expensive and need to be optimized. The horizontal data partitioning ( ) is a pre-condition for designing extremely large databases in several environments: centralized, distributed, parallel and cloud. It aims to reduce the cost of these operations. In , the optimization space of potential candidates for partitioning grows exponentially with the problem size making the problem NP-hard. In this paper, we propose a new approach based on query interactions to select a partitioning schema of a data warehouse in a divide and conquer manner to achieve an improved trade-off between the optimization algorithm’s speed and the quality of the solution. The effectiveness of our approach is proven through a validation using the Star Schema Benchmark (100 GB) on Oracle11g.
LNCS 8185, p. 309 ff. Full article in PDF | BibTeX
lncs@springer.com
© Springer-Verlag Berlin Heidelberg 2013
|