Workload-Driven Adaptive Data Partitioning and Distribution – The Cumulus Approach

Fetai, Ilir and Murezzan, Damian and Schuldt, Heiko. (2015) Workload-Driven Adaptive Data Partitioning and Distribution – The Cumulus Approach. In: IEEE International Conference on Big Data : proceedings. pp. 1688-1697.

Full text not available from this repository.

Official URL: http://edoc.unibas.ch/40143/

Cloud environments usually feature several geographically distributed data centers. To increase the scalability of applications, many Cloud providers partition data and distribute the partitions across data centers to balance the load. However, if the partitions are not chosen carefully, transactions may access data in several partitions and thus become distributed transactions. Such transactions are particularly expensive when applications require strong consistency guarantees: the additional synchronization needed for atomic commitment strongly impacts transaction throughput and can even completely undo the gain achieved by load balancing. Hence, it is beneficial to avoid distributed transactions as far as possible by partitioning the data in such a way that transactions can be executed locally. As the access patterns of characteristic transaction workloads may change over time, the partitioning also needs to be updated dynamically. In this paper, we introduce Cumulus, an adaptive data partitioning approach that identifies characteristic access patterns of transaction mixes, determines data partitions based on these patterns, and dynamically re-partitions the data if the access patterns change. In an evaluation based on the TPC-C benchmark, we show that Cumulus significantly increases overall system performance in an OLTP setting compared to static data partitioning approaches. Moreover, we show that Cumulus adapts to workload shifts at runtime by generating partitions that match the actual workload and by re-configuring the system on the fly.
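
The idea in the abstract, that placing co-accessed data together reduces the number of distributed transactions, can be illustrated with a small sketch. This is not the Cumulus algorithm, whose details are not given in this record. It is a simple greedy co-access heuristic, and the function names (distributed_fraction, co_access_partition), the balance rule, and the toy workload are assumptions made purely for illustration.

```python
from collections import Counter
from itertools import combinations
from math import ceil


def distributed_fraction(transactions, assignment):
    """Share of transactions that touch more than one partition."""
    distributed = sum(
        1 for txn in transactions
        if len({assignment[item] for item in txn}) > 1
    )
    return distributed / len(transactions)


def co_access_partition(transactions, num_partitions):
    """Greedy heuristic (illustrative only, not the method of the paper):
    merge the most frequently co-accessed items into clusters, capped at
    an even share of the items, then spread clusters over partitions."""
    items = sorted({item for txn in transactions for item in txn})
    capacity = ceil(len(items) / num_partitions)

    # Count how often each pair of items appears in the same transaction.
    pair_counts = Counter()
    for txn in transactions:
        pair_counts.update(combinations(sorted(set(txn)), 2))

    # Merge clusters along the most frequent pairs while respecting the cap.
    cluster_of = {item: {item} for item in items}
    for (a, b), _count in pair_counts.most_common():
        ca, cb = cluster_of[a], cluster_of[b]
        if ca is not cb and len(ca) + len(cb) <= capacity:
            merged = ca | cb
            for item in merged:
                cluster_of[item] = merged

    # Collect each distinct cluster once, largest first.
    clusters, seen = [], set()
    for item in items:
        cluster = cluster_of[item]
        if id(cluster) not in seen:
            seen.add(id(cluster))
            clusters.append(cluster)
    clusters.sort(key=len, reverse=True)

    # Place every cluster on the currently least loaded partition.
    loads = [0] * num_partitions
    assignment = {}
    for cluster in clusters:
        target = loads.index(min(loads))
        for item in cluster:
            assignment[item] = target
        loads[target] += len(cluster)
    return assignment


# Toy workload (hypothetical): items "a".."d", seven transactions.
workload = [("a", "b")] * 3 + [("c", "d")] * 3 + [("a", "c")]
static = {"a": 0, "b": 1, "c": 0, "d": 1}   # workload-unaware placement
adaptive = co_access_partition(workload, num_partitions=2)
```

On this toy workload, distributed_fraction(workload, static) is 6/7, while distributed_fraction(workload, adaptive) is 1/7. Re-running co_access_partition on a recent window of transactions would be one simple way to mimic the adaptation to workload shifts that the abstract describes.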
Faculties and Departments: 05 Faculty of Science > Departement Mathematik und Informatik > Informatik > Databases and Information Systems (Schuldt)
UniBasel Contributors: Schuldt, Heiko and Fetai, Ilir and Murezzan, Damian
Item Type: Conference or Workshop Item, refereed
Conference or workshop item Subtype: Conference Paper
Note: Publication type according to Uni Basel Research Database: Conference paper
Last Modified: 22 Mar 2018 11:52
Deposited On: 22 Mar 2018 11:52
