You are here

Cache Oblivious Scheduling of Shared Workloads

TitleCache Oblivious Scheduling of Shared Workloads
Publication TypeConference Paper
Year of Publication2015
AuthorsBär, A., L. Golab, S. Ruehrup, M. Schiavone, and P. Casas
Conference Name31st IEEE International Conference on Data Engineering (ICDE)
Date Published05/2015
PublisherIEEE
Conference LocationSeoul, Korea
Abstract

Shared workload optimization is feasible if the set of tasks to be executed is known in advance, as is the case in updating a set of materialized views or executing an extract-transform-load workflow. In this paper, we consider dataintensive shared workloads with precedence constraints arising from data dependencies, i.e., before executing some task, other tasks may have to run first and generate some data needed by the next task(s). While there has been previous work on identifying common subexpressions in shared workloads and task re-ordering to enable shared scans, in this paper we go a step further and solve the problem of scheduling shared data-intensive workloads in a cache-oblivious way. Our solution relies on a novel formulation of precedence constrained scheduling with the additional constraint that once a data item is in the cache, all tasks that require this data item should execute as soon as possible thereafter. The intuition behind this formulation is that the longer a data item remains in the cache, the more likely it is to be evicted regardless of the cache size. We give an optimal ordering algorithm using A* search over the space of possible orderings, and we propose efficient and effective heuristics that obtain nearly-optimal results in much less time. We present experimental results on real-life data warehouse workloads and the TCP-DS benchmark to validate our claims.

Citation KeyBae2015
Refereed DesignationRefereed
Project year: 
Third year
WP(s) associated with the paper: 
WP3 - Large-scale data analysis
Partner(s) associated with the paper's author(s): 
Forschungszentrum Telekommunikation Wien Gmbh
Other
Is this an OFFICIALLY supported mPlane paper?: 
Yes