Predicting completion times of batch query workloads using interaction-aware models and simulation

Mumtaz Ahmad, Songyun Duan, Ashraf Aboulnaga, Shivnath Babu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

33 Citations (Scopus)

Abstract

A question that database administrators (DBAs) routinely need to answer is how long a batch query workload will take to complete. This question arises, for example, while planning the execution of different report-generation workloads to fit within available time windows. To answer this question accurately, we need to take into account that the typical workload in a database system consists of mixes of concurrent queries. Interactions among different queries in these mixes need to be modeled, rather than the conventional approach of considering each query separately. This paper presents a new approach for estimating workload completion times that takes the significant impact of query interactions into account. This approach builds performance models using an experiment-driven technique, by sampling the space of possible query mixes and fitting statistical models to the observed performance at these samples. No prior assumptions are made about the internal workings of the database system or the cause of query interactions, making the models robust and portable. We show that a careful choice of sampling and statistical modeling strategies can result in accurate models, and we present a novel interaction-aware workload simulator that uses these models to estimate workload completion times. An experimental evaluation with complex TPC-H queries on IBM DB2 shows that this approach consistently predicts workload completion times with less than 20% error.

Original languageEnglish
Title of host publicationAdvances in Database Technology - EDBT 2011
Subtitle of host publication14th International Conference on Extending Database Technology, Proceedings
Pages449-460
Number of pages12
DOIs
Publication statusPublished - 18 Apr 2011
Event14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011 - Uppsala, Sweden
Duration: 22 Mar 201124 Mar 2011

Publication series

NameACM International Conference Proceeding Series

Other

Other14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011
CountrySweden
CityUppsala
Period22/3/1124/3/11

    Fingerprint

Keywords

  • Algorithms
  • Experimentation
  • Performance

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Cite this

Ahmad, M., Duan, S., Aboulnaga, A., & Babu, S. (2011). Predicting completion times of batch query workloads using interaction-aware models and simulation. In Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings (pp. 449-460). (ACM International Conference Proceeding Series). https://doi.org/10.1145/1951365.1951419