TY - GEN
T1 - Parameter Curation for Benchmark Queries
AU - Gubichev, Andrey
AU - Boncz, Peter
PY - 2014
Y1 - 2014
AB - In this paper we consider the problem of generating parameters for benchmark queries so these have stable behavior despite being executed on datasets (real-world or synthetic) with skewed data distributions and value correlations. We show that uniform random sampling of the substitution parameters is not well suited for such benchmarks, since it results in unpredictable runtime behavior of queries. We present our approach of Parameter Curation with the goal of selecting parameter bindings that have consistently low-variance intermediate query result sizes throughout the query plan. Our solution is illustrated with IMDB data and the recently proposed LDBC Social Network Benchmark (SNB).
UR - http://www.scopus.com/inward/record.url?scp=84922390960&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84922390960&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-15350-6_8
DO - 10.1007/978-3-319-15350-6_8
M3 - Conference contribution
VL - 8904
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 113
EP - 129
BT - Performance Characterization and Benchmarking: Traditional to Big Data - 6th TPC Technology Conference, TPCTC 2014, Revised Selected Papers
PB - Springer Verlag
T2 - 6th TPC Technology Conference on Performance Evaluation and Benchmarking, TPCTC 2014 held in conjunction with 40th International Conference on Very Large Data Bases, VLDB 2014
Y2 - 1 September 2014 through 5 September 2014
ER -