Mirror of https://github.com/apache/impala.git (synced 2026-01-10 09:00:16 -05:00)
Currently, HBase scan nodes use a constant estimate of 1GB, which is
generally a gross overestimate. This patch improves on that estimate by
using heuristics based on how HBase rows are stored and fetched and how
the scanners interact with the internal memory pool.

Testing:
Added/modified resource requirements planner tests.
Added a JUnit test for the estimation logic.

Change-Id: I583545c3f5e454854f111871c5fbc4f108ae4bff
Reviewed-on: http://gerrit.cloudera.org:8080/11306
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This directory contains Impala test workloads. The directory layout for the workloads should follow:

workloads/
   <data set name>/<data set name>_dimensions.csv  <- The test dimension file
   <data set name>/<data set name>_core.csv        <- A test vector file
   <data set name>/<data set name>_pairwise.csv
   <data set name>/<data set name>_exhaustive.csv
   <data set name>/queries/<query test>.test       <- The queries for this workload
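
The layout described here can be checked mechanically. The following is a minimal sketch, not part of the Impala test framework: the helper names `expected_vector_files` and `check_workload` are hypothetical, and it assumes that every file named in the layout is required and that each data set has at least one `.test` file under `queries/`.

```python
import os
import tempfile

# Suffixes taken from the layout described in this README.
# The helper names below are hypothetical, not Impala APIs.
VECTOR_SUFFIXES = ("dimensions", "core", "pairwise", "exhaustive")


def expected_vector_files(workloads_dir, data_set):
    """Return the CSV paths the layout names for one data set."""
    base = os.path.join(workloads_dir, data_set)
    return [os.path.join(base, "%s_%s.csv" % (data_set, suffix))
            for suffix in VECTOR_SUFFIXES]


def check_workload(workloads_dir, data_set):
    """Return a list of problems; an empty list means the layout matches.

    Assumption: all four CSV files and at least one .test file are required.
    """
    problems = []
    for path in expected_vector_files(workloads_dir, data_set):
        if not os.path.isfile(path):
            problems.append("missing file: " + path)
    queries_dir = os.path.join(workloads_dir, data_set, "queries")
    if not os.path.isdir(queries_dir):
        problems.append("missing directory: " + queries_dir)
    elif not any(name.endswith(".test") for name in os.listdir(queries_dir)):
        problems.append("no .test files in: " + queries_dir)
    return problems
```

Calling `check_workload("workloads", "<data set name>")` from this directory's parent would list any missing pieces for that data set. Returning a list of problems rather than raising on the first one makes it easier to report everything that is missing in a single pass.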