Mirror of https://github.com/apache/impala.git (synced 2025-12-26 14:02:53 -05:00)
* Use saturating arithmetic in Expr.getNumDistinctValues() to avoid
  overflows.
* Avoid double-adding with checkedAdd()
* Fix incorrect logic with multiple groups - each group cannot return
  more than the input rows, but with multiple groups it can add up to
  more than the input rows.

Testing:
Updated planner tests from part 1 to reflect bugfixes.
Added targeted cardinality tests to verify behaviour with and without
stats.
Updated other planner tests that changed as a result of this fix.
Ran exhaustive tests.

Change-Id: Ieed41d60c0e0dfeca64035e919cb8c28a054a9ab
Reviewed-on: http://gerrit.cloudera.org:8080/14132
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
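
The saturating arithmetic and per-group cap described in the commit message above can be illustrated with a minimal sketch. This is not the planner's actual code: the class SaturatingMath, its methods, and the product-of-NDVs estimate are all hypothetical names and assumptions chosen for illustration. It only shows the general idea that each group is capped at the input row count while the sum across groups is not, and that every step clamps instead of overflowing.

```java
// Hypothetical helper for illustration only; not taken from the Impala sources.
final class SaturatingMath {
  private SaturatingMath() {}

  /** Returns a + b, clamped to Long.MAX_VALUE or Long.MIN_VALUE on overflow. */
  static long saturatedAdd(long a, long b) {
    try {
      return Math.addExact(a, b);
    } catch (ArithmeticException e) {
      // Addition can only overflow when both operands have the same sign.
      return a > 0 ? Long.MAX_VALUE : Long.MIN_VALUE;
    }
  }

  /** Returns a * b, clamped to Long.MAX_VALUE or Long.MIN_VALUE on overflow. */
  static long saturatedMultiply(long a, long b) {
    try {
      return Math.multiplyExact(a, b);
    } catch (ArithmeticException e) {
      // Same signs overflow upwards, different signs overflow downwards.
      return ((a < 0) == (b < 0)) ? Long.MAX_VALUE : Long.MIN_VALUE;
    }
  }

  /**
   * Assumed estimate: each group's output is the product of its column NDVs,
   * capped at inputRows. The total over all groups is a saturating sum and
   * may legitimately exceed inputRows.
   */
  static long estimateOutputRows(long inputRows, long[][] groupNdvs) {
    long total = 0;
    for (long[] ndvs : groupNdvs) {
      long groupRows = 1;
      for (long ndv : ndvs) {
        groupRows = saturatedMultiply(groupRows, ndv);
      }
      // A single group cannot return more rows than it reads.
      groupRows = Math.min(groupRows, inputRows);
      total = saturatedAdd(total, groupRows);
    }
    return total;
  }
}
```

With 100 input rows and two groups whose NDVs are {1000, 1000} and {10}, the first group is capped at 100 and the second contributes 10, so the estimate is 110, which is more than the input row count.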
This directory contains Impala test workloads. The directory layout for the
workloads should follow:

workloads/
   <data set name>/<data set name>_dimensions.csv  <- The test dimension file
   <data set name>/<data set name>_core.csv        <- A test vector file
   <data set name>/<data set name>_pairwise.csv
   <data set name>/<data set name>_exhaustive.csv
   <data set name>/queries/<query test>.test       <- The queries for this workload
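
As a rough aid for checking a new workload against the layout above, the sketch below lists the expected entries for a data set and reports which ones are missing. The class WorkloadLayout and its methods are hypothetical and not part of the Impala test framework. It assumes only the file names given in the layout and does not inspect file contents or individual .test files.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical checker for illustration only; not part of the Impala test framework.
final class WorkloadLayout {
  private static final String[] CSV_SUFFIXES = {
    "_dimensions.csv", "_core.csv", "_pairwise.csv", "_exhaustive.csv"
  };

  private WorkloadLayout() {}

  /** Paths, relative to workloads/, that the layout expects for one data set. */
  static List<String> expectedPaths(String dataSet) {
    List<String> paths = new ArrayList<>();
    for (String suffix : CSV_SUFFIXES) {
      paths.add(dataSet + "/" + dataSet + suffix);
    }
    // Directory that holds the <query test>.test files.
    paths.add(dataSet + "/queries");
    return paths;
  }

  /** Returns the expected entries that do not exist under workloadsDir. */
  static List<String> missingEntries(Path workloadsDir, String dataSet) {
    List<String> missing = new ArrayList<>();
    for (String rel : expectedPaths(dataSet)) {
      Path p = workloadsDir.resolve(rel);
      boolean ok = rel.endsWith("/queries") ? Files.isDirectory(p) : Files.isRegularFile(p);
      if (!ok) {
        missing.add(rel);
      }
    }
    return missing;
  }
}
```

For example, calling WorkloadLayout.missingEntries(workloadsDir, "tpch") with "tpch" as an example data set name returns an empty list when all four CSV files and the queries directory are present.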