With the improvements to the in-memory representation of incremental
stats (IMPALA-7424), the per-column, per-partition stats size estimate
should be reduced to account for the compressed memory footprint. In
experiments on various test tables, the size was down by 50-70%. This
patch reduces the size estimate by 50%, which is conservative.

Ideally, the Catalog server would not need to estimate during
serialization, since the byte sizes can be computed exactly by looping
through all the partitions. However, this patch retains the current
logic to keep it consistent with the "compute incremental stats"
analysis.

Change-Id: I347b41d9b298d7cd73ec812692172e0511415eee
Reviewed-on: http://gerrit.cloudera.org:8080/11706
Reviewed-by: Bharath Vissapragada <bharathv@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
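
The two approaches contrasted in this message (a fixed per-column, per-partition estimate versus an exact count obtained by looping over partitions) can be sketched roughly as follows. This is only an illustration and not the code in the patch: the function names, the 400-byte starting value, and the dict-of-bytes partition layout are all assumptions made for the example.

```python
# Hypothetical sketch; names and numbers are NOT taken from the Impala source.
OLD_BYTES_PER_COLUMN_STATS = 400  # assumed pre-patch estimate per column per partition
REDUCTION_FACTOR = 0.5            # the conservative 50% reduction described in the message


def estimate_incremental_stats_bytes(num_partitions, num_columns,
                                     bytes_per_column=OLD_BYTES_PER_COLUMN_STATS,
                                     reduction=REDUCTION_FACTOR):
    """Estimate total stats size from table shape alone, without touching the data."""
    reduced_per_column = int(bytes_per_column * (1 - reduction))
    return num_partitions * num_columns * reduced_per_column


def exact_incremental_stats_bytes(partitions):
    """Compute the exact size by looping over every partition.

    `partitions` is assumed to be a list of dicts that map a column name
    to that column's serialized stats (a bytes object).
    """
    return sum(len(blob) for part in partitions for blob in part.values())
```

Under these assumed numbers, a table with 1000 partitions and 50 columns is estimated at 1000 * 50 * 200 = 10,000,000 bytes. The estimate needs only the partition and column counts, which is why it can also be used at analysis time, whereas the exact count requires the stats to be loaded already.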