Files
impala/testdata/bin/compute-table-stats.sh
Lenni Kuff 36e9fe1c1a Run compute table stats statements using Hive CLI
This works around a problem with computing table stats via the Hive Meta Store client
API. When executing these stements via the MetaStoreClient, all tables were getting a
num_rows=0 value returned from the ANALYZE TABLE query.
2014-01-08 10:49:19 -08:00

16 lines
805 B
Bash
Executable File

#!/bin/bash
# Copyright (c) 2012 Cloudera, Inc. All rights reserved.
# Runs compute table stats over the Impala test tables.
#
set -e
set -u
# Run compute stats over as many of the tables used in the Planner tests as possible.
# Due to Hive bugs HIVE-4119 and HIVE-4122, these tables need to be chosen carefully or
# Hive will either crash or fail with an error when executing the COMPUTE STATS query.
python ${IMPALA_HOME}/tests/util/compute_table_stats.py --db_names=functional\
--table_names="alltypes,alltypesagg,alltypesaggmultifilesnopart,alltypesaggnonulls,
alltypessmall,alltypestiny,hbasealltypessmall,hbasestringids,jointbl,dimtbl"
python ${IMPALA_HOME}/tests/util/compute_table_stats.py --db_names=tpch \
--table_names=customer,lineitem,nation,orders,part,partsupp,region,supplier