impala

mirror of https://github.com/apache/impala.git synced 2026-02-02 06:00:36 -05:00

Files

stiga-huang 6ea15409b8 IMPALA-11208: Fix uninitialized counter of CollectionItemsRead in orc-scanner

CollectionItemsRead in the runtime profile counts the total number of
nested collection items read by the scan node. Only created for scans
that support nested types, e.g. Parquet or ORC.

Each scanner thread maintains its local counter and merges it into
HdfsScanNode counter for each row batch. However, the local counter in
orc-scanner is uninitialized, leading to weird values. This patch simply
initializes it to 0 and adds test coverage.

Tests:
Add profile verification for this counter on some existing query tests.
Note that there are some implementation difference between Parquet and
ORC scanners (e.g. in predicate pushdown). So we will see different
counter results in some query. I just pick some queries that have
consistent counters.

Change-Id: Id7783d1460ac9b98e94d3a31028b43f5a9884f99
Reviewed-on: http://gerrit.cloudera.org:8080/18528
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>

2022-05-18 23:59:58 +00:00

functional-planner

IMPALA-11141: Use exact data types in IN-list filter

2022-04-27 03:30:41 +00:00

functional-query

IMPALA-11208: Fix uninitialized counter of CollectionItemsRead in orc-scanner

2022-05-18 23:59:58 +00:00

perf-regression

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

targeted-perf

IMPALA-2581: LIMIT can be propagated down into some aggregations

2021-09-22 20:42:10 +00:00

targeted-stress

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

tpcds

IMPALA-10034: Add remaining TPC-DS queries to workload.

2020-08-24 16:02:45 +00:00

tpcds-insert

IMPALA-10384: Make partition names consistent between BE and FE

2020-12-11 19:51:28 +00:00

tpcds-unmodified

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

tpch

IMPALA-11208: Fix uninitialized counter of CollectionItemsRead in orc-scanner

2022-05-18 23:59:58 +00:00

tpch_nested

IMPALA-9604: Add TPCH-nested tests for column masking

2020-06-17 06:54:50 +00:00

README

Move functional data loading to new framework + initial changes for workload directory structure

2014-01-08 10:44:18 -08:00

README

This directory contains Impala test workloads. The directory layout for the workloads should follow:

workloads/
   <data set name>/<data set name>_dimensions.csv  <- The test dimension file
   <data set name>/<data set name>_core.csv  <- A test vector file
   <data set name>/<data set name>_pairwise.csv
   <data set name>/<data set name>_exhaustive.csv
   <data set name>/queries/<query test>.test <- The queries for this workload