impala

mirror of https://github.com/apache/impala.git synced 2026-01-09 06:05:09 -05:00

Files

Zoltan Borok-Nagy 881e00a8bf IMPALA-6538: Fix read path when Parquet min/max statistics contain NaN

If the first number in a row group written by Impala is NaN,
then Impala writes incorrect statistics in the metadata.
This will result in incorrect results when filtering the
data.

This commit fixes the read path when encountering NaNs in
Parquet min/max statistics. If min and max are both NaN, we
can't use the statistics at all. If only one of them is NaN,
the other still can be used.

I added some tests to QueryTest/parqet-stats.test

Change-Id: If3897fc1426541239223670812f59e2bed32f455
Reviewed-on: http://gerrit.cloudera.org:8080/9358
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Tested-by: Impala Public Jenkins

2018-02-22 00:57:46 +00:00

queries

IMPALA-6538: Fix read path when Parquet min/max statistics contain NaN

2018-02-22 00:57:46 +00:00

functional-query_core.csv

IMPALA-3718: Support subset of functional-query for Kudu

2016-09-14 22:11:04 +00:00

functional-query_dimensions.csv

Starting Kudu as part of the run-all.sh command / data loading

2015-06-01 15:53:34 -07:00

functional-query_exhaustive.csv

IMPALA-3718: Support subset of functional-query for Kudu