impala

mirror of https://github.com/apache/impala.git synced 2026-02-01 12:00:22 -05:00

Files

Zoltan Borok-Nagy 5d044e0cb2 IMPALA-6542: Fix inconsistent write path of Parquet min/max statistics

Quick fix of Parquet write path until the Parquet community
agrees on the ordering of floating point numbers.

The behavior follows the way fmax()/fmin() works, ie. Impala
will only write NaN into the stats when all the values are NaNs.
This behavior is aligned with the quick fix of Parquet-CPP.

Added e2e tests as well.

Change-Id: I3957806948f7c661af4be5495f2ec92d1e9fc9d6
Reviewed-on: http://gerrit.cloudera.org:8080/9381
Reviewed-by: Lars Volker <lv@cloudera.com>
Tested-by: Impala Public Jenkins

2018-03-08 07:34:41 +00:00

functional-planner

Revert IMPALA-4835 and dependent changes

2018-03-03 04:22:12 +00:00

functional-query

IMPALA-6542: Fix inconsistent write path of Parquet min/max statistics

2018-03-08 07:34:41 +00:00

hive-benchmark

Refactor testing framework to generate Avro tables.

2014-01-08 10:48:45 -08:00

perf-regression

IMPALA-3311: fix string data coming out of aggs in subplans

2016-05-12 23:06:36 -07:00

targeted-perf

IMPALA-6495: fix targeted-perf for new column alias syntax

2018-02-09 16:49:54 +00:00

targeted-stress

IMPALA-4674: Part 2: port backend exec to BufferPool

2017-08-05 01:03:02 +00:00

tpcds

IMPALA-5478: Run TPCDS queries with decimal_v2 enabled

2018-01-18 03:28:51 +00:00

tpcds-insert

[CDH5] Modified TPCDS schema and queries to match Impala TPCDS kit

2014-08-08 02:20:40 -07:00

tpch

Revert IMPALA-4835 and dependent changes

2018-03-03 04:22:12 +00:00

tpch_nested

IMPALA-4924: Enable Decimal V2 by default

2018-01-25 04:33:11 +00:00

README

Move functional data loading to new framework + initial changes for workload directory structure

2014-01-08 10:44:18 -08:00

README

This directory contains Impala test workloads. The directory layout for the workloads should follow:

workloads/
   <data set name>/<data set name>_dimensions.csv  <- The test dimension file
   <data set name>/<data set name>_core.csv  <- A test vector file
   <data set name>/<data set name>_pairwise.csv
   <data set name>/<data set name>_exhaustive.csv
   <data set name>/queries/<query test>.test <- The queries for this workload