mirror of
https://github.com/apache/impala.git
synced 2025-12-19 09:58:28 -05:00
This change adds EE tests in test_parquet_byte_stream_split_encoding.py that check that Impala returns the correct error message when it encounters a table that contains a parquet file with Byte Stream Split encoding. To regenerate the test files, run the parquet_files_generator.py script in the testdata/parquet_byte_stream_split_encoding/ folder. Change-Id: If5eff8bf51fe246a9d0250e38c470b821fec75d9 Reviewed-on: http://gerrit.cloudera.org:8080/22124 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
The doubles_byte_stream_split.parquet and floats_byte_stream_split.parquet files were generated with the parquet_files_generator.py script. The script is using PyArrow (https://arrow.apache.org/docs/python). To regenerate the files, run: python3 parquet_files_generator.py