Files
impala/testdata/workloads/functional-query/functional-query_exhaustive.csv
Eyizoha bb1ec12c1f IMPALA-12431: Support reading compressed JSON file
This patch adds the functionality to read compressed JSON files for the
JSON scanner. Because the decompression code can largely be reused from
HdfsTextScanner, this patch moves that part of the code from
HdfsTextScanner to HdfsScanner so that HdfsJsonScanner can also call it.
As it reuses the relevant code from the TEXT scanner, the compression
formats supported by the Json scanner are the same as those supported by
the TEXT scanner.

Tests
 - Most of the existing end-to-end JSON format tests can run on
   compressed JSON format too.

Change-Id: I2471855d97d4cdd51363b321055e6b06aa6d81e8
Reviewed-on: http://gerrit.cloudera.org:8080/20482
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-01-18 06:35:32 +00:00

2.7 KiB

1# Generated File.
2file_format: text, dataset: functional, compression_codec: none, compression_type: none
3file_format: text, dataset: functional, compression_codec: def, compression_type: block
4file_format: text, dataset: functional, compression_codec: gzip, compression_type: block
5file_format: text, dataset: functional, compression_codec: bzip, compression_type: block
6file_format: text, dataset: functional, compression_codec: snap, compression_type: block
7file_format: seq, dataset: functional, compression_codec: none, compression_type: none
8file_format: seq, dataset: functional, compression_codec: def, compression_type: block
9file_format: seq, dataset: functional, compression_codec: def, compression_type: record
10file_format: seq, dataset: functional, compression_codec: gzip, compression_type: block
11file_format: seq, dataset: functional, compression_codec: gzip, compression_type: record
12file_format: seq, dataset: functional, compression_codec: bzip, compression_type: block
13file_format: seq, dataset: functional, compression_codec: bzip, compression_type: record
14file_format: seq, dataset: functional, compression_codec: snap, compression_type: block
15file_format: seq, dataset: functional, compression_codec: snap, compression_type: record
16file_format: rc, dataset: functional, compression_codec: none, compression_type: none
17file_format: rc, dataset: functional, compression_codec: def, compression_type: block
18file_format: rc, dataset: functional, compression_codec: gzip, compression_type: block
19file_format: rc, dataset: functional, compression_codec: bzip, compression_type: block
20file_format: rc, dataset: functional, compression_codec: snap, compression_type: block
21file_format: avro, dataset: functional, compression_codec: none, compression_type: none
22file_format: avro, dataset: functional, compression_codec: def, compression_type: block
23file_format: avro, dataset: functional, compression_codec: snap, compression_type: block
24file_format: parquet, dataset: functional, compression_codec: none, compression_type: none
25file_format: orc, dataset: functional, compression_codec: def, compression_type: block
26file_format: hbase, dataset: functional, compression_codec: none, compression_type: none
27file_format: kudu, dataset: functional, compression_codec: none, compression_type: none
28file_format: json, dataset: functional, compression_codec: none, compression_type: none
29file_format: json, dataset: functional, compression_codec: def, compression_type: block
30file_format: json, dataset: functional, compression_codec: gzip, compression_type: block
31file_format: json, dataset: functional, compression_codec: bzip, compression_type: block
32file_format: json, dataset: functional, compression_codec: snap, compression_type: block