mirror of
https://github.com/apache/impala.git
synced 2026-01-03 06:00:52 -05:00
Add the ability to read compressed text. The compression type is now read from the
file descriptors. Make the interface of the scanners more uniform.
Remove the LZO_TEXT file format, since it was not actually a file format.
Extend the tests to also load and test the text/{snap,gzip,bzip} databases.
Note that this patch also requires some changes to Impala-lzo.
Change-Id: Ic0742ba11f106ba545050bdb71795efbff70ef74
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/3549
Reviewed-by: Ippokratis Pandis <ipandis@cloudera.com>
Reviewed-by: Ishaan Joshi <ishaan@cloudera.com>
Tested-by: Ippokratis Pandis <ipandis@cloudera.com>
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/3651
Tested-by: jenkins
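
The idea in the commit message of taking the compression type from the file descriptor, rather than encoding it in the file format, can be sketched roughly as follows. This is a minimal illustration in Python and not the scanner code from the patch: `FileDesc`, `DECOMPRESSORS`, and `read_text` are hypothetical names, and only the codecs available in the Python standard library (gzip, bzip2) are covered, so snap and lzo are left out.

```python
import bz2
import gzip
from dataclasses import dataclass


@dataclass
class FileDesc:
    """Hypothetical stand-in for a per-file descriptor."""
    path: str
    compression: str  # codec name as used in the test vectors: none, gzip, bzip


# Hypothetical dispatch table keyed by codec name.
# snap and lzo are omitted because the standard library has no decoder for them.
DECOMPRESSORS = {
    "none": lambda data: data,
    "gzip": gzip.decompress,
    "bzip": bz2.decompress,
}


def read_text(desc: FileDesc, raw: bytes) -> str:
    """Decode the raw bytes of a text file using the codec named in its descriptor."""
    try:
        decompress = DECOMPRESSORS[desc.compression]
    except KeyError:
        raise ValueError(f"unsupported compression: {desc.compression}")
    return decompress(raw).decode("utf-8")
```

With this shape, the file format stays `text` in every case and only the descriptor's codec changes, which matches the reasoning given for dropping LZO_TEXT as a separate format.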
2.1 KiB
| # Generated File. |
|---|
| file_format: text, dataset: functional, compression_codec: none, compression_type: none |
| file_format: text, dataset: functional, compression_codec: gzip, compression_type: block |
| file_format: text, dataset: functional, compression_codec: bzip, compression_type: block |
| file_format: text, dataset: functional, compression_codec: snap, compression_type: block |
| file_format: text, dataset: functional, compression_codec: lzo, compression_type: block |
| file_format: seq, dataset: functional, compression_codec: none, compression_type: none |
| file_format: seq, dataset: functional, compression_codec: def, compression_type: block |
| file_format: seq, dataset: functional, compression_codec: def, compression_type: record |
| file_format: seq, dataset: functional, compression_codec: gzip, compression_type: block |
| file_format: seq, dataset: functional, compression_codec: gzip, compression_type: record |
| file_format: seq, dataset: functional, compression_codec: bzip, compression_type: block |
| file_format: seq, dataset: functional, compression_codec: bzip, compression_type: record |
| file_format: seq, dataset: functional, compression_codec: snap, compression_type: block |
| file_format: seq, dataset: functional, compression_codec: snap, compression_type: record |
| file_format: rc, dataset: functional, compression_codec: none, compression_type: none |
| file_format: rc, dataset: functional, compression_codec: def, compression_type: block |
| file_format: rc, dataset: functional, compression_codec: gzip, compression_type: block |
| file_format: rc, dataset: functional, compression_codec: bzip, compression_type: block |
| file_format: rc, dataset: functional, compression_codec: snap, compression_type: block |
| file_format: avro, dataset: functional, compression_codec: none, compression_type: none |
| file_format: avro, dataset: functional, compression_codec: def, compression_type: block |
| file_format: avro, dataset: functional, compression_codec: snap, compression_type: block |
| file_format: parquet, dataset: functional, compression_codec: none, compression_type: none |
| file_format: hbase, dataset: functional, compression_codec: none, compression_type: none |
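
Each entry in the listing above is a comma-separated run of `key: value` pairs, preceded by a `#` comment line. As a rough sketch of how such a file could be read, assuming exactly that layout, the Python below parses entries into dictionaries. The helper names `parse_vector_line` and `parse_vectors` are hypothetical and are not taken from the Impala test framework.

```python
def parse_vector_line(line):
    """Parse one 'key: value, key: value' entry; return None for blanks and comments."""
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    fields = {}
    for pair in line.split(","):
        # Split on the first colon only, then trim surrounding whitespace.
        key, _, value = pair.partition(":")
        fields[key.strip()] = value.strip()
    return fields


def parse_vectors(text):
    """Parse a whole vector file, skipping comment and empty lines."""
    parsed = (parse_vector_line(line) for line in text.splitlines())
    return [vector for vector in parsed if vector is not None]


# Three lines copied from the listing, used as sample input.
SAMPLE = """\
# Generated File.
file_format: text, dataset: functional, compression_codec: gzip, compression_type: block
file_format: seq, dataset: functional, compression_codec: snap, compression_type: record
"""

vectors = parse_vectors(SAMPLE)
# Collect the codecs exercised for the text file format.
text_codecs = [v["compression_codec"] for v in vectors if v["file_format"] == "text"]
```

Filtering the parsed dictionaries this way makes it easy to check, for example, which codecs a given file format is tested with.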