impala/testdata at master - impala - Gitea: Git with a cup of tea

jprdonnelly/impala

mirror of https://github.com/apache/impala.git synced 2026-02-03 09:00:39 -05:00

Files

History

Arnab Karmakar 6a0eedf4af IMPALA-13299: Support CREATE TABLE LIKE for Iceberg from HDFS sources

This patch enables creating Iceberg tables from non-Iceberg HDFS source
tables (Parquet, ORC, etc.) using CREATE TABLE LIKE with STORED BY ICEBERG.
This provides a metadata-only operation to convert table schemas to Iceberg
format without copying data.

Supported source types: Parquet, ORC, Avro, Text, and other HDFS-based formats
Not supported: Kudu tables, JDBC tables, Paimon tables

Use case: This is particularly useful for Apache Hive 3.1 environments where
CTAS (CREATE TABLE AS SELECT) with STORED BY ICEBERG is not supported - that
feature requires Hive 4.0+. Users can use CREATE TABLE LIKE to create the
Iceberg schema, then use INSERT INTO to migrate data.

Testing:
- Comprehensive tests covering schema conversion with various data types,
  partitioned and external tables, complex types (STRUCT, ARRAY, MAP)
- Bidirectional conversion tests (non-Iceberg → Iceberg and reverse)
- Hive interoperability tests verifying data round-trips correctly

Change-Id: Id162f217e49e9f396419b09815b92eb7f351881e
Reviewed-on: http://gerrit.cloudera.org:8080/23733
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>

2026-02-02 16:29:43 +00:00

..

…

AllTypesErrorNoNulls

…

…

…

avro_schema_resolution

IMPALA-10451: Fix avro table loading failures caused by HIVE-24157

2024-05-09 16:22:56 +00:00

…

bad_parquet_data

…

…

…

IMPALA-12349: Support Apache Hive 2.x in Impala

2026-01-06 16:02:01 +00:00

IMPALA-10319: Support arbitrary encodings on Text files

2025-06-01 21:31:00 +00:00

IMPALA-12349: Support Apache Hive 2.x in Impala

2026-01-06 16:02:01 +00:00

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

ComplexTypesTbl

IMPALA-12159: Support ORDER BY for collections of variable length types in select list

2023-12-06 22:09:05 +00:00

compressed_formats

…

configs/catalog_configs

IMPALA-14016: Add multi-catalog support for local catalog mode

2025-09-19 15:03:59 +00:00

CustomerMultiBlock

…

IMPALA-14092 Part2: Support querying of paimon data table via JNI

2025-12-05 18:19:57 +00:00

IMPALA-14092 Part2: Support querying of paimon data table via JNI

2025-12-05 18:19:57 +00:00

…

empty_parquet_page_source_impala10186

IMPALA-10186: Fix writing empty parquet page

2023-05-19 12:02:42 +00:00

…

IMPALA-13594: Read Puffin stats also from older snapshots

2025-01-23 15:25:59 +00:00

impala-profiles

IMPALA-13624: Implement textual representation for aggregate event sequences

2025-05-14 00:21:54 +00:00

ImpalaDemoDataset

…

[tools] Add .gitignore for new files

2024-06-14 04:14:33 +00:00

…

IMPALA-13675: OAuth AuthN Support for Impala Shell

2025-06-05 21:15:47 +00:00

…

LineItemMultiBlock

IMPALA-11350: Add virtual column FILE__POSITION for Parquet tables

2022-08-12 19:21:55 +00:00

IMPALA-5081: Add codegen_opt_level query option

2023-10-23 21:11:47 +00:00

max_nesting_depth

IMPALA-13053: Update test to use ORC files

2024-05-04 11:19:02 +00:00

migrated_iceberg

IMPALA-13364: Schema resolution doesn't work for migrated partitioned Iceberg tables that have complex types

2024-10-02 17:18:46 +00:00

multi_compression_parquet_data

…

…

…

parquet_byte_stream_split_encoding

IMPALA-13211: Add negative test for Parquet Byte Stream Split encoding

2025-01-02 18:09:38 +00:00

parquet_nested_types_encodings

…

parquet_schema_resolution

…

scale_test_metadata

IMPALA-13020 (part 2): Split out external vs internal Thrift max message size

2024-05-17 19:07:49 +00:00

TblWithRaggedColumns

…

…

…

tinytable_seq_snap

…

…

…

…

IMPALA-13804: Use redacted statement in live table

2025-02-27 22:39:44 +00:00

IMPALA-13299: Support CREATE TABLE LIKE for Iceberg from HDFS sources

2026-02-02 16:29:43 +00:00

__init__.py

…

.gitignore

…