TestImpalaShell.test_cancellation started failing when run with Python
3.9 with the following error message:
RuntimeError: reentrant call inside <_io.BufferedWriter name='<stderr>'>
This patch is a quick fix that changes the stderr write from using
print() to os.write(). Note that the thread-safety issue within
_signal_handler in impala_shell.py during query cancellation still
remains.
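For illustration, a minimal sketch of the pattern (handler body and
message are hypothetical): print() goes through the buffered sys.stderr
object and is not reentrant, while os.write() targets the file
descriptor directly and is async-signal-safe.
  import os
  import signal
  import sys

  def _signal_handler(signum, frame):
      # print() acquires the stderr buffer's lock; if the signal arrives
      # while another write holds it, Python 3 raises "RuntimeError:
      # reentrant call". os.write() bypasses the buffer entirely.
      os.write(sys.stderr.fileno(), b"Cancelling query...\n")

  signal.signal(signal.SIGINT, _signal_handler)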
Testing:
Run and pass test_cancellation in RHEL9 with Python 3.9.
Change-Id: I5403c7b8126b1a35ea841496fdfb6eb93e83376e
Reviewed-on: http://gerrit.cloudera.org:8080/23416
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Before this change, query plans and profiles reported only a single
partition even for partitioned Iceberg tables, which was misleading
for users.
Now we can display the number of scanned partitions correctly for
both partitioned and unpartitioned Iceberg tables. This is achieved by
extracting the partition values from the file descriptors and storing
them in the IcebergContentFileStore. Instead of storing this information
redundantly in all file descriptors, we store them in one place and
reference the partition metadata in the FDs with an id.
This also gives the opportunity to optimize memory consumption in the
Catalog and Coordinator as well as reduce network traffic between them
in the future.
Time travel is handled similarly to oldFileDescMap. In that case
we don't know the total number of partitions in the old snapshot,
so the output is [Num scanned partitions]/unknown.
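As a rough sketch of the interning idea described above (illustrative
Python, not the actual Java API of IcebergContentFileStore): each
distinct set of partition values is stored once, and file descriptors
carry only a small integer id.
  class PartitionStore:
      """Stores each distinct partition's values once; file descriptors
      reference them by id instead of duplicating the metadata."""
      def __init__(self):
          self._partitions = []   # id -> tuple of partition values
          self._ids = {}          # tuple of partition values -> id

      def intern(self, values):
          key = tuple(values)
          if key not in self._ids:
              self._ids[key] = len(self._partitions)
              self._partitions.append(key)
          return self._ids[key]

      def num_scanned_partitions(self, fd_partition_ids):
          return len(set(fd_partition_ids))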
Testing:
- Planner tests
- E2E tests
- partition transforms
- partition evolution
- DROP PARTITION
- time travel
Change-Id: Ifb2f654bc6c9bdf9cfafc27b38b5ca2f7b6b4872
Reviewed-on: http://gerrit.cloudera.org:8080/23113
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
test_load_table_with_primary_key_attr has been intermittently failing
with the following error message since the Python 3 switch in
IMPALA-14333:
dbm.error: db type could not be determined
The error happens inside load_unique_col_metadata, but both
load_unique_col_metadata and persist_unique_col_metadata appear to be
unused now.
While the root cause is still unknown, commenting out the
load_unique_col_metadata call in db_connection.py makes the tests pass.
The tests are not affected because they do not assert against any unique
columns. This patch implements that workaround until we can revisit the
issue with the query generator infrastructure.
Fix a few flake8 warnings in db_connection.py.
Testing:
Loop and pass test_show_create_table 10 times using downstream Jenkins.
Change-Id: I2caf6b9cc5a4f06237323cb32bbd47afef576fa1
Reviewed-on: http://gerrit.cloudera.org:8080/23412
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
The new test introduced in IMPALA-14290 depends on the Iceberg version,
because newer ones (e.g. 1.5.2) will show
{"start_time_month":"646","end_time_day":"19916"}
instead of
{"start_time_day":null,"end_time_month":null,"start_time_month":"646","end_time_day":"19916"}
The test now accepts both cases.
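The version-tolerant comparison can be sketched like this (hypothetical
helper, not the test's actual code); dropping null-valued keys makes
both outputs compare equal:
  import json

  def normalize(s):
      # Keys with null values carry no information here, so ignore them.
      return {k: v for k, v in json.loads(s).items() if v is not None}

  new = '{"start_time_month":"646","end_time_day":"19916"}'
  old = ('{"start_time_day":null,"end_time_month":null,'
         '"start_time_month":"646","end_time_day":"19916"}')
  assert normalize(new) == normalize(old)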
Testing:
* ran query_test/test_iceberg.py with both Iceberg 1.3.1 and 1.5.2
Change-Id: I17e368ac043d1fbf80a78dcac6ab1be5a297b6ea
Reviewed-on: http://gerrit.cloudera.org:8080/23389
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Impalad crashes (hitting a DCHECK) when both enable_workload_mgmt and
gen_experimental_profile are enabled. This is because the lambda
function process_exec_profile expects an "Averaged Fragment" node to
exist in the query profile, but that node does not exist in the V2
query profile.
This patch fixes the issue by gathering the ScratchBytesWritten,
ScannerIoWaitTime, and DataCacheHitBytes counters differently in the V2
profile.
Testing:
- Add TestQueryLogTableHS2::test_with_experimental_profile.
- Manually start a minicluster with both the enable_workload_mgmt and
gen_experimental_profile flags enabled. Run a few queries and confirm
no crash happens. Also verify that the columns of sys.impala_query_log
that summarize the scan node counters are correct.
Change-Id: Iccb4ad9279b0d66479b1e7816ffc732028e71734
Reviewed-on: http://gerrit.cloudera.org:8080/23396
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch mainly implements the creation/drop of paimon tables
through Impala.
Supported impala data types:
- BOOLEAN
- TINYINT
- SMALLINT
- INTEGER
- BIGINT
- FLOAT
- DOUBLE
- STRING
- DECIMAL(P,S)
- TIMESTAMP
- CHAR(N)
- VARCHAR(N)
- BINARY
- DATE
Syntax for creating a paimon table:
CREATE [EXTERNAL] TABLE [IF NOT EXISTS] [db_name.]table_name
(
[col_name data_type ,...]
[PRIMARY KEY (col1,col2)]
)
[PARTITIONED BY (col_name data_type [COMMENT 'col_comment'], ...)]
STORED AS PAIMON
[LOCATION 'hdfs_path']
[TBLPROPERTIES (
'primary-key'='col1,col2',
'file.format' = 'orc/parquet',
'bucket' = '2',
'bucket-key' = 'col3'
)];
Two types of paimon catalogs are supported.
(1) Create table with hive catalog:
CREATE TABLE paimon_hive_cat(userid INT,movieId INT)
STORED AS PAIMON;
(2) Create table with hadoop catalog:
CREATE [EXTERNAL] TABLE paimon_hadoop_cat
STORED AS PAIMON
TBLPROPERTIES('paimon.catalog'='hadoop',
'paimon.catalog_location'='/path/to/paimon_hadoop_catalog',
'paimon.table_identifier'='paimondb.paimontable');
SHOW TABLE STATS/SHOW COLUMN STATS/SHOW PARTITIONS/SHOW FILES
statements are also supported.
TODO:
- Patches pending submission:
- Query support for paimon data files.
- Partition pruning and predicate pushdown.
- Query support with time travel.
- Query support for paimon meta tables.
- WIP:
- Complex type query support.
- Virtual Column query support for querying
paimon data table.
- Native paimon table scanner, instead of
JNI-based.
Testing:
- Add unit test for paimon impala type conversion.
- Add unit test for ToSqlTest.java.
- Add unit test for AnalyzeDDLTest.java.
- Update default_file_format TestEnumCase in
be/src/service/query-options-test.cc.
- Update test case in
testdata/workloads/functional-query/queries/QueryTest/set.test.
- Add test cases in metadata/test_show_create_table.py.
- Add custom test test_paimon.py.
Change-Id: I57e77f28151e4a91353ef77050f9f0cd7d9d05ef
Reviewed-on: http://gerrit.cloudera.org:8080/22914
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
JniFrontend.getDbs() returns the thrift representation of all the dbs.
This might trigger multiple getPartialCatalogObject requests to catalogd
and could fail with InconsistentMetadataFetchException, e.g. if a db is
removed after the coordinator fetches the db name list and before it
fetches the msDb of that db.
This patch fixes the issue by retrying the above steps when hitting
InconsistentMetadataFetchException, similar to what other methods in
Frontend do. Adds getThriftDbs() in Frontend to return the thrift db
list so that JniFrontend can use it directly and the retry can be added
inside Frontend.java.
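The retry pattern can be sketched roughly as follows (illustrative
names, not the actual Frontend code); each attempt re-runs the whole
fetch so it sees a consistent catalog snapshot:
  class InconsistentMetadataFetchException(Exception):
      pass

  def fetch_with_retry(fetch_fn, max_attempts=10):
      # Re-run the whole fetch when the catalog changed mid-request,
      # e.g. a db was dropped between listing names and loading its
      # metadata.
      for attempt in range(max_attempts):
          try:
              return fetch_fn()
          except InconsistentMetadataFetchException:
              if attempt == max_attempts - 1:
                  raise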
TestAuthorization.test_local_catalog_show_dbs_with_transient_db is an
existing test to verify a similar problem. Running this test with
authorization disabled can reproduce the current bug. So this patch
extracts the test code into
TestLocalCatalogRetries._run_show_dbs_with_transient_db() and shares it
in both the authz-enabled and authz-disabled tests.
Tests
- Ran TestLocalCatalogRetries.test_show_dbs_retry 60 times. Without the
fix, it fails about a dozen times.
Change-Id: Ib337f88a2ac0f35142417f6cee51d30497f12845
Reviewed-on: http://gerrit.cloudera.org:8080/23402
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Added the '-udf_devel_package' option to buildall.sh. This generates the
impala-udf-devel RPM, which includes UDF headers and static libraries -
ImpalaUdf-retail.a and ImpalaUdf-debug.a.
Testing:
- Tested that the RPM is generated using the build script:
./buildall.sh -release_and_debug -notests -udf_devel_package
- Tested that the RPM is also generated using the standalone script:
./bin/make-impala-udf-devel-rpm.sh
- Generated impala-udf-devel package and tested compiling
impala_udf_samples:
https://github.com/cloudera/impala-udf-samples
Change-Id: I5b85df9c3f680a7e5551f067a97a5650daba9b50
Reviewed-on: http://gerrit.cloudera.org:8080/23060
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
With this patch we create Iceberg file descriptors from
LocatedFileStatus objects during IcebergFileMetadataLoader's
parallelListing(). This has the following benefits:
* We parallelize the creation of Iceberg file descriptor objects
* We don't need to maintain a large hash map with all the
LocatedFileStatus objects at once. Now we only need to keep a few
LocatedFileStatus objects per partition in memory while we are
converting them to Iceberg file descriptors. I.e., the GC is free to
destroy the LocatedFileStatus objects we don't use anymore.
This patch retires startup flag 'iceberg_reload_new_files_threshold'.
Since IMPALA-13254 we only list partitions that have new data files,
and we load them in parallel, i.e. efficient incremental table loading
is already covered. From that point the startup flag only added
unnecessary code complexity.
Measurements
I created two tables (from tpcds.store_sales) to measure table loading
times for large tables:
Table #1:
PARTITIONED BY SPEC(ss_item_sk, BUCKET(5, ss_sold_time_sk))
partitions: 107818
files: 754726
Table #2:
PARTITIONED BY SPEC(ss_item_sk)
partitions: 18000
files: 504224
Time taken in IcebergFileMetadataLoader.load() during full table reload:
+----------+-------+------+---------+
| | Base | New | Speedup |
+----------+-------+------+---------+
| Table #1 | 17.3s | 8.1s | 2.14 |
| Table #2 | 7.8s | 4.3s | 1.8 |
+----------+-------+------+---------+
I measured incremental table loading only for Table #2 (since there are
more files per partition this is the worse scenario for the new code, as
it only uses file listings, and each new file was created in a separate
partition).
Time taken in IcebergFileMetadataLoader.load() during incremental table
reload:
+------------+------+------+---------+
| #new files | Base | New | Speedup |
+------------+------+------+---------+
| 1 | 1.4s | 1.6s | 0.9 |
| 100 | 1.5s | 1.9s | 0.8 |
| 200 | 1.5s | 1.5s | 1 |
+------------+------+------+---------+
We lose a few tenths of a second, but I think the simplified code
justifies it.
Testing:
* some tests were updated because we don't have the
startup flag 'iceberg_reload_new_files_threshold' anymore
Change-Id: Ia1c2a7119d76db7ce7c43caec2ccb122a014851b
Reviewed-on: http://gerrit.cloudera.org:8080/23363
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
IMPALA-13850 (part 4) modified CatalogServiceCatalog.getDb() to delay
looking up the catalog cache until the initial reset() is complete.
EventProcessor can start processing events before reset() happens and
obtain versionLock_.readLock() when calling
CatalogServiceCatalog.getDbProperty(). Later on, it will hit a deadlock
when attempting to obtain versionLock_.writeLock() through getDb() /
waitInitialResetCompletion(). This lock upgrade from read to write is
unsafe.
This patch mitigates the issue by changing waitInitialResetCompletion()
to not acquire the write lock. After this patch, it sleeps for 100ms
before looping and checking again whether the initial reset has
completed. Modified CatalogResetManager.fetchingDbs_ to a
ConcurrentLinkedQueue so that isActive() can be called without holding
the write lock.
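The lock-free wait can be sketched like this (illustrative Python, not
the actual Java code); the point is that a reader thread never upgrades
its read lock to a write lock:
  import threading
  import time

  def wait_initial_reset_completion(reset_done: threading.Event):
      # Poll instead of blocking on the write lock. Acquiring the write
      # lock here while holding the read lock is the unsafe read-to-write
      # upgrade that caused the deadlock.
      while not reset_done.is_set():
          time.sleep(0.1)  # 100ms, matching the interval described above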
Add helper classes ReadLockAndLookupDb and WriteLockAndLookupDb. Both
call waitInitialResetCompletion() before obtaining the appropriate lock.
WriteLockAndLookupDb additionally calls
resetManager_.waitOngoingMetadataFetch() to ensure the dbCache_ lookup
is safe for writes.
Skip calling catalog_.startEventsProcessor() in JniCatalog constructor.
Instead, let CatalogServiceCatalog.reset() start it at the end of cache
population.
Added @Nullable annotations on CatalogServiceCatalog methods that can
return null. Fixed some null check warnings that showed up afterwards.
Remove dead code CatalogServiceCatalog.addUserIfNotExists() and
CatalogOpExecutor.getCurrentEventId().
Testing:
Increase TRIGGER_RESET_METADATA_DELAY from 1s to 3s in
test_metadata_after_failover_with_delayed_reset. It was easy to hit the
deadlock with a 3s delay before the patch. No deadlock happens after
the patch.
Run and pass test_catalogd_ha.py and test_restart_services.py
exhaustively.
Change-Id: I3162472ea9531add77886bf1d0d73460ff34d07a
Reviewed-on: http://gerrit.cloudera.org:8080/23382
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Riza Suminto <riza.suminto@cloudera.com>
This patch modifies the string overload of
IcebergFunctions::TruncatePartitionTransform so that it always handles
strings as UTF-8-encoded ones, because the Iceberg specification states
that strings are UTF-8 encoded.
Also, for Iceberg tables UrlEncode is now called not in the
Hive-compatible way but in the standard way, similar to Java's
URLEncoder.encode() (which the Iceberg API also uses), to conform with
existing practice in Hive, Spark and Trino. This included a change in
the set of characters which are not escaped to follow the URL Standard's
application/x-www-form-urlencoded format. [1] Also renamed
ShouldNotEscape to IsUrlSafe for better readability.
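For a feel of the target behavior, Python's quote_plus() implements the
same application/x-www-form-urlencoded scheme (the exact safe sets
differ slightly between implementations, e.g. around '*' and '~'):
  from urllib.parse import quote_plus

  # Alphanumerics and a few safe characters stay literal, space becomes
  # '+', everything else is percent-encoded.
  print(quote_plus("col=a b/c"))  # col%3Da+b%2Fc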
Testing:
* add and extend e2e tests to check partitions with Unicode characters
* add be tests to coding-util-test.cc
[1]: https://url.spec.whatwg.org/#application-x-www-form-urlencoded-percent-encode-set
Change-Id: Iabb39727f6dd49b76c918bcd6b3ec62532555755
Reviewed-on: http://gerrit.cloudera.org:8080/23190
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch adds Keycloak as Identity Provider for Lakekeeper, so
now we can test Impala's Iceberg REST Catalog with an OAuth2
authentication (Client Credentials) flow. The Keycloak
instance is pre-configured with a Lakekeeper realm that contains
the necessary clients, users, scopes and roles.
Manual testing also revealed that our Iceberg REST Catalog
configuration is incomplete. This patch refactors config
handling in a way that both Iceberg native configuration
options and Trino-specific configuration options can be
used with Impala. This will help users use their Trino
connectors with Impala.
By default Impala uses Iceberg 1.3, which assumes that the
Iceberg REST server is also the authentication server. This is
not always true, e.g. Lakekeeper cannot even function as the
authentication server, but it can work with external authentication
servers. Btw, this is why we needed Keycloak in the first place.
It means if someone wants to try out Lakekeeper+Impala with OAuth2,
they need to configure Impala with Iceberg 1.5.
Testing
* manual testing with Iceberg 1.5
Change-Id: Ie5785cb72773e188b1de7c7924cc6f0b1f96de33
(cherry picked from commit a9cb94986a5791be2adcb2f7c576272a9c22e79c)
Reviewed-on: http://gerrit.cloudera.org:8080/23156
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This fix is needed before the join optimization fix can be committed.
The JoinTranspose rule provided by Calcite had two issues:
1) For tpcds-q10 and q35, an exception was being thrown. There is
a bug in the Calcite code when the Join-Project gets matched but
the Join is of reltype SemiJoin. In this case, the Projects do not
get created correctly and the exception gets thrown.
2) We only want to transpose a Project above a Join if there is an
underlying Join underneath the Project. The whole purpose is to
be able to create adjacent Join RelNodes. We do not have to transpose
the Project when it is not sandwiched between two Join nodes. It is
preferable to keep it underneath the Join since the row width
calculation would be affected (the Project may reduce the number of
columns, thus reducing the row width).
This commit extends the given JoinProjectTranspose rule by Calcite
and handles these added restrictions.
Change-Id: I7f62ec030fc8fbe36e6150bf96c9673c44b7da1b
Reviewed-on: http://gerrit.cloudera.org:8080/23313
Reviewed-by: Aman Sinha <amsinha@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch implements an additional check when collecting table names
that are used in the given query. Specifically, for a table name that
is not fully qualified, we make sure the derived fully qualified table
name is not a common table expression (CTE) of a SqlWithItem in a
WITH clause, since such CTEs are not actual tables.
Testing:
- Added a test in test_ranger.py to verify the issue is fixed.
Change-Id: I3f51af42d64cdcff3c26ad5a96c7f53ebef431b3
Reviewed-on: http://gerrit.cloudera.org:8080/23209
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Fang-Yu Rao <fangyu.rao@cloudera.com>
This commit fixes a crash in the sha2() function that occurs when
Impala is run on a FIPS-enabled OS, particularly CentOS 7. Running
sha2() with 384 or 512-bit lengths would cause the impalad
to crash with an OpenSSL assertion failure:
"Low level API call to digest SHA384 forbidden in FIPS mode!"
The root cause was the direct use of low-level OpenSSL API calls
like SHA384(), SHA512(). OpenSSL 1.0 (used in RHEL/CentOS 7) is
particularly strict and forbids these calls in FIPS mode, causing
the module to terminate the process.
This patch changes to use the high-level, FIPS compliant EVP_Digest
API to perform the hash in sha2() function implementation.
Tests:
Ran sha2() on FIPS-enabled CentOS 7 after the change and it succeeded.
Passed exhaustive tests.
Change-Id: I694532350285534fd935c92b7a78bed91ded3cb5
Reviewed-on: http://gerrit.cloudera.org:8080/23373
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
The GetQueryStatus RPC could crash due to a use-after-free error
when accessing the request. When a query was rejected, the
function would call RespondAndReleaseRpc(), which can free the "req"
object, and then attempt to access req->query_id() for logging and
to delete the entry from the admission_state_map_.
This commit fixes the crash by moving the call to
RespondAndReleaseRpc() to the end of the function. This change
aligns this function with others like
AdmissionControlService::ReleaseQuery(), which also deletes from
admission_state_map_ before RespondAndReleaseRpc(), it ensures
that all logic is completed before the RPC resources are released.
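In runnable Python form (illustrative names only, not the real C++
API), the fix is to finish every access to the request before
responding:
  admission_state_map = {}

  def respond_and_release_rpc(resp):
      """Stub: in the real service this may free the request object."""
      resp["done"] = True

  def get_query_status(req, resp):
      # Read query_id and update the map first; respond last, so nothing
      # touches 'req' after its resources may have been released.
      query_id = req["query_id"]
      admission_state_map.pop(query_id, None)
      print("rejected query %s" % query_id)
      respond_and_release_rpc(resp)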
Tests:
Reproduced the issue by running
TestAdmissionControllerStressWithACService::test_mem_limit 100 times;
after the change, it successfully runs 100 times.
Passed exhaustive tests.
Change-Id: I688954c5c671671cc2dc669ecfdf2405476302d7
Reviewed-on: http://gerrit.cloudera.org:8080/23379
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
When creating or altering partitions of Iceberg tables, Impala only
accepts column names if they are in lowercase and throws
ImpalaRuntimeException otherwise.
This patch allows the usage of other cases as well in PARTITION SPEC
clauses. IcebergPartitionField converts field names to lower case in its
constructor, similar to ColumnDef and PartitionKeyValue.
Testing:
* ran existing tests
* add new test with mixed letter case columns
Change-Id: I4080a6b7468fff940435e2e780322d4ba1f0de49
Reviewed-on: http://gerrit.cloudera.org:8080/23334
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Runs mvn clean on all Java subprojects - instead of just ext-data-source
- to avoid build failures when files from other versions of the code and
dependencies are left behind.
Change-Id: I8cf540f90adbff327de98f900059bfa3bbc8ef22
Reviewed-on: http://gerrit.cloudera.org:8080/23374
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
A timestamp string can have a timezone offset at its end, e.g.
"2025-08-31 06:23:24.9392129 +08:00" has "+08:00" as the timezone
offset. When casting strings to the DATE type, we try to find the
default format by matching the separators, i.e. '-', ':', ' ', etc., in
SimpleDateFormatTokenizer::GetDefaultFormatContext(). The one that
matches this example is DEFAULT_DATE_TIME_CTX[], which represents the
default date/time context for "yyyy-MM-dd HH:mm:ss.SSSSSSSSS". The
fractional part at the end can have a length from 0 to 9, matching
DEFAULT_DATE_TIME_CTX[0] to DEFAULT_DATE_TIME_CTX[9] respectively.
When calculating which item in DEFAULT_DATE_TIME_CTX is the matched
format, we use str_len - 20 as the index, where 20 is the length of
"yyyy-MM-dd HH:mm:ss.". This causes an index overflow if the string
length is larger than 29. A wild pointer is returned from
GetDefaultFormatContext(), leading to a crash in the following code.
This patch fixes the issue by adding a check to make sure the string
length does not exceed the max length of the default date/time format,
i.e. DEFAULT_DATE_TIME_FMT_LEN (29). Longer strings will use a
DateTimeFormatContext created lazily.
Note that this just fixes the crash. Converting timestamp strings with a
timezone offset at the end to the DATE type is not supported yet and
will be followed up in IMPALA-14391.
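A sketch of the added bounds check (constants taken from the
description above; the helper name is illustrative):
  DEFAULT_DATE_TIME_FMT_LEN = 29  # len("yyyy-MM-dd HH:mm:ss.SSSSSSSSS")
  PREFIX_LEN = 20                 # len("yyyy-MM-dd HH:mm:ss.")

  def default_format_index(s):
      # Returns the DEFAULT_DATE_TIME_CTX index for the string, or None
      # for longer strings (e.g. with a trailing timezone offset), which
      # must fall back to a lazily created DateTimeFormatContext.
      if len(s) > DEFAULT_DATE_TIME_FMT_LEN:
          return None
      return len(s) - PREFIX_LEN

  assert default_format_index("2025-08-31 06:23:24.9392129 +08:00") is None
  assert default_format_index("2025-08-31 06:23:24.939212900") == 9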
Tests
- Added e2e tests on constant expressions. Also added a test table with
such timestamp strings and added test on it.
Change-Id: I36d73f4a71432588732b2284ac66552f75628a62
Reviewed-on: http://gerrit.cloudera.org:8080/23371
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
In a global INVALIDATE METADATA (catalog reset), catalogd creates an
IncompleteTable for all the known table names. However, the
createEventId is uninitialized so remains -1. Tables could be dropped
unintentionally by stale DropTable or AlterTableRename events.
Ideally when catalogd creates an IncompleteTable during reset(), it
should fetch the latest event on that table and use its event id as the
createEventId. However, fetching such event ids for all tables is
impractical to finish in a reasonable time. It also adds a significant
load on HMS.
As a compromise, this patch uses the current event id when the reset()
operation starts, and sets it to all IncompleteTable objects created in
this reset operation. This is enough to handle self CreateTable /
DropTable / AlterTableRename events since such self-events generated
before that id will be skipped. Such self-events generated after that id
are triggered by concurrent DDLs which will wait until the corresponding
table list is updated in reset(). The DDL will also update createEventId
to skip stale DropTable / AlterTableRename events.
Concurrent CreateTable DDLs could set a stale createEventId if their HMS
operation finishes before reset() and their catalog operations finish
after reset() creates the table. To address this, we add a check in
setCreateEventId() to skip stale event ids.
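A sketch of that check (illustrative Python, not the actual Java): the
event id only ever moves forward, so a stale id from a concurrent DDL
is ignored.
  class IncompleteTable:
      def __init__(self, create_event_id):
          self.create_event_id = create_event_id

      def set_create_event_id(self, event_id):
          # Only advance the watermark; a concurrent CreateTable DDL that
          # finished its HMS call before reset() may try to install an
          # older id, which must be skipped.
          if event_id > self.create_event_id:
              self.create_event_id = event_id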
The current event id of reset() is also used in DeleteEventLog to track
tables removed by this operation.
Refactored IncompleteTable.createUninitializedTable() to force passing a
createEventId as a parameter.
To ease debugging, adds logs when a table is added/removed in HMS event
processing, when the catalog version of a table changes, and when
processing of a rename event starts.
This patch also refactors CatalogOpExecutor.alterTableOrViewRename() by
extracting some code into methods. A race issue is identified and fixed:
DeleteEventLog should be updated before renameTable() updates the
catalog cache so that the removed old table won't be added back by
concurrent processing of a stale CREATE_TABLE event.
_run_ddls_with_invalidation in test_concurrent_ddls.py could still fail
with a timeout when running with sync_ddl=true. The reason is that when
the DDL hits IMPALA-9135 and hangs, it needs catalogd to send new
catalog updates to reach the max waiting attempts (see
waitForSyncDdlVersion()). However, if all other concurrent threads have
already finished, there won't be any new catalog updates, so the DDL
will wait forever and finally the test times out. To work around this,
this patch adds another concurrent thread that keeps creating new
tables until the test finishes.
Tests:
- Ran the following tests in test_concurrent_ddls.py 10 rounds. Each
round takes 11 mins.
- test_ddls_with_invalidate_metadata
- test_ddls_with_invalidate_metadata_sync_ddl
- test_mixed_catalog_ddls_with_invalidate_metadata
- test_mixed_catalog_ddls_with_invalidate_metadata_sync_ddl
- test_local_catalog_ddls_with_invalidate_metadata
- test_local_catalog_ddls_with_invalidate_metadata_sync_ddl
- test_local_catalog_ddls_with_invalidate_metadata_unlock_gap
Change-Id: I6506821dedf7701cdfa58d14cae5760ee178c4ec
Reviewed-on: http://gerrit.cloudera.org:8080/23346
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Running exhaustive tests with env var IMPALA_USE_PYTHON3_TESTS=true
reveals some tests that require adjustment. This patch makes such
adjustments, which mostly revolve around encoding differences and the
str vs bytes types in Python 3. This patch also switches the default to
run pytest with Python 3 by setting IMPALA_USE_PYTHON3_TESTS=true. The
following are the details:
Change the hash() function in conftest.py to crc32() to produce a
deterministic hash. Hash randomization is enabled by default since
Python 3.3 (see
https://docs.python.org/3/reference/datamodel.html#object.__hash__).
This causes test sharding (like --shard_tests=1/2) to produce an
inconsistent set of tests per shard. Always restart the minicluster
during custom cluster tests if the --shard_tests argument is set,
because test order may change and affect test correctness, depending on
whether tests run on a fresh minicluster or not.
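The deterministic sharding can be sketched as follows (hypothetical
helper name, not necessarily the conftest.py code):
  import zlib

  def shard_for_test(test_id, num_shards):
      # hash() is randomized per interpreter process since Python 3.3,
      # so it cannot assign tests to shards consistently across pytest
      # runs; crc32 over the encoded test id is stable everywhere.
      return zlib.crc32(test_id.encode("utf-8")) % num_shards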
Moved one test case from delimited-latin-text.test to
test_delimited_text.py for easier binary comparison.
Add bytes_to_str() as a utility function to decode bytes in Python 3.
This is often needed when inspecting the return value of
subprocess.check_output() as a string.
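A minimal sketch of such a helper (the actual utility may differ):
  def bytes_to_str(value, encoding="utf-8"):
      # subprocess.check_output() returns bytes under Python 3; decode
      # to str for string comparisons. A str input is returned unchanged.
      if isinstance(value, bytes):
          return value.decode(encoding)
      return value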
Implement DataTypeMetaclass.__lt__ to substitute
DataTypeMetaclass.__cmp__ that is ignored in Python3 (see
https://peps.python.org/pep-0207/).
Fix WEB_CERT_ERR difference in test_ipv6.py.
Fix trivial integer parsing in test_restart_services.py.
Fix various encoding issues in test_saml2_sso.py,
test_shell_commandline.py, and test_shell_interactive.py.
Change timeout in Impala.for_each_impalad() from sys.maxsize to 2^31-1.
Switch to binary comparison in test_iceberg.py where needed.
Specify text mode when calling tempfile.NamedTemporaryFile().
Simplify create_impala_shell_executable_dimension to skip testing dev
and python2 impala-shell when IMPALA_USE_PYTHON3_TESTS=true. The reason
is that several UTF-8 related tests in test_shell_commandline.py break
in the Python 3 pytest + Python 2 impala-shell combo. This skipping
already happens automatically on build OSes without system Python 2
available, like RHEL9 (the IMPALA_SYSTEM_PYTHON2 env var is empty).
Removed unused vector argument and fixed some trivial flake8 issues.
Several tests required logic modifications due to intermittent issues
in Python 3 pytest. These include:
Add _run_query_with_client() in test_ranger.py to allow reusing a single
Impala client for running several queries. Ensure clients are closed
when the test is done. Mark several tests in test_ranger.py with
SkipIfFS.hive because they run queries through beeline + HiveServer2,
but the Ozone and S3 build environments do not start HiveServer2 by
default.
Increase the sleep period from 0.1 to 0.5 seconds per iteration in
test_statestore.py and mark TestStatestore to execute serially. This is
because TServer appears to shut down more slowly when run concurrently
with other tests. Handle the deprecation of Thread.setDaemon() as well.
Always set force_restart=True for each test method in TestLoggingCore,
TestShellInteractiveReconnect, and TestQueryRetries to prevent them from
reusing the minicluster from the previous test method. Some of these
tests destroy the minicluster (kill impalad) and will produce a minidump
if the metrics verifier for the next tests fails to detect a healthy
minicluster state.
Testing:
Pass exhaustive tests with IMPALA_USE_PYTHON3_TESTS=true.
Change-Id: I401a93b6cc7bcd17f41d24e7a310e0c882a550d4
Reviewed-on: http://gerrit.cloudera.org:8080/23319
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Trace DML/DDL Queries
* Adds tracing for alter, compute, create, delete, drop, insert,
invalidate metadata, and with queries.
* Stops tracing beeswax queries since that protocol is deprecated.
* Adds Coordinator attribute to Init and Root spans for identifying
where the query is running.
Comment Handling
* Corrects handling of leading comments, both inline and full line.
Previously, queries with comments before the first keyword were
always ignored.
* Adds be ctest tests for determining whether or not a query should
be traced.
General Improvements
* Handles the case where the first query keyword is followed by a
newline character or an inline comment (with or without spaces
between).
* Corrects traces for errored/cancelled queries. These cases
short-circuit the normal query processing code path and have to be
handled accordingly.
* Ends the root span when the query ends instead of waiting for the
ClientRequestState to go out of scope. This change removes
use-after-free issues caused by reading from ClientRequestState
when the SpanManager went out of scope during that object's dtor.
* Simplified minimum TLS version handling because the validators
on the ssl_minimum_version flag eliminate invalid values that
previously had to be accounted for.
* Removes the unnecessary otel_trace_enabled() function.
* Fixes IMPALA-14314 by waiting for the full trace to be written to
the output file before asserting that trace.
Testing
* Full test suite passed.
* ASAN/TSAN builds passed.
* Adds new ctest test.
* Adds custom cluster tests to assert traces for the new supported
query types.
* Adds custom cluster tests to assert traces for errored and
cancelled queries.
Generated-by: Github Copilot (Claude Sonnet 3.7)
Change-Id: Ie9e83d7f761f3d629f067e0a0602224e42cd7184
Reviewed-on: http://gerrit.cloudera.org:8080/23279
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
This is part 1 of the commit for optimizing join rules for
Calcite. This commit is just a copy of the LoptOptimizeJoinRule.java
from Calcite v1.37 for subsequent modification.
The purpose of this commit is to serve as a placeholder
starting point so that the Impala-specific modifications to the rule,
which will be made in subsequent commits for IMPALA-14102, can easily
be seen by comparison.
Change-Id: I63daf6dacf0547a0488c1ecf0bc185b548e00d87
Reviewed-on: http://gerrit.cloudera.org:8080/23312
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Reviewed-by: Aman Sinha <amsinha@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This commit adds the cost model and calculations to be used in the join
optimizer rule. The ImpalaCost object implements the RelOptCost interface
and contains values which contribute to a cost. The ImpalaCost object
roughly mirrors the Calcite VolcanoCost object with some slight variations.
The ImpalaCost object only looks at the cpu and io cost and ignores the
rowCount cost. The rowCount cost is not needed because it is already
baked into the cpu and io results. That is to say, we determine the cpu cost
and io cost by using the rowCount cost.
The ImpalaCost object is generated in the ImpalaRelMdNonCumulativeCost
class which is called from Calcite for a given RelNode. The cost generated
by this object uses the various inputs of the RelNode to calculate the
cpu and io time for the given logical node. Note that this is a
non-cumulative cost. A cumulative cost exists within Calcite as well, but
there was no need to change the cumulative cost logic.
The cost is used by the Calcite LoptOptimizeJoinRule when determining join
ordering. It compares the costs of different join orderings and chooses
the ordering with the lower cost.
With the current iteration, we only customize the costs for Impala for
aggregates, table scans, and joins.
A TODO in this commit is to allow various cpu and io costs to be configurable.
Change-Id: I1e52b0e11e9a6d5814b0313117dd9c56602f3ff5
Reviewed-on: http://gerrit.cloudera.org:8080/23311
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Riza Suminto <riza.suminto@cloudera.com>
Fixes a potential null pointer dereference when log level >= 2.
Adds 'build' as a valid EE test helper directory as VSCode creates
this directory.
Tested locally by running test_scanners from the query_test EE test
suite using a release build of Impala and log level 2. Minidumps were
not generated during this test run but were generated during the same
test run without this fix applied.
Generated-by: Github Copilot (Claude Sonnet 3.7)
Change-Id: I91660aa84407c17ffb7cd3c721d4f3f0a844d61d
Reviewed-on: http://gerrit.cloudera.org:8080/23365
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch adds support for authorization when Calcite is the planner.
Specifically, this patch focuses on the authorization of table-level
and column-level privilege requests, including the case when a table
is a regular view, regardless of whether the view was created by a superuser. Note
that CalciteAnalysisDriver would throw an exception from analysis() if
given a query that requires table masking, i.e., column masking or row
filtering, since this feature is not yet supported by the Calcite
planner.
Moreover, we register the VIEW_METADATA privilege for each function
involved in the given query. We hardcode the database associated with
the function to 'BuiltinsDb', which is a bit hacky. We should not be
doing this once each function could be associated with a database when
we are using the Calcite planner. We may need to change Calcite's
parser for this.
The issue reported in IMPALA-13767 will be taken care of in a separate
patch, and hence this patch could incorrectly register the privilege
request for a common table expression (CTE) in a WITH clause,
preventing a legitimate user from executing a query involving CTEs.
Testing:
- We manually verified that the patch could pass the test cases in
AuthorizationStmtTest#testPrivilegeRequests() except for
"with t as (select * from alltypes) select * from t", for which
the fix will be provided via IMPALA-13767.
- Added various tests in test_ranger.py.
Change-Id: I9a7f7e4dc9a86a2da9e387832e552538e34029c1
Reviewed-on: http://gerrit.cloudera.org:8080/22716
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch attempts to deflake TestAsyncLoadData::test_async_load by
allowing exec_end_state == PENDING if enable_async_load_data. This is OK
because, if enable_async_load_data is true, async_exec_thread_ might be
slightly slower to start and transition the query to RUNNING. wait_time
is also relaxed to at least 2 seconds because wait_start does not
strictly start at the point when async_exec_thread_ starts.
This patch does not change assertion / behavior in sync code
path (enable_async_ddl_execution=false).
Testing:
Loop and pass TestAsyncLoadData 50 times.
Change-Id: I8776a432e60a1167ad54778e81046421df15cf37
Reviewed-on: http://gerrit.cloudera.org:8080/23360
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Reviewed-by: Jason Fehr <jfehr@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Replaced allpairspy with a homemade pair finder that
seems to find a somewhat less optimal (larger) covering
vector set but works reliably with filters. For details
see tests/common/test_vector.py
Also fixes a few test issues uncovered. Some fixes are
copied from https://gerrit.cloudera.org/#/c/23319/
Added the possibility of shuffling vectors to get a
different test set (env var IMPALA_TEST_VECTOR_SEED).
By default the algorithm is deterministic so the test
set won't change between runs (similarly to allpairspy).
Added a new constraint to test only a single compression
per file format in some tests to reduce the number of
new vectors.
EE + custom_cluster test count in exhaustive runs:
before patch: ~11000
after patch: ~16000
without compression constraint: ~17000
Change-Id: I419c24659a08d8d6592fadbbd5b764ff73cbba3e
Reviewed-on: http://gerrit.cloudera.org:8080/23342
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
HIVE-27746 introduced the ALTER_PARTITIONS event type, an optimization
that reduces bulk ALTER_PARTITION events into a single event. The
components version is updated to pick up this change. It is a good
optimization to include in Impala so that the number of events consumed
by the event processor is significantly reduced, helping the event
processor catch up with events quickly.
This patch enables the ability to consume the ALTER_PARTITIONS event.
The downside of this patch is that there is no before_partitions object
in the event message. This can cause partitions to be refreshed even on
trivial changes to them. HIVE-29141 will address this concern.
Testing:
- Added an end-to-end test to verify consuming the ALTER_PARTITIONS
event. Also, bigger timeouts were added in this test as there was
flakiness observed while looping this test several times.
Change-Id: I009a87ef5e2c331272f9e2d7a6342cc860e64737
Reviewed-on: http://gerrit.cloudera.org:8080/22554
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
The test expects a "SELECT..LIMIT 1" query on tpch.lineitem to finish in
2s. This could be impacted by other concurrent tests when memory
reservation is used up. This patch marks the test to run serially to
avoid the impact from other tests.
Change-Id: Ibbb2f1a34e24c83a3d2c69d2daa4dece8d94ec1e
Reviewed-on: http://gerrit.cloudera.org:8080/23351
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
IMPALA-13947 changes the use_local_catalog default to true. This causes
failures when the use_calcite_planner query option is set to true.
The Calcite planner was only handling HdfsTable table types. It will
now handle LocalFsTable table types as well.
Currently, if the table num rows statistic is missing from a table, the
Calcite planner will load all partitions to estimate it by iterating
over all partitions. This is inefficient in local catalog mode and
ideally should happen later, after partition pruning. Follow-up work is
needed to improve this.
Testing:
Reenable local catalog mode in
TestCalcitePlanner.test_calcite_frontend and
TestWorkloadManagementSQLDetailsCalcite.test_tpcds_8_decimal.
Co-authored-by: Riza Suminto
Change-Id: Ic855779aa64d11b7a8b19dd261c0164e65604e44
Reviewed-on: http://gerrit.cloudera.org:8080/23341
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
IMPALA-14327 revealed a TSAN issue in impala-hs2-server.cc.
ClientRequestState::returns_result_set() was accessed without holding
ClientRequestState::lock_ in impala-hs2-server.cc.
This patch fixes the issue by obtaining ClientRequestState::lock_ first
before accessing ClientRequestState::returns_result_set() and
ClientRequestState::result_metadata(). Both accesses are done inside
ImpalaServer::SetupResultsCacheing, so lock_ only needs to be obtained
once. Filed IMPALA-14359 to follow up the investigation.
Testing:
Run and pass TSAN build.
Change-Id: I41fc25cea5b4ef7b4b9daac54b8665fa76ceb1cd
Reviewed-on: http://gerrit.cloudera.org:8080/23343
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
After IMPALA-14283, the coordinator throws an
InconsistentMetadataFetchException when it witnesses the catalogd
service ID change. The Frontend code should retry the request to use
fresh
metadata. This patch resolves the TODO item in Frontend.getDataSrcs() to
add the retry as other methods do. Also change the catch clause in
LocalCatalog.getDataSources() so InconsistentMetadataFetchException can
be thrown from it.
Tests:
- Ran TestExtDataSources.test_catalogd_ha_failover 100 times.
Change-Id: I483423680a5c953aaf3446b62c8b8ee08d6c6385
Reviewed-on: http://gerrit.cloudera.org:8080/23347
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
IMPALA-12829 added extra code to CatalogServiceCatalog's
reloadTableIfExists() that can throw a ClassCastException when it
reloads an Iceberg table. When it happens during event processing
the event processor invalidates the table.
This usually happens when another engine updates an Iceberg table. It
then causes slow table loading times as the tables need to be fully
reloaded instead of just doing an incremental table loading.
This patch fixes the ClassCastException by moving the cast into an
if statement.
Testing
* e2e tests added
Change-Id: I892cf326a72024674facad6750893352b982c658
Reviewed-on: http://gerrit.cloudera.org:8080/23349
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
The decoder can read one or multiple values at a time from the given
buffer. When reading multiple values at a time, they can be read with
a stride.
The encoder adds values one by one, until there are no more values to
add, or the given output couldn't fit any more. Optionally, the encoder
can receive a buffer already filled with values. The encoding happens
upon calling `FinalizePage()`.
Both the encoder and decoder can be used with either a template size_t
value or a value given in the constructor. This value is the size of
the type to be coded in bytes.
* The template option is more optimized, but it only supports 4 and 8
byte types.
* The constructor option is less optimized, but it can receive any
number as the byte size.
To use the constructor-passed number, set the number passed in the
template to 0; otherwise pass the number of bytes in the template.
Note that neither the encoder nor the decoder is integrated with
Impala yet, so reading or writing data with byte stream split encoding
is not yet possible.
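For reference, the byte stream split transform itself can be sketched
in a few lines (illustrative Python, unrelated to the C++
implementation): the k-th byte of every fixed-width value is grouped
into stream k, which places similar bytes next to each other and helps
general-purpose compression of float data.
  def bss_encode(values: bytes, byte_width: int) -> bytes:
      # Gather byte k of every value into stream k.
      n = len(values) // byte_width
      return bytes(values[i * byte_width + k]
                   for k in range(byte_width) for i in range(n))

  def bss_decode(data: bytes, byte_width: int) -> bytes:
      # Reassemble each value from its bytes across the streams.
      n = len(data) // byte_width
      return bytes(data[k * n + i]
                   for i in range(n) for k in range(byte_width))

  assert bss_decode(bss_encode(b"abcdefgh", 4), 4) == b"abcdefgh"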
-------------------------------- Tests ---------------------------------
Created decoder tests for
* basic functionality,
* decoding values one by one
* decoding values in batch
* decoding values combining the previous two
* the stride feature
* skipping a number of values
Created encoder tests for
* basic functionality
* putting values in one by one
* giving the encoder a prepopulated buffer
* finalizing the page
Created two-way tests for the following cases:
* encoding then decoding one by one
* encoding then decoding in batch
* encoding then decoding with stride
* decoding one by one then encoding
* decoding in batch then encoding
* decoding with stride then encoding
Each of these tests is run on a data set of up to 200 values.
These tests are run on every supported type.
Change-Id: I71755d992536d70a22b8fdbfee1afce3aec81c26
Reviewed-on: http://gerrit.cloudera.org:8080/23239
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
IcebergMetaProvider unnecessarily loads Iceberg tables in
loadTableList(). Table loading is a slow operation which can make
simple table listings painfully slow. This behavior is also in contrast
to CatalogdMetaProvider, which lists tables without loading them.
In our tests there were unloadable Iceberg tables, which was never
intended: some test tables were just wrongly created under
iceberg_test/hadoop_catalog/, but they didn't use the HadoopCatalog.
Normally we can assume that the tables returned by an Iceberg REST
Catalog are loadable. Even if they are not, it shouldn't be too
problematic to get an exception a bit later. Also, the new behavior
is aligned with CatalogdMetaProvider, i.e. the tables are listed
without fully loading them, and we only get an error when we want
to use an unloadable table.
This patch moves the Iceberg tables out from
iceberg_test/hadoop_catalog/ that do not conform to HadoopCatalog.
Testing
* existing tests updated with the new paths
Change-Id: I9ff75a751be5ad4b5159a1294eaaa304049c454a
Reviewed-on: http://gerrit.cloudera.org:8080/23326
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This patch stops the Hive Metastore while the Iceberg REST Catalog
tests are running. This is needed to verify that Impala is functioning
without the presence of HMS.
Testing
* custom cluster tests and ranger tests updated
Change-Id: I52577501b730e186bd9ee963543796132e42d97b
Reviewed-on: http://gerrit.cloudera.org:8080/23183
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
For transactional tables, the event processor is not skipping abort_txn
and commit_txn events if the database/table has HMS sync disabled. This
processing is unnecessary; skipping abort_txn and commit_txn events when
the corresponding database or transactional table has HMS sync disabled
helps improve the event processor lag. The database name and table name
are present in the Alloc_write_id_event, and skipping this event is
already implemented if HMS sync is disabled. Since the db name and table
name are not present in the abort_txn and commit_txn events, we need to
check whether HMS sync is disabled via the HMS table property when the
table object is extracted in CatalogServiceCatalog#addWriteIdsToTable().
Also, fixed the partitions and table refreshed metrics for CommitTxn
event.
Additional Issues discovered during testing:
1) CatalogServiceCatalog#reloadTableIfExists() didn't verify whether the
current eventId is older than the table's lastSyncEventId, which leads
to unnecessary reloading of the table for commit txns.
2) Insert queries from Impala didn't update the validWriteIdList for
transactional tables in the cache, so CommitTxn events triggered by
insert events were reloading unpartitioned transactional tables again
while being consumed. Fixed it by updating the validWriteIdList in the
cache.
3) CommitTxn events generated after AlterTable events were leading to
incorrect results if the file metadata reload is skipped in AlterTable
events. The reason is that the AlterTable event updates the writeId from
the metastore but doesn't reload file metadata, which yields incorrect
results. This is fixed in the HdfsTable class by not skipping the file
metadata reload if the writeId changed.
4) Added bigger timeouts in the TestEventProcessingWithImpala test class
to avoid flakiness for the transactional events in the event processor
caused by the catalogd_load_metadata_delay config.
Testing:
- Added end-to-end tests to verify transaction events are skipped.
Change-Id: I5d0ecb3b756755bc04c66a538a9ae6b88011a019
Reviewed-on: http://gerrit.cloudera.org:8080/21175
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
bin/bootstrap-build.sh did not distinguish between various versions of
the Ubuntu platform, and attempted to install unversioned Python
packages (python-dev and python-setuptools) even on newer versions
that don't support Python 2 any longer (e.g. Ubuntu 22.04 and 24.04).
On older Ubuntu versions these packages are still useful, so at this
point it is not feasible just to drop them.
This patch makes these packages optional: they are added to the list of
packages to be installed only if they actually exist for the platform.
The patch also extends the package list with some basic packages that
are needed when bin/bootstrap_build.sh is run inside an Ubuntu 22.04
Docker container.
Tests: ran a compile-only build on Ubuntu 20.04 (still has Python 2) and
on Ubuntu 22.04 (does not support Python 2 any more).
Change-Id: I94ade35395afded4e130b79eab8c27c6171b50d6
Reviewed-on: http://gerrit.cloudera.org:8080/21800
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Updates the SpanManager class so it takes the ClientRequestState lock
when reading from that object.
Updates startup flag otel_trace_span_processor to be hidden. Manual
testing revealed that setting this flag to "simple" (which uses
SimpleSpanProcessor when forwarding OpenTelemetry traces) causes the
SpanManager object to block until the destination OpenTelemetry
collector receives the request and responds. Thus, network slowness
or an overloaded OpenTelemetry collector will block the entire query
processing flow since SpanManager will hold the ClientRequestState
lock throughout the duration of the communication with the
OpenTelemetry collector. Since the SimpleSpanProcessor is useful in
testing, this flag was changed to hidden to avoid incorrect usage in
production.
When generating span attribute values on OpenTelemetry traces for
queries, data is read from ClientRequestState without holding its
lock. The documentation in client-request-state.h specifically states
reading most fields requires holding its lock.
An examination of the opentelemetry-cpp SDK code revealed the
ClientRequestState lock must be held until the StartSpan() and
EndSpan() functions complete. The reason is span attribute keys and
values are deep copied from the source nostd::string_view objects
during these functions.
Testing accomplished by running the test_otel_trace.py custom cluster
tests as regression tests. Additionally, manual testing with
intentionally delayed network communication to an OpenTelemetry
collector demonstrated that the StartSpan() and EndSpan() functions
do not block waiting on the OpenTelemetry collector if the batch span
processor is used. However, these functions do block if the simple
span processor is used.
Additionally, a cause of flaky tests was addressed. The custom
cluster tests wait until JSON objects for all traces are written to
the output file. Since each trace JSON object is written on its own
line in the output file, this wait is accomplished by checking the
number of lines in the output file. Occasionally, the traces would be
partially written to the file which satisfied the line count check
but the trace would not be fully written out when the assertion code
loaded it. In these situations, the test failed because a partial
JSON object cannot be loaded. The fix is to wait both for the
expected line count and for the last line to end with a newline
character. This fix ensures that the JSON representing the trace is
fully written to the file before the assert code loads it.
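A sketch of that readiness check (hypothetical helper; the real test
code may differ):
  def trace_file_ready(path, expected_traces):
      # One JSON object per line; require both the expected line count
      # and a trailing newline so a partially written last line is never
      # parsed.
      with open(path, "rb") as f:
          data = f.read()
      return data.count(b"\n") >= expected_traces and data.endswith(b"\n")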
Generated-by: Github Copilot (Claude Sonnet 3.7)
Change-Id: I649bdb6f88176995d45f7d10db898188bbe0b609
Reviewed-on: http://gerrit.cloudera.org:8080/23294
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This commit is just a copy of the VolcanoCost.java file from Calcite
into this Impala repository. The file can be found here:
https://github.com/apache/calcite/blob/calcite-1.37.0/core/src/main/...
.../java/org/apache/calcite/plan/volcano/VolcanoCost.java
The only differences between this file and the Calcite file are:
1) All VolcanoCost strings have been changed to ImpalaCost
2) The package name is an Impala package.
This will make it easier to show the changes made for the Impala cost
model change in IMPALA-14101.
Change-Id: I864e20fb63c0ae4f2f88016128d2a68f39e17dfb
Reviewed-on: http://gerrit.cloudera.org:8080/23310
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This commit adds Calcite optimization rules to create more efficient
plans. These rules should be considered a work in progress. These
were tested against a 3TB tpcds database so they are fairly efficient
as is, but we can make improvements as we see them along the way.
Most of the changes have been added to the CalciteOptimizer file. There
are several phases of rules that are applied, which are as follows:
- expand nodes: These rules change the plan to a plan that can be
handled by Impala. For instance, there are RelNodes such as
"LogicalIntersect" which are not directly applicable to the Impala
physical nodes so they need to be expanded.
- coerce nodes: This module changes the nodes so they have the
correct datatype values (e.g. literal strings in Calcite are char
but need to be varchar for Impala)
- optimize nodes: first pass on reordering the logical RelNode ordering.
- join: Squishes the join RelNodes together, pushes them into one
"multiJoin" and then lets Calcite's join optimizer reorder the joins
into a more optimal plan. A note on this: with this iteration,
statistics are still not being applied. This will come in with later
commits to make better plans.
- post join optimize nodes: Reruns the optimize nodes since the
join ordering may present new optimization opportunities
- pre Impala commit: Extra massaging after optimization that is
done at the end
- conversion to Impala RelNodes: Maps Calcite RelNodes into Impala
RelNodes which will then be mapped to Impala PlanNodes
In addition to this general change, there is also a change removing
the "toCNF" rule. Calcite has multiple places where it creates a
SEARCH operator via "simplifying" the RexNodes within various rules.
This operator is not supported directly in Impala and we need to call
"expandSearch" to handle this. Because Calcite does this under the
covers in the rules, this has been fixed by overriding the RexBuilder
(with ImpalaRexBuilder) and expanding the SEARCH operator whenever it
is created (sidenote: we could have changed the rules that called
simplify, but that would have resulted in too much code duplication).
The toCNF rule was removed and placed as a call within the
CoerceOperandShuttle, which already manipulates all the RexNodes, so
all that code is now in one place.
Change-Id: I6671f7ed298a18965ef0b7a5fc10f4912333a52b
Reviewed-on: http://gerrit.cloudera.org:8080/22870
Reviewed-by: Aman Sinha <amsinha@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This test only worked as intended with BEESWAX protocol, because in
case of HS2 executing "set opt=?" does not affect execute_async().
As the sleep in the debug_action had no effect, the query could
reach its terminal state too quickly. Besides fixing query option
handling, this also adds a sleep to the query itself to keep it in the
RUNNING state longer.
Also fixed test_concurrent_alter_table_rename which didn't apply
sync_ddl=1 for the same reason.
Testing:
- looped test_get_operation_status_for_client 100 times with debug
and ASAN builds
Change-Id: I260e40a7cdeae57f18891e009f32a9466b2f8ece
Reviewed-on: http://gerrit.cloudera.org:8080/23308
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
Tested-by: Csaba Ringhofer <csringhofer@cloudera.com>
load-data.py is used for data loading while run-workload.py is used for
running perf A/B tests. This patch changes these scripts from using the
Beeswax protocol to the HS2 protocol.
Testing:
Run data loading and perf-AB-test-ub2004 based on this patch.
Change-Id: I1c3727871b8b2e75c3f10ceabfbe9cb96e36ead3
Reviewed-on: http://gerrit.cloudera.org:8080/23309
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Older versions of Impala set exec_state_ in ClientRequestState to
the error state earlier in the query processing than later versions.
This difference translates to when OpenTelemetry trace child spans
report an error status.
For example, in older Impala versions, if a query specifies a column
that does not exist, then the Planning child span has a status of
ERROR. However, in the latest version, the Planning span has a status of
OK, and only the Close span has a status of ERROR.
This difference caused the custom cluster test
test_otel_trace.py::TestOtelTrace::test_invalid_sql to fail in the
older Impala versions but pass in the latest version.
Additionally, older versions of Impala report a different default db.
The latest version reports whatever the client set. This difference
caused test_otel_trace.py::TestOtelTrace::test_retry_select_success
and test_otel_trace.py::TestOtelTrace::test_retry_select_failed to
fail in the older Impala versions because Impala used "tpch" as the
default db while the latest version used "default".
This change causes the OpenTelemetry trace child span where an error
actually occurs to report an error status, matching the behavior
of older Impala versions.
It also modifies test_otel_trace.py to expect the default db in the
OpenTelemetry trace "DefaultDb" attribute to match the query profile.
Testing accomplished by running the test_otel_trace.py custom cluster
tests.
Change-Id: If57aaef2da6d6904c66d0150f50ea9ac1c3ebc8c
Reviewed-on: http://gerrit.cloudera.org:8080/23293
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
In IMPALA-12520 affected paths were modified to test_warehouse
(separated with an underscore) instead of test-warehouse (with a
hyphen). This commit replaces the underscore with a hyphen.
Change-Id: I3a9737af3e6169cc0cd144df53fd35e9e2b20468
Reviewed-on: http://gerrit.cloudera.org:8080/23304
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>