impala

mirror of https://github.com/apache/impala.git synced 2025-12-19 09:58:28 -05:00

Author	SHA1	Message	Date
stiga-huang	3725b4ea63	IMPALA-14617: Deflake TestMetadataReplicas.test_catalog_restart TestMetadataReplicas.test_catalog_restart creates a db and an underlying table in Hive, then expects an INVALIDATE command can bring up the table in impalads. The test runs in the legacy catalogd mode that has the bug of IMPALA-12103. So if the INVALIDATE command runs in a state that catalgod has the db in cache but the db doesn't show up in impalad's cache yet, catalogd will just return the table and impalad will skip it due to db not exists. Then the above assertion fails. The db is added in catalogd cache by processing the CREATE_DATABASE HMS event, which is asynchronous with executing the INVALIDATE command. If the command is triggered before that, the test passes. If the command is triggered after that, the test fails. When the test was written, we don't have HMS event processing yet. It's expected that the db is also added in catalogd by the INVALIDATE command. To deflake the issue, this patch disables HMS event processing in this test, so catalogd always has a consistent cache with impalad when executing the INVALIDATE command. This patch also changes the log level of a log in ImpaladCatalog to warn if a table is not added due to its db is missing. Tests: - Ran the test locally 10 times. Change-Id: I2d17404cc8093eacf9b51df3d22caf5cbb6a61a9 Reviewed-on: http://gerrit.cloudera.org:8080/23798 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-18 06:27:47 +00:00
Riza Suminto	e22c0d33d6	IMPALA-13917 (part 1): Remove Beeswax from protocol dimension Beeswax protocol has been due for deprecation in a long time. This patch remove BEESWAX from create_client_protocol_dimension(). This will limit protocol dimension to [HS2, HS2_HTTP] by default. It is still possible to include BEESWAX again for testing if DEFAULT_TEST_PROTOCOL env var is set to 'beeswax', such as: DEFAULT_TEST_PROTOCOL=beeswax impala-py.test custom_cluster/test_ipv6.py This patch does not disable beeswax server yet. Some tests that specifically test against beeswax protocol, such as test_beeswax.py, will continue to work. ImpalaTestSuite.beeswax_client also remain unchanged. Testing: Run following command and confirm that beeswax protocol is skipped. impala-py.test --collect-only --exploration=exhaustive \ custom_cluster/test_ipv6.py Change-Id: I3cff79f59305b5d44944804ed1f1b92838575495 Reviewed-on: http://gerrit.cloudera.org:8080/23076 Reviewed-by: Jason Fehr <jfehr@cloudera.com> Tested-by: Riza Suminto <riza.suminto@cloudera.com>	2025-12-18 01:46:00 +00:00
stiga-huang	d4644b0381	IMPALA-13607, IMPALA-14490: Deflake test_cache_valid_on_nontransactional_table_ddls When catalogd runs with --start_hms_server=true, it services all the HMS endpoints so that any HMS compatible client would be able to use catalogd as a metadata cache. For all the DDL/DML requests, catalogd just delegates them to HMS APIs without reloading related metadata in the cache. For read requests like get_table_req, catalogd serves them from its cache which could be stale. There is a flag, invalidate_hms_cache_on_ddls, to decide whether to explicitly invalidate the table when catalogd delegates a DDL/DML on the table to HMS. test_cache_valid_on_nontransactional_table_ddls is a test verifying that when invalidate_hms_cache_on_ddls=false, the cache is not updated so should have stale metadata. However, there are HMS events generated from invoking the HMS APIs. Even when invalidate_hms_cache_on_ddls=false, catalogd can still update its cache when processing the corresponding HMS events. The test fails when its check is done after catalogd applies the event (so the cache is up-to-date). If the check is done before that, the test passes. This patch deflakes the test by explicitly disabling event processing. Also updates the description of invalidate_hms_cache_on_ddls to mention the impact of event processing. Tests: - Ran the test locally 100 times. Change-Id: Ib1ffc11a793899a0dbdb009bf2ac311117f2318e Reviewed-on: http://gerrit.cloudera.org:8080/23792 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-17 22:22:53 +00:00
Sai Hemanth Gantasala	7bf9ba38eb	IMPALA-14616: Skip checking batch event processing metrics in test_event_processor_status When hierarchical event processing is enabled, there is no info about the current event batch shown in the /events page. Note that event batches are dispatched and processed later in parallel. The current event batch info is actually showing the current batch that is being dispatched which won't take long. This patch skips checking the current event batch info when hierarchical event processing is enabled. Tests - Verified that test runs fine locally. Change-Id: I2df24d2fd3b028a84d557e70141e68aa234908d4 Reviewed-on: http://gerrit.cloudera.org:8080/23790 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-17 07:42:38 +00:00
stiga-huang	aa4f19219c	IMPALA-14546: Fix jdbc driver urls in test_postgres_jdbc_tables TestPostgresJdbcTables.test_postgres_jdbc_tables uses hardcoded paths for JDBC driver URLs: "/test-warehouse/data-sources/jdbc-drivers/postgresql-jdbc.jar". This doesn't work correctly when running on Ozone where we need the prefix of "ofs://localhost:9862/impala". This patch fixes the issue by constructing the driver URL with FILESYSTEM_PREFIX which is "ofs://localhost:9862/impala" on Ozone. See more in bin/impala-config.sh about how it's set for different filesystems. Tests: - Ran the test on Ozone. Change-Id: Ie0c4368b3262d4dcb9e1c05475506411be2e2ef5 Reviewed-on: http://gerrit.cloudera.org:8080/23787 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>	2025-12-15 20:09:33 +00:00
Xuebin Su	d54b75ccf1	IMPALA-14619: Reset levels_readahead_ for late materialization Previously, `BaseScalarColumnReader::levels_readahead_` was not reset when the reader did not do page filtering. If a query selected the last row containing a collection value in a row group, `levels_readahead_` would be set and would not be reset when advancing to the next row group without page filtering. As a result, trying to skip collection values at the start of the next row group would cause a check failure. This patch fixes the failure by resetting `levels_readahead_` in `BaseScalarColumnReader::Reset()`, which is always called when advancing to the next row group. `levels_readahead_` is also moved out of the "Members used for page filtering" section as the variable is also used in late materialization. Testing: - Added an E2E test for the fix. Change-Id: Idac138ffe4e1a9260f9080a97a1090b467781d00 Reviewed-on: http://gerrit.cloudera.org:8080/23779 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-12 15:12:50 +00:00
stiga-huang	2ebdc05c1d	IMPALA-14615: Skip checking current event in test_event_processor_error_message When hierarchical event processing is enabled, there is no info about the current event batch shown in the /events page. Note that event batches are dispatched and processed later in parallel. The current event batch info is actually showing the current batch that is being dispatched which won't take long. This patch skips checking the current event batch info when hierarchical event processing is enabled. A new method, is_hierarchical_event_processing_enabled(), is added in ImpalaTestClusterProperties for the check. Also fixes is_event_polling_enabled() to accept float values of hms_event_polling_interval_s and adds the missing raise statement when it fails to parse the flags. Tests - Ran the test locally. Change-Id: Iffb84304a4096885492002b781199051aaa4fbb0 Reviewed-on: http://gerrit.cloudera.org:8080/23766 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-12 14:22:21 +00:00
Arnab Karmakar	ddd82e02b9	IMPALA-14065: Support WHERE clause in SHOW PARTITIONS statement This patch extends the SHOW PARTITIONS statement to allow an optional WHERE clause that filters partitions based on partition column values. The implementation adds support for various comparison operators, IN lists, BETWEEN clauses, IS NULL, and logical AND/OR expressions involving partition columns. Non-partition columns, subqueries, and analytic expressions in the WHERE clause are not allowed and will result in an analysis error. New analyzer tests have been added to AnalyzeDDLTest#TestShowPartitions to verify correct parsing, semantic validation, and error handling for supported and unsupported cases. Testing: - Added new unit tests in AnalyzeDDLTest for valid and invalid WHERE clause cases. - Verified functional tests covering partition filtering behavior. Change-Id: I2e2a14aabcea3fb17083d4ad6f87b7861113f89e Reviewed-on: http://gerrit.cloudera.org:8080/23566 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-11 15:36:08 +00:00
Sai Hemanth Gantasala	1684c2d9da	IMPALA-14131: Add flag to configure the default value of 'impala.disableHmsSync' FEATURE: Implement global 'disable_hms_sync_by_default' flag for event processing. This change introduces a new catalogd startup flag, `disable_hms_sync_by_default`, to simplify skipping/processing events. Problem: Disabling event processing globally requires tedious process of setting 'impala.disableHmsSync' property on every database and table, especially if few specific tables requires sync up of events. Solution: The new flag provides a global default for the 'impala.disableHmsSync' property. Behavior: - If `disable_hms_sync_by_default` is true (the intended default-off state), event processing is skipped for all tables/databases unless the property "impala.disableHmsSync"="false" is explicitly set. - This allows users to easily keep event processing off by default and opt-in specific databases or tables to start syncing. - The check order is: table-property > db-property > global default. - HMS polling remains independent and unaffected by this flag. Change-Id: I4ee617aed48575502d9cf5cf2cbea6ec897d6839 Reviewed-on: http://gerrit.cloudera.org:8080/23487 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-11 05:10:32 +00:00
Riza Suminto	3ed2a82a95	IMPALA-14606: Stop building impala-shell for Python 2 This patch stop setting up and building impala-shell for Python 2. A more thorough clean up will be done in the future. Testing: Pass build and test/shell/ in RHEL8. Change-Id: Ic7d59b283f4e2f011880ff6221d550b52714a538 Reviewed-on: http://gerrit.cloudera.org:8080/23750 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-10 04:40:46 +00:00
Csaba Ringhofer	780e6683a2	IMPALA-14573: port critical geospatial functions to c++ (part 1) This commit contains the simpler parts from https://gerrit.cloudera.org/#/c/20602 This mainly means accessors for the header of the binary format and bounding box check (st_envIntersects). New tests for not yet covered functions / overloads are also added. For details of the binary format see be/src/exprs/geo/shape-format.h Differences from the PR above: Only a subset of functions are added. The criteria was: 1. the native function must be fully compatible with the Java version* 2. must not rely on (de)serializing the full geometry 3. the function must be tested 1 implies 2 because (de)serialization is not implemented yet in the original patch for >2d geometries, which would break compatibility for the Java version for ZYZ/XYM/XYZM geometries. *: there are 2 known differences: 1. NULL handling: the Java functions return error instead of NULL when getting a NULL parameter 2. st_envIntersects() doesn't check if the SRID matches - the Java library looks inconsistant about this Because the native functions are fairly safe replacements for the Java ones, they are always used when geospatial_library=HIVE_ESRI. Change-Id: I0ff950a25320549290a83a3b1c31ce828dd68e3c Reviewed-on: http://gerrit.cloudera.org:8080/23700 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-06 07:50:23 +00:00
jichen0919	7e29ac23da	IMPALA-14092 Part2: Support querying of paimon data table via JNI This patch mainly implement the querying of paimon data table through JNI based scanner. Features implemented: - support column pruning. The partition pruning and predicate push down will be submitted as the third part of the patch. We implemented this by treating the paimon table as normal unpartitioned table. When querying paimon table: - PaimonScanNode will decide paimon splits need to be scanned, and then transfer splits to BE do the jni-based scan operation. - We also collect the required columns that need to be scanned, and pass the columns to Scanner for column pruning. This is implemented by passing the field ids of the columns to BE, instead of column position to support schema evolution. - In the original implementation, PaimonJniScanner will directly pass paimon row object to BE, and call corresponding paimon row field accessor, which is a java method to convert row fields to impala row batch tuples. We find it is slow due to overhead of JVM method calling. To minimize the overhead, we refashioned the implementation, the PaimonJniScanner will convert the paimon row batches to arrow recordbatch, which stores data in offheap region of impala JVM. And PaimonJniScanner will pass the arrow offheap record batch memory pointer to the BE backend. BE PaimonJniScanNode will directly read data from JVM offheap region, and convert the arrow record batch to impala row batch. The benchmark shows the later implementation is 2.x better than the original implementation. The lifecycle of arrow row batch is mainly like this: the arrow row batch is generated in FE,and passed to BE. After the record batch is imported to BE successfully, BE will be in charge of freeing the row batch. There are two free paths: the normal path, and the exception path. For the normal path, when the arrow batch is totally consumed by BE, BE will call jni to fetch the next arrow batch. For this case, the arrow batch is freed automatically. For the exceptional path, it happends when query is cancelled, or memory failed to allocate. For these corner cases, arrow batch is freed in the method close if it is not totally consumed by BE. Current supported impala data types for query includes: - BOOLEAN - TINYINT - SMALLINT - INTEGER - BIGINT - FLOAT - DOUBLE - STRING - DECIMAL(P,S) - TIMESTAMP - CHAR(N) - VARCHAR(N) - BINARY - DATE TODO: - Patches pending submission: - Support tpcds/tpch data-loading for paimon data table. - Virtual Column query support for querying paimon data table. - Query support with time travel. - Query support for paimon meta tables. - WIP: - Snapshot incremental read. - Complex type query support. - Native paimon table scanner, instead of jni based. Testing: - Create tests table in functional_schema_template.sql - Add TestPaimonScannerWithLimit in test_scanners.py - Add test_paimon_query in test_paimon.py. - Already passed the tpcds/tpch test for paimon table, due to the testing table data is currently generated by spark, and it is not supported by impala now, we have to do this since hive doesn't support generating paimon table for dynamic-partitioned tables. we plan to submit a separate patch for tpcds/tpch data loading and associated tpcds/tpch query tests. - JVM Offheap memory leak tests, have run looped tpch tests for 1 day, no obvious offheap memory increase is observed, offheap memory usage is within 10M. Change-Id: Ie679a89a8cc21d52b583422336b9f747bdf37384 Reviewed-on: http://gerrit.cloudera.org:8080/23613 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com> Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>	2025-12-05 18:19:57 +00:00
jasonmfehr	f8d21f34bc	IMPALA-14602: Fix Flaky OTel Test Tests in test_otel_trace.py that rely on queries being queued assume a first long-running query will be started before a second query. These tests are flaky most likely because the first long running query is executed asynchronously immediately followed by executing a second query. During slower builds (such as ASAN), the first query may not be in running state before the second query is started. This patch adds a check on the first query to ensure it is running before starting the second query. Change-Id: I9e77ec70b4668f0daed2ab9411f8f6c52ddccb2a Reviewed-on: http://gerrit.cloudera.org:8080/23743 Reviewed-by: Riza Suminto <riza.suminto@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-04 08:05:24 +00:00
ttttttz	5d1f1e0180	IMPALA-14183: Rename the environment variable USE_APACHE_HIVE to USE_APACHE_HIVE_3 When the environment variable USE_APACHE_HIVE is set to true, build Impala for adapting to Apache Hive 3.x. In order to better distinguish it from Apache Hive 2.x later, rename USE_APACHE_HIVE to USE_APACHE_HIVE_3. Additionally, to facilitate referencing different versions of the Hive MetastoreShim, the major version of Hive has been added to the environment variable IMPALA_HIVE_DIST_TYPE. Change-Id: I11b5fe1604b6fc34469fb357c98784b7ad88574d Reviewed-on: http://gerrit.cloudera.org:8080/21724 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-03 13:38:45 +00:00
Riza Suminto	0f8f54de20	IMPALA-14595: Fix Ozone trash path after IMPALA-12893 IMPALA-12893 upgrade CDP_BUILD_NUMBER=71942734 upgrade Ozone version to 1.4.0.7.3.1.500-182. This newer Ozone version does not include WAREHOUSE_PREFIX anymore in its trash path. This patch fix the broken tests in test_ddl.py by updating the expected trash path. Testing: Run and pass metadata/test_ddl.py in Ozone environment. Change-Id: If1271a399d4eb82fed9b073b99d9a7b2c18a03b1 Reviewed-on: http://gerrit.cloudera.org:8080/23734 Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-03 01:48:21 +00:00
Riza Suminto	4f3251e19a	IMPALA-14582: Deflake test_show_create_table_with_stats test_show_create_table_with_stats is flaky due to inconsistent metadata is not handled/retried correctly in coordinator side. This patch deflake it by retrying if InconsistentMetadataFetchException is caught. This patch also fix some flake8 warnings in test_show_create_table.py, including unused 'vector' parameter in several tests. Testing: Loop and pass test_show_create_table_with_stats 10 times. Change-Id: I397b9502d92bfd756929be8e851661fd9246dd5e Reviewed-on: http://gerrit.cloudera.org:8080/23728 Reviewed-by: Quanlong Huang <huangquanlong@gmail.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-12-02 09:23:40 +00:00
Peter Rozsa	6cf21464b4	IMPALA-14569: Fix IllegalStateException in partition pruning on type mismatch This fixes an IllegalStateException in HdfsPartitionPruner when evaluating 'IN' predicates whose consist of two compatible types, for example DATE and STRING: date_col in (<date as string>). Previously, 'canEvalUsingPartitionMd' did not check if the slot type matched the literal type. This caused the frontend to attempt invalid comparisons via 'LiteralExpr.compareTo', leading to IllegalStateException or incorrect pruning. The fix ensures 'canEvalUsingPartitionMd' returns false on type mismatches, deferring evaluation to the backend where proper casting occurs. Testing: - Added regression test in hdfs-partition-pruning.test. Change-Id: Idc226a628c8df559329a060cb963b81e27e21eda Reviewed-on: http://gerrit.cloudera.org:8080/23706 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-27 02:48:28 +00:00
jasonmfehr	336034debd	IMPALA-14480: Optional OpenTelemetry DCHECKs The code in span-manager.cc contains aggressive DCHECKS that rely on the query lifecycle to be deterministic. In reality, the query lifecycle is not completely deterministic due to multiple threads being involved in execution, result retrieval, query shutdown, etc. On debug builds only, a new flag named, otel_trace_exhaustive_dchecks will be available with a default of 'false'. If set to 'true', then optional DCHECKs will be enabled in the SpanManager class to enable identification of edge cases where the query lifecycle proceeds in an unexpected way. The DCHECKs that are controlled by the new flag are those that rely on a specific ordering of start/end child span and add child span event calls. Change-Id: Id6507f3f0e23ecf7c2bece9a6b6c2d86bfac1e57 Reviewed-on: http://gerrit.cloudera.org:8080/23518 Reviewed-by: Michael Smith <michael.smith@cloudera.com> Reviewed-by: Riza Suminto <riza.suminto@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-26 04:48:46 +00:00
Noemi Pap-Takacs	fdad9d3204	IMPALA-13725: Add Iceberg table repair functionalities In some cases users delete files directly from storage without going through the Iceberg API, e.g. they remove old partitions. This corrupts the table, and makes queries that try to read the missing files fail. This change introduces a repair statement that deletes the dangling references of missing files from the metadata. Note that the table cannot be repaired if there are missing delete files because Iceberg's DeleteFiles API which is used to execute the operation allows removing only data files. Testing: - E2E - HDFS - S3, Ozone - analysis Change-Id: I514403acaa3b8c0a7b2581d676b82474d846d38e Reviewed-on: http://gerrit.cloudera.org:8080/23512 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-25 13:03:52 +00:00
Daniel Vanko	3d22c7fe05	IMPALA-12209: Always include format-version in DESCRIBE FORMATTED and SHOW CREATE TABLE for Iceberg tables HiveCatalog does not include format-version for Iceberg tables in the table's parameters, therefore the output of SHOW CREATE TABLE may not replicate the original table. This patch makes sure to add it to both the SHOW CREATE TABLE and DESCRIBE FORMATTED/EXTENDED output. Additionally, adds ICEBERG_DEFAULT_FORMAT_VERSION variable to E2E tests, deducting from IMPALA_ICEBERG_VERSION environment variable. If Iceberg version is at least 1.4, default format-version is 2, before 1.4 it's 1. This way tests can work with multiple Iceberg versions. Testing: * updated show-create-table.test and show-create-table-with-stats.test for Iceberg tables * added format-version checks to multiple DESCRIBE FORMATTED tests Change-Id: I991edf408b24fa73e8a8abe64ac24929aeb8e2f8 Reviewed-on: http://gerrit.cloudera.org:8080/23514 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-24 21:48:17 +00:00
Csaba Ringhofer	f6ceca2b4d	IMPALA-14571: increase planner cost of java functions The main motivation is to evaluate expensive geospatial functions (which are Java functions) last in predicates. Java functions have a major overhead anyway from the JNI call, so bumping all Java function costs seems beneficial. Note that currently geospatial functions are the only built-in Java functions. Change-Id: I11d1652d76092ec60af18a33502dacc25b284fcc Reviewed-on: http://gerrit.cloudera.org:8080/22733 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-24 16:52:59 +00:00
Joe McDonnell	5eea4f6f79	IMPALA-14559: Ship calcite-planner jar in Impala packages This adds the java/impala-package Maven project to make it easier to ship / test the Calcite planner. impala-package has a dependency on impala-frontend and calcite-planner, so its classpath requires no extra work when constructing the classpath. An additional cleanup is that this no longer puts the impala-frontend-*-tests.jar on the classpath by default. This requires updating the query event hooks test, as it relies on that jar being present. This does not change the default value for the use_calcite_planner query option, so there is no change in behavior. Testing: - Ran a core job - Built docker images and OS packages locally Change-Id: I81dec2a5b59e279229a735c8bb1a23c77111a793 Reviewed-on: http://gerrit.cloudera.org:8080/23497 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-21 03:36:12 +00:00
Steve Carlin	e67b627858	IMPALA-14408: (addendum) Log Calcite exception in profile This addendum logs the exception thrown in the runtime profile under the CalciteFailureReason key. Testing: test_ranger.py uses this. Change-Id: Ia18a52c488f9c73d51690997b277fd8e918c645f Reviewed-on: http://gerrit.cloudera.org:8080/23686 Reviewed-by: Joe McDonnell <joemcdonnell@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-20 21:08:48 +00:00
Zoltan Borok-Nagy	454cb07e7c	IMPALA-14556: Move Hive ACID stress tests to exhaustive tests Currently Hive ACID stress tests run with "core" exploration strategy. It was important to get instant feedback about this feature when this was actively developed. Since then development activity around Hive ACID decreased significantly, as focus shifted towards Iceberg. This patch moves Hive ACID tests to exhaustive tests where they will be still executed regularly, but won't slow down pre-commit tests. Change-Id: Id7181fea62e2e3f8fcf7897a70e54a1708ef3f3e Reviewed-on: http://gerrit.cloudera.org:8080/23677 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-18 16:20:16 +00:00
Arnab Karmakar	068158e495	IMPALA-12401: Support more info types for HS2 GetInfo() API This patch adds support for 40+ additional TGetInfoType values in the HiveServer2 GetInfo() API, improving ODBC/JDBC driver compatibility. Previously, only 3 info types were supported (CLI_SERVER_NAME, CLI_DBMS_NAME, CLI_DBMS_VER). The implementation follows the ODBC CLI specification and matches the behavior of Hive's GetInfo implementation where applicable. Testing: - Added unit tests in test_hs2.py for new info types - Tests verify correct return values and data types for each info type Change-Id: I1ce5f2b9dcc2e4633b4679b002f57b5b4ea3e8bf Reviewed-on: http://gerrit.cloudera.org:8080/23528 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>	2025-11-17 19:32:50 +00:00
Zoltan Borok-Nagy	6810368c10	IMPALA-14552: test_show_create_table should be more strict with TBLPROPERTIES contents Currently we use this regex to parse the contents of TBLPROPERTIES: kv_regex = "'([^\']+)'\\s=\\s'([^\']+)'" kv_results = dict(re.findall(kv_regex, map_match.group(1))) This allows strings like: 'X'='Y'='Z' 'X'='Z'$'A'='B' This means it's easy to write strings in .test files that are not valid SQL. This patch adds a few extra checks to validate the TBLPROPERTIES contents. Change-Id: I94110f50720c01dc7807ee56c794d235f4990282 Reviewed-on: http://gerrit.cloudera.org:8080/23671 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>	2025-11-14 23:58:47 +00:00
Mihaly Szjatinya	087b715a2b	IMPALA-14108: Add support for SHOW FILES IN table PARTITION for Iceberg tables This patch implements partition filtering support for the SHOW FILES statement on Iceberg tables, based on the functionality added in IMPALA-12243. Prior to this change, the syntax resulted in a NullPointerException. Key changes: - Added ShowFilesStmt.analyzeIceberg() to validate and transform partition expressions using IcebergPartitionExpressionRewriter and IcebergPartitionPredicateConverter. After that, it collects matching file paths using IcebergUtil.planFiles(). - Added FeIcebergTable.Utils.getIcebergTableFilesFromPaths() to accept pre-filtered file lists from the analysis phase. - Enhanced TShowFilesParams thrift struct with optional selected_files field to pass pre-filtered file paths from frontend to backend. Testing: - Analyzer tests for negative cases: non-existent partitions, invalid expressions, non-partition columns, unsupported transforms. - Analyzer tests for positive cases: all transform types, complex expressions. - Authorization tests for non-filtered and filtered syntaxes. - E2E tests covering every partition transform type with various predicates. - Schema evolution and rollback scenarios. The implementation follows AlterTableDropPartition's pattern where the analysis phase performs validation/metadata retrieval and the execution phase handles result formatting and display. Change-Id: Ibb9913e078e6842861bdbb004ed5d67286bd3152 Reviewed-on: http://gerrit.cloudera.org:8080/23455 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-14 21:43:10 +00:00
Arnab Karmakar	760eb4f2fa	IMPALA-13066: Extend SHOW CREATE TABLE to include stats and partitions Adds a new WITH STATS option to the SHOW CREATE TABLE statement to emit additional SQL statements for recreating table statistics and partitions. When specified, Impala outputs: - Base CREATE TABLE statement. - ALTER TABLE ... SET TBLPROPERTIES for table-level stats. - ALTER TABLE ... SET COLUMN STATS for all non-partition columns, restoring column stats. - For partitioned tables: - ALTER TABLE ... ADD PARTITION statements to recreate partitions. - Per-partition ALTER TABLE ... PARTITION (...) SET TBLPROPERTIES to restore partition-level stats. Partition output is limited by the PARTITION_LIMIT query option (default 1000). Setting PARTITION_LIMIT=0 includes all partitions and emits a warning if the limit is exceeded. Tests added to verify correctness of emitted statements. Default behavior of SHOW CREATE TABLE remains unchanged for compatibility. Change-Id: I87950ae9d9bb73cb2a435cf5bcad076df1570dc2 Reviewed-on: http://gerrit.cloudera.org:8080/23536 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-12 06:11:37 +00:00
ttttttz	75c639c9cd	IMPALA-14498: Fix a bug in initial code review checks When conducting a code review using flake8-diff, it may fail in some code sections due to the use of non-raw strings. This patch modifies one instance to successfully pass the initial code review. Although it is currently working, it may not cover all instances. Change-Id: I71889a117c64500bab13928971a2bce063a72cd4 Reviewed-on: http://gerrit.cloudera.org:8080/23656 Reviewed-by: Quanlong Huang <huangquanlong@gmail.com> Tested-by: Quanlong Huang <huangquanlong@gmail.com>	2025-11-12 01:05:10 +00:00
Xuebin Su	6b6f7e614d	IMPALA-14472: Add create/read support for ARRAY column of Kudu Initial implementation of KUDU-1261 (array column type) recently merged in upstream Apache Kudu repository. This patch add initial Impala support for working with Kudu tables having array type columns. Unlike rows, the elements of a Kudu array are stored in a different format than Impala. Instead of per-row bit flag for NULL info, values and NULL bits are stored in separate arrays. The following types of queries are not supported in this patch: - (IMPALA-14538) Queries that reference an array column as a table, e.g. ```sql SELECT item FROM kudu_array.array_int; ``` - (IMPALA-14539) Queries that create duplicate collection slots, e.g. ```sql SELECT array_int FROM kudu_array AS t, t.array_int AS unnested; ``` Testing: - Add some FE tests in AnalyzeDDLTest and AnalyzeKuduDDLTest. - Add EE test test_kudu.py::TestKuduArray. Since Impala does not support inserting complex types, including array, the data insertion part of the test is achieved through custom C++ code kudu-array-inserter.cc that insert into Kudu via Kudu C++ client. It would be great if we could migrate it to Python so that it can be moved to the same file as the test (IMPALA-14537). - Pass core tests. Co-authored-by: Riza Suminto Change-Id: I9282aac821bd30668189f84b2ed8fff7047e7310 Reviewed-on: http://gerrit.cloudera.org:8080/23493 Reviewed-by: Alexey Serbin <alexey@apache.org> Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-11-08 06:41:07 +00:00
Joe McDonnell	3398f20afe	IMPALA-14491: Fix run-workload.py's handling of HS2's exec summary Recently, we switched bin/run-workload.py to use HS2. It turns out that the HS2 client code is not producing the same data structure for the exec summary. report_benchmark_results.py relies on that data structure and fails for HS2. This changes the HS2 client code to use the same representation as the beeswax. There is already a function that does this conversion (build_summary_table_from_thrift) for our regular tests, so this reuses that function. Testing: - Ran bin/run-workload.py twice to produce json files and processed them with report_benchmark_results.py. This failed before the change and passed afterward. Change-Id: I0a041bdebe748b6b3a05b552584e0ca2327cff67 Reviewed-on: http://gerrit.cloudera.org:8080/23597 Reviewed-by: Riza Suminto <riza.suminto@cloudera.com> Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-25 16:37:46 +00:00
jichen0919	541fb3f405	IMPALA-14092 Part1: Prohibit Unsupported Operation for paimon table This patch is to prohibit un-supported operation against paimon table. All unsupported operations are added the checked in the analyze stage in order to avoid mis-operation. Currently only CREATE/DROP statement is supported, the prohibition will be removed later after the corresponding operation is truly supported. TODO: - Patches pending submission: - Support jni based query for paimon data table. - Support tpcds/tpch data-loading for paimon data table. - Virtual Column query support for querying paimon data table. - Query support with time travel. - Query support for paimon meta tables. Testing: - Add unit test for AnalyzeDDLTest.java. - Add unit test for AnalyzerTest.java. - Add test_paimon_negative and test_paimon_query in test_paimon.py. Change-Id: Ie39fa4836cb1be1b1a53aa62d5c02d7ec8fdc9d7 Reviewed-on: http://gerrit.cloudera.org:8080/23530 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-23 23:06:08 +00:00
pranav.lodha	7f77176970	IMPALA-13869: Support for 'hive.sql.query' property for Hive JDBC tables This patch adds support for the hive.sql.query table property in Hive JDBC tables accessed through Impala. Impala has support for Hive JDBC tables using the hive.sql.table property, which limits users to simple table access. However, many use cases demand the ability to expose complex joins, filters, aggregations, or derived columns as external views. Hive.sql.query leads to a custom SQL query that returns a virtual table(subquery) instead of pointing to a physical table. These use cases cannot be achieved with just the hive.sql.table property. This change allows Impala to: • Interact with views or complex queries defined on external systems without needing schema-level access to base tables. • Expose materialized logic (such as filters, joins, or transformations) via Hive to Impala consumers in a secure, abstracted way. • Better align with data virtualization use cases where physical data location and structure should be hidden from the querying engine. This patch also lays the groundwork for future enhancements such as predicate pushdown and performance optimizations for Hive JDBC tables backed by queries. Testing: End-to-end tests are included in test_ext_data_sources.py. Change-Id: I039fcc1e008233a3eeed8d09554195fdb8c8706b Reviewed-on: http://gerrit.cloudera.org:8080/22865 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-23 21:34:29 +00:00
Yida Wu	1bc7cdbff6	IMPALA-14493: Cap memory usage of global admission service The global admission service can experience OOM errors under high concurrency because its process memory tracker is inaccurate and doesn't account for all memory allocations. Ensuring memory tracker accurately accounts for every allocation could be difficult, this patch uses a simpler solution to introduce a hard memory cap using tcmalloc statistics, which accurately reflect the true process memory usage. If a new query is submitted while tcmalloc memory usage is over the process limit, the query will be rejected immediately to protect from OOM. Adds a new flag enable_admission_service_mem_safeguard allowing this feature to be enabled or disabled. By default, this feature is turned on Tests: Added test test_admission_service_low_mem_limit. Passed exhaustive tests. Change-Id: I2ee2c942a73fcd69358851fc2fdc0fc4fe531c73 Reviewed-on: http://gerrit.cloudera.org:8080/23542 Reviewed-by: Abhishek Rawat <arawat@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-23 12:13:11 +00:00
stiga-huang	ff8bb33b91	IMPALA-12870: Tag query id for Java pool threads Logs from Java threads running in ExecutorService are missing the query id which is stored in the C++ thread-local ThreadDebugInfo variable. This patch adds JNI calls for Java threads to manage the ThreadDebugInfo variable. Currently two thread pools are changed: - MissingTable loading pool in StmtMetadataLoader.parallelTableLoad(). - Table loading pool in TableLoadingMgr. MissingTable loading pool only lives within the parallelTableLoad() method. So we initialize ThreadDebugInfo with the queryId at the beginning of the thread and delete it at the end of the thread. Note that a thread might be reused to load different tables, but they all belong to the same query. Table loading pool is a long running pool in catalogd that never shut down. Threads in it is used to load tables triggered by different queries. We initialize ThreadDebugInfo as the above but update it when the thread starts loading table for a different query id, and reset it when the loading is done. The query id is passed down from the catalogd RPC request headers. Tests: - Added e2e test to verify the logs. - Ran existing CORE tests. Change-Id: I83cca55edc72de35f5e8c5422efc104e6aa894c1 Reviewed-on: http://gerrit.cloudera.org:8080/23558 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-23 03:35:29 +00:00
Joe McDonnell	1913ab46ed	IMPALA-14501: Migrate most scripts from impala-python to impala-python3 To remove the dependency on Python 2, existing scripts need to use python3 rather than python. These commands find those locations (for impala-python and regular python): git grep impala-python \| grep -v impala-python3 \| grep -v impala-python-common \| grep -v init-impala-python git grep bin/python \| grep -v python3 This removes or switches most of these locations by various means: 1. If a python file has a #!/bin/env impala-python (or python) but doesn't have a main function, it removes the hash-bang and makes sure that the file is not executable. 2. Most scripts can simply switch from impala-python to impala-python3 (or python to python3) with minimal changes. 3. The cm-api pypi package (which doesn't support Python 3) has been replaced by the cm-client pypi package and interfaces have changed. Rather than migrating the code (which hasn't been used in years), this deletes the old code and stops installing cm-api into the virtualenv. The code can be restored and revamped if there is any interest in interacting with CM clusters. 4. This switches tests/comparison over to impala-python3, but this code has bit-rotted. Some pieces can be run manually, but it can't be fully verified with Python 3. It shouldn't hold back the migration on its own. 5. This also replaces locations of impala-python in comments / documentation / READMEs. 6. kazoo (used for interacting with HBase) needed to be upgraded to a version that supports Python 3. The newest version of kazoo requires upgrades of other component versions, so this uses kazoo 2.8.0 to avoid needing other upgrades. The two remaining uses of impala-python are: - bin/cmake_aux/create_virtualenv.sh - bin/impala-env-versioned-python These will be removed separately when we drop Python 2 support completely. In particular, these are useful for testing impala-shell with Python 2 until we stop supporting Python 2 for impala-shell. The docker-based tests still use /usr/bin/python, but this can be switched over independently (and doesn't impact impala-python) Testing: - Ran core job - Ran build + dataload on Centos 7, Redhat 8 - Manual testing of individual scripts (except some bitrotted areas like the random query generator) Change-Id: If209b761290bc7e7c716c312ea757da3e3bca6dc Reviewed-on: http://gerrit.cloudera.org:8080/23468 Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Michael Smith <michael.smith@cloudera.com>	2025-10-22 16:30:17 +00:00
Michael Smith	512a73771f	IMPALA-14452: Fix impala-shell SSL with Python 3.12 Removes deprecated ImpalaHttpClient constructor that supported port and path as it has been deprecated since at least 2020 and appears unused. Removes cert_file and key_file as they were also never used, and if required must now be passed in via ssl_context. Updates TSSLSocket fixes for Thrift 0.16 and Python 3.12. _validate_cert was removed by Thrift 0.16, but everything worked because Thrift used ssl.match_hostname instead. With Python 3.12 ssl.match_hostname no longer exists so we rely on OpenSSL to handle verification with ssl.PROTOCOL_TLS_CLIENT. Only uses ssl.PROTOCOL_TLS_CLIENT when match_hostname is unavailable to avoid changing existing behavior. THRIFT-792 identifies that TSocket suppresses connection errors, where we would otherwise see SSL hostname verification errors like ssl.SSLCertVerificationError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: IP address mismatch, certificate is not valid for '::1'. (_ssl.c:1131) Python 2.7.9 and 3.2 are minimum required versions; both have been EOL for several years. Testing: - ran custom_cluster/{test_client_ssl.py,test_ipv6.py} on Ubuntu 24 with Python 3.12, OpenSSL 3.0.13. - ran custom_cluster/test_client_ssl.py on RHEL 7.9 with Python 2.7.5 and Python 3.6.8, OpenSSL 1.0.2k-fips. - adds test that hostname checking is configured. Change-Id: I046a9010ac4cb1f7d705935054b306cddaf8bdc7 Reviewed-on: http://gerrit.cloudera.org:8080/23519 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>	2025-10-20 09:55:22 +00:00
Steve Carlin	69813a8c40	IMPALA-14464: Calcite planner should allow semi-colon in statement The Calcite planner now handles a sql statement that has a semi-colon at the end. Note that impala-shell doesn't pass the semi-colon into the server. This is only seen with a direct call to the server. Change-Id: Ie690159cd03f28f6b793628aa946292af71b6970 Reviewed-on: http://gerrit.cloudera.org:8080/23517 Reviewed-by: Joe McDonnell <joemcdonnell@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-17 00:59:44 +00:00
Zoltan Borok-Nagy	bfae4d0b32	IMPALA-14496: Impala crashes when it writes multiple delete files per partition in a single DELETE operation Impala crashes when it needs to write multiple delete files per partition in a single DELETE operation. It is because IcebergBufferedDeleteSink has its own DmlExecState object, but sometimes the methods in TableSinkBase use the RuntimeState's DmlExecState object. I.e. it can happen that we add a partition to the IcebergBufferedDeleteSink's DmlExecState, but later we expect to find it in the RuntimeState's DmlExecState. This patch adds new methods to TableSinkBase that are specific for writing delete files, and they always take a DmlExecState object as a parameter. They are now used by IcebergBufferedDeleteSink. Testing * added e2e tests Change-Id: I46266007a6356e9ff3b63369dd855aff1396bb72 Reviewed-on: http://gerrit.cloudera.org:8080/23537 Reviewed-by: Mihaly Szjatinya <mszjat@pm.me> Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-15 19:58:37 +00:00
jasonmfehr	c0b3580754	IMPALA-14372: Output OpenTelemetry SDK Logs to Impala Logs Emits log messages from the OpenTelemetry SDK to the Impalad DEBUG, INFO, WARNING, and ERROR logs. Previously, these SDK log messages were dropped. Modifies the function of the 'otel_debug' startup flag. This flag defaults to 'false' which causes log messages from the SDK to be dropped. When set to 'true', log messages from the OpenTelemetry SDK will be sent to the Impala logging system. The overall glog level is applied to all messages sent from the OpenTelemetry SDK, thus DEBUG SDK logs will not appear in the Impalad logs unless the glog level is greater than or equal to 2. When a trace is successfully sent to the OpenTelemetry collector, zero log lines are generated. When a trace cannot be sent, local testing showed 12 lines with a total size around 3k were written between the impalad.ERROR and impalad.WARNING log files. The request body is not included in these log messages unless the glog level is greater than or equal to 2 thus log message size will not grow or shrink based on the size of the trace(s). This patch also removes the completely useless 'LoggingInstrumentation' class. Previously, the 'otel_debug' flag caused this class to log messages, but those messages provided no insightful information. Generated-by: Github Copilot (Claude Sonnet 3.7) Change-Id: I41aba21f46233e6430eede9606be1e791071717a Reviewed-on: http://gerrit.cloudera.org:8080/23418 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-10-08 00:00:01 +00:00
Venu Reddy	ebbc67cf40	IMPALA-13801: Support greatest synced event with hierarchical metastore event processing It is a follow-up jira/commit to IMPALA-12709. IMPALA-12152 and IMPALA-12785 are affected when hierarchical metastore event processing feature is enabled. Following changes are incorporated with this patch: 1. Added creationTime_ and dispatchTime_ fields in MetastoreEvent class to store the current time in millisec. They are used to calculate: a) Event dispatch time(time between a MetastoreEvent object creation and when event is moved to inProgressLog_ of EventExecutorService after dispatching it to a DbEventExecutor). b) Event schedule delays incurred at DbEventExecutors and TableEventExecutors(time between an event moved to EventExecutorService's inProgressLog_ and before start of processing event at appropriate DbEventExecutor and TableEventExecutor). c) Event process time from EventExecutorService point of view(time spent in inProgressLog_ before it is moved to processedLog_). Logs are added to show the event dispatch time, schedule delays, process time from EventExecutorService point of view for each event. Also a log is added to show the time taken for event's processIfEnabled(). 2. Added isDelimiter_ field in MetastoreEvent class to indicate whether it is a delimiter event. It is set only when hierarchical event processing is enabled. Delimiter is a kind of metastore event that do not require event processing. Delimeter event can be: a) A CommitTxnEvent that do not have any write event info for a given transaction. b) An AbortTxnEvent that do not have write ids for a given transaction. c) An IgnoredEvent. An event is determined and marked as delimiter in EventExecutorService#dispatch(). They are not queued to a DbEventExecutor for processing. They are just maintained in the inProgressLog_ to preserve continuity and correctness in synchronization tracking. The delimiter events are removed from inProgressLog_ when their preceding non-delimiter metastore event is removed from inProgressLog_. 3. Greatest synced event id is computed based on the dispatched events(inProgressLog_) and processed events(processedLog_) tree maps. Greatest synced event is the latest event such that all events with id less than or equal to the latest event are definitely synced. 4. Lag is calculated as difference between latest event time on HMS and the greatest synced event time. It is shown in the log. 5. Greatest synced event id is used in IMPALA-12152 changes. When greatest synced event id becomes greater than or equal to waitForEventId, all the required events are definitely synced. 6. Event processor is paused gracefully when paused with command in IMPALA-12785. This ensures that all the fetched events from HMS in current batch are processed before the event processor is fully paused. It is necessary to process the current batch of events because, certain events like AllocWriteIdEvent, AbortTxnEvent and CommitTxnEvent update table write ids in catalog upon metastore event object creation. And the table write ids are later updated to appropriate table object during their event process. Can lead to inconsistent state of write ids on table objects when paused abruptly in the middle of current batch of event processing. 7. Added greatest synced event id and event time in events processor metrics. And updated description of lag, pending events, last synced event id and event time metrics. 8. Atomically update the event queue and increment outstanding event count in enqueue methods of both DbProcessor and TableProcessor so that respective process methods do not process the event until event is added to queue and outstanding event count is incremented. Otherwise, event can get processed, outstanding event count gets decremented before it is incremented in enqueue method. 9. Refactored DbEventExecutor, DbProcessor, TableEventExecutor and TableProcessor classes to propapage the exception occurred along with event during event processing. EventProcessException is a wrapper added to hold reference to event being processed and exception occurred. 10.Added AcidTableWriteInfo helper class to store table, writeids and partitions for the transaction id received in CommitTxnEvent. Testing: - Added new tests and executed existing end to end tests. - Have executed the existing tests with hierarchical event processing enabled. Change-Id: I26240f36aaf85125428dc39a66a2a1e4d3197e85 Reviewed-on: http://gerrit.cloudera.org:8080/22997 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>	2025-09-26 10:53:46 +00:00
Joe McDonnell	48b38810e8	IMPALA-14465: Unset HEAPCHECK when custom cluster tests restart Kudu Custom cluster tests like TestKuduHMSIntegration restart the Kudu service with custom startup flags. On Redhat8 ARM64, these tests have been failing due to Kudu being unresponsive after this restart. Debugging showed that Kudu was stuck early in startup. This only reproduced via the custom cluster tests and never via regular minicluster startup. When custom cluster tests restart Kudu, the script to restart Kudu inherits environment variables from the test runner. It turns out that the HEAPCHECK environment variable (even when empty) causes Kudu to get stuck during startup on Redhat8 ARM64 after the recent toolchain update. As a short-term fix, this unsets HEAPCHECK when restarting the Kudu service for these tests. There will need to be further investigation / cleanup beyond this. Testing: - Ran the Kudu custom cluster tests on Redhat8 ARM64 and on Ubuntu 20 x86_64 Change-Id: I51513e194d9e605df199672231b412fae40343af Reviewed-on: http://gerrit.cloudera.org:8080/23467 Reviewed-by: Riza Suminto <riza.suminto@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-26 04:45:22 +00:00
Joe McDonnell	775f73f03e	IMPALA-14462: Fix tie-breaking for sorting scan ranges oldest to newest TestTupleCacheFullCluster.test_scan_range_distributed is flaky on s3 builds. The addition of a single file is changing scheduling significantly even with scan ranges sorted oldest to newest. This is because modification times on S3 have a granularity of one second. Multiple files have the same modification time, and the fix for IMPALA-13548 did not properly break ties for sorting. This adds logic to break ties for files with the same modification time. It compares the path (absolute path or relative path + partition) as well as the offset within the file. These should be enough to break all conceivable ties, as it is not possible to have two scan ranges with the same file at the same offset. In debug builds, this does additional validation to make sure that when a != b, comp(a, b) != comp(b, a). The test requires that adding a single file to the table changes exactly one cache key. If that final file has the same modification time as an existing file, scheduling may still mix up the files and change more than one cache key, even with tie-breaking. This adds a sleep just before generating the final file to guarantee that it gets a newer modification time. Testing: - Ran TestTupleCacheFullCluster.test_scan_range_distributed for 15 iterations on S3 Change-Id: I3f2e40d3f975ee370c659939da0374675a28cd38 Reviewed-on: http://gerrit.cloudera.org:8080/23458 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Michael Smith <michael.smith@cloudera.com> Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>	2025-09-25 23:30:28 +00:00
Joe McDonnell	e05d92cb3d	IMPALA-13548: Schedule scan ranges oldest to newest for tuple caching Scheduling does not sort scan ranges by modification time. When a new file is added to a table, its order in the list of scan ranges is not based on modification time. Instead, it is based on which partition it belongs to and what its filename is. A new file that is added early in the list of scan ranges can cause cascading differences in scheduling. For tuple caching, this means that multiple runtime cache keys could change due to adding a single file. To minimize that disruption, this adds the ability to sort the scan ranges by modification time and schedule scan ranges oldest to newest. This enables it for scan nodes that feed into tuple cache nodes (similar to deterministic scan range assignment). Testing: - Modified TestTupleCacheFullCluster::test_scan_range_distributed to have stricter checks about how many cache keys change after an insert (only one should change) - Modified TupleCacheTest#testDeterministicScheduling to verify that oldest to newest scheduling is also enabled. Change-Id: Ia4108c7a00c6acf8bbfc036b2b76e7c02ae44d47 Reviewed-on: http://gerrit.cloudera.org:8080/23228 Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-22 20:51:46 +00:00
Peter Rozsa	b0f1d49042	IMPALA-14016: Add multi-catalog support for local catalog mode This patch adds a new MetaProvider called MultiMetaProvider, which is capable of handling multiple MetaProviders at once, prioritizing one primary provider over multiple secondary providers. The primary provider handles some methods exclusively for deterministic behavior. In database listings, if one database name occurs multiple times the contained tables are merged under that database name; if the two separate databases contain a table with the same name, the query analyzation fails with an error. This change also modifies the local catalog implementation's initialization. If catalogd is deployed, then it instantiates the CatalogdMetaProvider and checks if the catalog configuration directory is set as a backend flag. If it's set, then it tries to load every configuration from the folder, and tries to instantiate the IcebergMetaProvider from those configs. If the instantiation fails, an error is reported to the logs, but the startup is not interrupted. Tests: - E2E tests for multi-catalog behavior - Unit test for ConfigLoader Change-Id: Ifbdd0f7085345e7954d9f6f264202699182dd1e1 Reviewed-on: http://gerrit.cloudera.org:8080/22878 Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>	2025-09-19 15:03:59 +00:00
Joe McDonnell	ca356a8df5	IMPALA-13437 (part 2): Implement cost-based tuple cache placement This changes the default behavior of the tuple cache to consider cost when placing the TupleCacheNodes. It tries to pick the best locations within a budget. First, it eliminates unprofitable locations via a threshold. Next, it ranks the remaining locations by their profitability. Finally, it picks the best locations in rank order until it reaches the budget. The threshold is based on the ratio of processing cost for regular execution versus the processing cost for reading from the cache. If the ratio is below the threshold, the location is eliminated. The threshold is specified by the tuple_cache_required_cost_reduction_factor query option. This defaults to 3.0, which means that the cost of reading from the cache must be less than 1/3 the cost of computing the value normally. A higher value makes this more restrictive about caching locations, which pushes in the direction of lower overhead. The ranking is based on the cost reduction per byte. This is given by the formula: (regular processing cost - cost to read from cache) / estimated serialized size This prefers locations with small results or high reduction in cost. The budget is based on the estimated serialized size per node. This limits the total caching that a query will do. A higher value allows more caching, which can increase the overhead on the first run of a query. A lower value is less aggressive and can limit the overhead at the expense of less caching. This uses a per-node limit as the limit should scale based on the size of the executor group as each executor brings extra capacity. The budget is specified by the tuple_cache_budget_bytes_per_executor. The old behavior to place the tuple cache at all eligible locations is still available via the tuple_cache_placement_policy query option. The default is the cost_based policy described above, but the old behavior is available via the all_eligible policy. This is useful for correctness testing (and the existing tuple cache test cases). This changes the explain plan output: - The hash trace is only enabled at VERBOSE level. This means that the regular profile will not contain the hash trace, as the regular profile uses EXTENDED. - This adds additional information at VERBOSE to display the cost information for each plan node. This can help trace why a particular location was not picked. Testing: - This adds a TPC-DS planner test with tuple caching enabled (based on the existing TpcdsCpuCostPlannerTest) - This modifies existing tests to adapt to changes in the explain plan output Change-Id: Ifc6e7b95621a7937d892511dc879bf7c8da07cdc Reviewed-on: http://gerrit.cloudera.org:8080/23219 Reviewed-by: Michael Smith <michael.smith@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-18 21:02:51 +00:00
stiga-huang	0b619962e6	IMPALA-14011: Skip test_no_hms_event_incremental_refresh_transactional_table on new Hive versions The feature of hms_event_incremental_refresh_transactional_table is already mature that it has been enabled for years. We'd like to deprecate the feature of turning it off. However, for older Hive versions like Apache Hive 3 that don't provide sufficient APIs for Impala to process COMMIT_TXN events, users can still turn this off. This patch skips test_no_hms_event_incremental_refresh_transactional_table when running on CDP Hive. To run the test on Apache Hive 3, adjust the test to create ACID table using tblproperties instead of "create transactional table" statement. Tests: - Ran the test on CDP Hive and Apache Hive 3. Change-Id: I93379e5331072bec1d3a4769f7d7ab59431478ee Reviewed-on: http://gerrit.cloudera.org:8080/23435 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-17 20:11:33 +00:00
stiga-huang	3c38888dfb	IMPALA-14441: Skip tests depending on unsupported HMS APIs for Apache Hive 3 Some tests import new API objects that are missing in Apache Hive 3. This patch skips these import statements and related tests when running on Apache Hive 3. Change-Id: I9053b3b9d507a038a018f242628e344a528b5af0 Reviewed-on: http://gerrit.cloudera.org:8080/23428 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-17 14:12:13 +00:00
Sai Hemanth Gantasala	46525bcd7c	IMPALA-14082: Support batch processing of RELOAD events on same table Currently, RELOAD events of partitioned table are processed one after the other. Processing them one by one acquires the table lock multiple times to load individual partitions in sequence. This also keeps the table version changing which impacts performance of coordinators in local-catalog mode - query planning needs retry to handle InconsistentMetadataFetchException due to table version changes. This patch handles the batch processing logic RELOAD events on same table by reusing the exisiting logic of BatchPartitionEvent. This implementation adds four new methods canBeBatched(),addToBatchEvents(), getPartitionForBatching(), getBatchEventType()(pre-requisites to reuse batching logic) to the RELOAD event class. Testing: - Added an end-to-end to verify the batching. Change-Id: Ie3e9a99b666a1c928ac2a136bded1e5420f77dab Reviewed-on: http://gerrit.cloudera.org:8080/23159 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-17 12:50:57 +00:00
jasonmfehr	0fe8de0f3f	IMPALA-14401: Deflake/Improve OpenTelemetry Tracing Tests Contains the following improvements to the Impala queries as OpenTelemetry traces custom cluster tests: 1. Supporting code for asserting traces was moved to 'tests/util/otel_trace.py'. The moved code was modified to remove all references to 'self'. Since this code used 'self.assert_impalad_log_contains', it had to be modified so the caller provides the correct log file path to search. The '__find_span_log' function was updated to call a new generic file grep function to run the necessary log file search regex. All other code was moved unmodified. 2. Classes 'TestOtelTraceSelectsDMLs' and 'TestOtelTraceDDLs' contained a total of 11 individual tests that used the 'unique_database' fixture. When this fixture is used in a test, it results in two DDLs being run before the test to drop/create the database and one DDL being run after the test to drop the database. These classes now create a test database once during 'setup_class' and drop it once during 'teardown_class' because creating a new database for each test was unnecessary. This change dropped test execution time from about 97 seconds to about 77 seconds. 3. Each test now has comments describing what the test is asserting. 4. The unnecessary sleep in 'test_query_exec_fail' was removed saving five seconds of test execution time. 5. New test 'test_dml_insert_fail' added. Previously, the situation where an insert DML failed was not tested. The test passed without any changes to backend code. 6. Test 'test_ddl_createtable_fail' is greatly simplified by using a debug action to fail the query instead of multiple parallel queries where one dropped the database the other was inserting into. The simplified setup eliminated test flakiness caused by timing differences and sped up test execution by about 5 seconds. 7. Fixed test flakiness was caused by timing issues. Depending on when the close process was initiated, span events are sometimes in the QueryExecution span and sometimes in the Close span. Test assertions cannot handle these situations. All span event assertions for the Close span were removed. IMPALA-14334 will fix these assertions. 8. The function 'query_id_from_ui' which retrieves the query profile using the Impala debug ui now makes multiple attempts to retrieve the query. In slower test situations, such as ASAN, the query may not yet be available when the function is called initially which used to cause tests to fail. This test flakiness is now eliminated through the addition of the retries. Testing accomplished by running tests in test_otel_trace.py both locally and in a full Jenkins build. Generated-by: Github Copilot (Claude Sonnet 3.7) Change-Id: I0c3e0075df688c7ae601c6f2e5743f56d6db100e Reviewed-on: http://gerrit.cloudera.org:8080/23385 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>	2025-09-15 23:21:29 +00:00

1 2 3 4 5 ...

3566 Commits