impala/testdata/bin at 5eea4f6f797e5549623a4dc4891ec82af08a41ac

jprdonnelly/impala

mirror of https://github.com/apache/impala.git synced 2025-12-30 03:01:44 -05:00

Michael Smith 166b39547e IMPALA-14553: Run schema eval concurrently

The majority of time spent in generate-schema-statements.py is in
eval_section for schema operations that shell out, often uploading files
via the hadoop CLI or generating data files. These operations should be
independent.

Runs eval_section once at the beginning, so it is not repeated for each
row in test_vectors, and executes the calls in parallel via a ThreadPool.
Defaults to NUM_CONCURRENT_TESTS threads because the underlying operations
have some concurrency to them (such as HDFS mirroring writes).

Also collects existing tables into a set to optimize lookup.
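
A minimal sketch of the approach described above, not the actual code from
generate-schema-statements.py. The eval_section body here is a hypothetical
stand-in (the real one may shell out), and the helper names, the sections
dict shape, and the fallback of 4 threads are assumptions for illustration:

```python
import os
from multiprocessing.pool import ThreadPool


def eval_section(section):
    """Hypothetical stand-in for the real eval_section, which may shell out."""
    return section.strip().upper()


def eval_sections_concurrently(sections, num_threads=None):
    """Evaluate every section once, in parallel, and return {name: result}.

    `sections` is assumed to be a dict of section name -> raw section text.
    """
    if num_threads is None:
        # Assumption: NUM_CONCURRENT_TESTS is read from the environment,
        # with an arbitrary fallback of 4 when it is unset.
        num_threads = int(os.environ.get("NUM_CONCURRENT_TESTS", "4"))
    names = list(sections)
    pool = ThreadPool(processes=num_threads)
    try:
        # map() preserves input order, so results line up with names.
        results = pool.map(eval_section, [sections[n] for n in names])
    finally:
        pool.close()
        pool.join()
    return dict(zip(names, results))


def tables_to_create(candidates, existing_tables):
    """Filter out tables that already exist, using a set for O(1) lookups."""
    existing = set(existing_tables)
    return [t for t in candidates if t not in existing]
```

Evaluating up front means the per-row loop over test_vectors only reads
precomputed results, and threads suit this workload because the time is
spent waiting on external commands rather than in Python itself.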

Reduces generate-schema-statements runtime by ~60%, from 2m30s to 1m.
Confirmed that the contents of logs/data_loading/sql/functional are
identical.

Change-Id: I2a78d05fd6a0005c83561978713237da2dde6af2
Reviewed-on: http://gerrit.cloudera.org:8080/23627
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>

2025-11-17 16:34:22 +00:00

minicluster_lakekeeper

IMPALA-14018: Configure OAUTH2 with Lakekeeper and fix Impala's config handling

2025-09-08 13:43:28 +00:00

minicluster_trino

IMPALA-14018: Configure OAUTH2 with Lakekeeper and fix Impala's config handling

2025-09-08 13:43:28 +00:00

IMPALA-13237: [Patch 4 of 5] - Helpers to Visualize OpenTelemetry Traces

2025-07-18 01:33:57 +00:00

build-trino-docker-image.sh

IMPALA-12414: Add scripts to run Trino in the dev environment

2023-09-13 22:07:13 +00:00

check-hbase-nodes.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

check-schema-diff.sh

IMPALA-7399: Emit a junit xml report when trapping errors

2018-08-23 18:33:58 +00:00

clean-mysql-env.sh

IMPALA-12925: Fix decimal data type for external JDBC table

2024-04-05 09:16:53 +00:00

compute-table-stats.sh

IMPALA-13620: Refresh compute_table_stats.py script

2025-01-08 07:49:31 +00:00

copy-ext-data-sources.sh

IMPALA-12378: Auto Ship JDBC Data Source

2024-02-07 16:29:11 +00:00

copy-udfs-udas.sh

IMPALA-11528: Catalogd should start up with a corrupt Hive function.

2022-09-13 14:48:31 +00:00

create-ext-data-source-table.sql.template

IMPALA-14545: Don't use absolute hdfs paths for JDBC table driver.url

2025-11-12 22:17:44 +00:00

create-hbase.sh

IMPALA-3918: Remove Cloudera copyrights and add ASF license header

2016-08-09 08:19:41 +00:00

create-load-data.sh

IMPALA-14545: Don't use absolute hdfs paths for JDBC table driver.url

2025-11-12 22:17:44 +00:00

create-mini.sql

IMPALA-4110: Clean up issues found by Apache RAT.

2016-09-14 22:09:24 +00:00

create-table-many-blocks.sh

IMPALA-7399: Emit a junit xml report when trapping errors

2018-08-23 18:33:58 +00:00

create-tpc-jdbc-tables.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

create-tpcds-testcase-files.sh

IMPALA-11562: Revert support for o3fs as default filesystem

2022-09-28 22:35:48 +00:00

download-impala-jdbc-driver.sh

IMPALA-12502: Support Impala to Impala federation

2023-12-22 21:44:49 +00:00

generate-block-ids.sh

Expose $IMPALA_MAVEN_OPTIONS for configuring Maven.

2017-11-14 01:29:56 +00:00

generate-load-nested.sh

IMPALA-12570: Add longer strings to tables containing collections

2023-11-22 18:50:22 +00:00

generate-schema-statements.py

IMPALA-14553: Run schema eval concurrently

2025-11-17 16:34:22 +00:00

generate-test-vectors.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

jwt_requirements.txt

IMPALA-11880: Adds support for authenticating to Impala using JWTs.

2023-05-11 23:22:05 +00:00

jwt-generate.sh

IMPALA-11880: Adds support for authenticating to Impala using JWTs.

2023-05-11 23:22:05 +00:00

jwt-util.py

IMPALA-11880: Adds support for authenticating to Impala using JWTs.

2023-05-11 23:22:05 +00:00

kill-all.sh

IMPALA-14511: Fix pgrep to avoid warning

2025-10-23 22:00:36 +00:00

kill-hbase.sh

IMPALA-11990: Make actual failures clearer

2023-03-10 22:23:18 +00:00

kill-hive-server.sh

IMPALA-11621: Remove hiveserver2.pid when shutting down HiveServer2

2022-09-29 16:11:22 +00:00

kill-java-service.sh

IMPALA-7399: Emit a junit xml report when trapping errors

2018-08-23 18:33:58 +00:00

kill-kudu.sh

IMPALA-12852: Make Kudu service start and stop independent

2024-04-02 08:26:59 +00:00

kill-lakekeeper.sh

IMPALA-14018: Configure OAUTH2 with Lakekeeper and fix Impala's config handling

2025-09-08 13:43:28 +00:00

kill-mini-dfs.sh

IMPALA-3918: Remove Cloudera copyrights and add ASF license header

2016-08-09 08:19:41 +00:00

kill-ranger-server.sh

IMPALA-12198: Create $RANGER_LOG_DIR before stopping in kill-ranger-server.sh

2023-06-10 02:04:46 +00:00

kill-sentry-service.sh

IMPALA-7399: Emit a junit xml report when trapping errors

2018-08-23 18:33:58 +00:00

kill-trino.sh

IMPALA-12414: Add scripts to run Trino in the dev environment

2023-09-13 22:07:13 +00:00

load_nested.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

load-dependent-tables-hive2.sql

IMPALA-8369 (part 4): Hive 3: fixes for functional dataset loading

2019-05-15 11:00:45 +00:00

load-dependent-tables.sql

IMPALA-13284: Loading test data on Apache Hive3

2024-08-20 07:01:21 +00:00

load-ext-data-sources.sh

IMPALA-14005: Support for quoted reserved words column names

2025-08-12 15:01:13 +00:00

load-metastore-snapshot.sh

IMPALA-11562: Revert support for o3fs as default filesystem

2022-09-28 22:35:48 +00:00

load-test-warehouse-snapshot.sh

IMPALA-12967, IMPALA-13059, IMPALA-13144, IMPALA-13195: test_migrated_table_field_id_resolution fails in exhaustive mode

2024-10-16 14:13:25 +00:00

load-tpc-kudu.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

minikdc_env.sh

IMPALA-9361: manually configured kerberized minicluster

2020-02-08 05:16:12 +00:00

patch_hive.sh

IMPALA-13284: Loading test data on Apache Hive3

2024-08-20 07:01:21 +00:00

random_avro_schema.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

README-BENCHMARK-TEST-GENERATION

…

restore-stats-on-planner-tests.py

IMPALA-2945: Account for duplicate keys on multiple nodes preAgg

2025-01-17 20:22:03 +00:00

rewrite-iceberg-metadata.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

run-all.sh

IMPALA-12852: Make Kudu service start and stop independent

2024-04-02 08:26:59 +00:00

run-hbase.sh

IMPALA-9361: manually configured kerberized minicluster

2020-02-08 05:16:12 +00:00

run-hive-server.sh

IMPALA-13920: Allow running minicluster with Java 17

2025-04-04 17:50:01 +00:00

run-iceberg-rest-server.sh

IMPALA-14481: Use $JAVA instead of java in run-iceberg-rest-server.sh

2025-10-14 21:35:47 +00:00

run-kudu.sh

IMPALA-12852: Make Kudu service start and stop independent

2024-04-02 08:26:59 +00:00

run-lakekeeper.sh

IMPALA-14018: Configure OAUTH2 with Lakekeeper and fix Impala's config handling

2025-09-08 13:43:28 +00:00

run-mini-dfs.sh

IMPALA-13920: Allow running minicluster with Java 17

2025-04-04 17:50:01 +00:00

run-ranger-server.sh

IMPALA-12188: Avoid unnecessary output from sourcing bin/impala-config.sh

2023-07-14 03:17:47 +00:00

run-step.sh

IMPALA-11341: Print error log files when data-loading fails

2022-06-21 06:51:38 +00:00

run-trino.sh

IMPALA-12414: Add scripts to run Trino in the dev environment

2023-09-13 22:07:13 +00:00

setup-dfs-keys.sh

IMPALA-9448: Use Ozone TDE in minicluster

2022-09-09 02:37:41 +00:00

setup-hdfs-env.sh

IMPALA-9448: Use Ozone TDE in minicluster

2022-09-09 02:37:41 +00:00

setup-mysql-env.sh

IMPALA-14005: Support for quoted reserved words column names

2025-08-12 15:01:13 +00:00

setup-ranger.sh

IMPALA-12921, IMPALA-12985: Support running Impala with locally built Ranger

2024-06-15 10:25:13 +00:00

trino-cli.sh

IMPALA-12414: Add scripts to run Trino in the dev environment

2023-09-13 22:07:13 +00:00

TRINO-README.md

IMPALA-13168: Add README file for setting up Trino

2024-06-25 14:09:06 +00:00

wait-for-hiveserver2.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00

wait-for-metastore.py

IMPALA-14501: Migrate most scripts from impala-python to impala-python3

2025-10-22 16:30:17 +00:00