Commit Graph

655 Commits

Author SHA1 Message Date
Michael Smith
f05eac6476 IMPALA-12602: Unregister queries on idle timeout
Queries cancelled due to idle_query_timeout/QUERY_TIMEOUT_S are now also
Unregistered to free any remaining memory, as you cannot fetch results
from a cancelled query.

Adds a new structure - idle_query_statuses_ - to retain Status messages
for queries closed this way so that we can continue to return a clear
error message if the client returns and requests query status or
attempts to fetch results. This structure must be global because HS2
server can only identify a session ID from a query handle, and the query
handle no longer exists. SessionState tracks queries added to
idle_query_statuses_ so they can be cleared when the session is closed.

Also ensures MarkInactive is called in ClientRequestState when Wait()
completes. Previously WaitInternal would only MarkInactive on success,
leaving any failed requests in an active state until explicitly closed
or the session ended.

The beeswax get_log RPC will not return the preserved error message or
any warnings for these queries. It's also possible the summary and
profile are rotated out of query log as the query is no longer inflight.
This is an acceptable outcome as a client will likely not look for a
log/summary/profile after it times out.

Testing:
- updates test_query_expiration to verify number of waiting queries is
  only non-zero for queries cancelled by EXEC_TIME_LIMIT_S and not yet
  closed as an idle query
- modified test_retry_query_timeout to use exec_time_limit_s because
  queries closed by idle_timeout_s don't work with get_exec_summary

Change-Id: Iacfc285ed3587892c7ec6f7df3b5f71c9e41baf0
Reviewed-on: http://gerrit.cloudera.org:8080/21074
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-04-03 03:25:10 +00:00
jasonmfehr
3e4fdeece1 IMPALA-12824: Removes the prettyprint_duration Built-in Function
The prettyprint_duration function was originally
implemented in IMPALA-12824 to work with the workload
management tables which stored durations in integer
nanoseconds. These tables have changed to store decimal
seconds.

The prettyprint_duration function would have required a
large investment of time to make it work with decimal
values, and since the new format is more human readable
anyways, this function has been removed.

Change-Id: If2154c2ed9a7217ed4b7587adeae87df55ff03dc
Reviewed-on: http://gerrit.cloudera.org:8080/21208
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-03-28 06:58:56 +00:00
Saurabh Katiyal
eb2939245f IMPALA-12693: [DOCS] Typo in link for ltrim in string functions docs
Fixed documentation typo for LTRIM string function, from LTRI to LTRIM.

Change-Id: If4345fc6d19f04d0c0c6feef3e0c8598271224fe
Reviewed-on: http://gerrit.cloudera.org:8080/21123
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-03-13 09:20:16 +00:00
Noemi Pap-Takacs
70c35425d3 IMPALA-12774: [DOCS] Document ALTER TABLE SORT BY syntax
Extended the ALTER TABLE documentation with the SORT BY clause.
Also added more information about the available and the deafult
sort orders to the CREATE TABLE description.

Testing: Built docs locally.

Change-Id: Ieb348d8395a6140f0be200d73e2f22fded9a5116
Reviewed-on: http://gerrit.cloudera.org:8080/21083
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-03-11 10:10:43 +00:00
Anshula Jain
ca3fe6d6af IMPALA-12692 : [DOCS] Typo in docs about random() function
Changed name of random fucntion in impala_math_functions.xml
from "RANDOME(), RANDOME(BIGINT seed)" to "RANDOM(), RANDOM(BIGINT seed)"

Change-Id: I4844eb8d155326081c385d88b98a591dbbde7369
Reviewed-on: http://gerrit.cloudera.org:8080/21126
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
2024-03-08 16:34:50 +00:00
Michael Smith
b81368a225 IMPALA-12858: [DOCS] Correct idle_client_poll_period_s docs
Correctly refer to idle_client_poll_period_s in documentation.

Change-Id: Ib89c8e3877bed508f6ba18483e48b0a4b4bd5cce
Reviewed-on: http://gerrit.cloudera.org:8080/21092
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
2024-02-29 19:40:17 +00:00
Riza Suminto
f5c12c65db IMPALA-12801: Increase query_log_ default size and bound its memory.
Coordinator's /queries page is useful to show information about recently
run and completed queries. Having more entries will be helpful to
inspect queries that completed further back. The maximum entry of this
table is controlled by 'query_log_size' flag. Higher value means more
queries to keep, but it also cost more memory overhead in coordinator.

This patch increase 'query_log_size' default value from 100 to 200. This
patch also add flag 'query_log_size_in_bytes' (default to 2GB) as an
additional safeguard to evict entry from query_log_ when this limit
exceeded, preventing query_log_ total memory to grow prohibitively
large. 'query_log_size_in_bytes' is used in combination with
'query_log_size' to limit the number of QueryStateRecord to retain in
query_log_, whichever is less.

Testing:
- Pass exhaustive tests.

Change-Id: I107e2c2c7f2b239557be37360e8eecf5479e8602
Reviewed-on: http://gerrit.cloudera.org:8080/21020
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-02-23 21:01:08 +00:00
jasonmfehr
d03ffc70f2 IMPALA-12824: Adds built-in functions prettyprint_duration and prettyprint_bytes.
The prettyprint_duration function takes an integer input containing a
number of nanoseconds and returns a human readable value breaking down
the input by hours, minutes, seconds, milliseconds, microseconds, and
nanoseconds.

The prettyprint_bytes function takes an integer input containing a
number of bytes and returns a human readable values breaking down the
input by gigabytes, megabytes, kilobytes, and bytes.

Functionality tests were added to the existing expr-test suite that
tests built-in functions.

Functional-query workloads were added in two new .test files under the
testdata directory to exercise these two new functions. Corresponding
pytests were added to run the tests in these new .test files.

Benchmarks were added to expr-benchmark, and new benchmarks were
generated with a release build running on a machine with the cpu
Intel(R) Core(TM) i7-10700 CPU @ 2.90GHz.

Documentation was added to the built-in string functions docs.

Change-Id: I3e76632ce21ad2ca5df474160338699a542a6913
Reviewed-on: http://gerrit.cloudera.org:8080/21038
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-02-21 04:23:28 +00:00
pranavyl
ab445195b0 IMPALA-12756: [DOCS] Unicode column name support documentation
The patch focuses on documenting that Impala supports unicode
column names, consistent with Hive's current support (as we use
Hive MetaStore to store table metadata).

Change-Id: I3d43d942a3ea069020f06adab6ea77e62ad5ffbe
Reviewed-on: http://gerrit.cloudera.org:8080/20950
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-01-30 14:00:31 +00:00
Zoltan Borok-Nagy
dea8546d80 IMPALA-12653: Update documentation about the UPDATE statement
This patch adds documentation about the UPDATE statement.

Change-Id: I2a4f3dcdba5faaa7dffda60b8590d09e6a92a165
Reviewed-on: http://gerrit.cloudera.org:8080/20818
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Andrew Sherman <asherman@cloudera.com>
2024-01-02 10:49:13 +00:00
Riddhi Jain
d01d028b07 IMPALA-11762: [DOCS] Reserved words documentation lags behind the code
Crosschecked keywordMap from:
https://github.com/apache/impala/blob/master/fe/src/main/jflex/sql-scanner.flex
with upstream docs:
https://impala.apache.org/docs/build/html/topics/impala_reserved_words.html#reserved_words

Added following Keywords missing from docs:
buckets
disable
enable
hudiparquet
jsonfile
lexical
managedlocation
minus
non
norely
novalidate
optimize
orc
rely
rwstorage
selectivity
sets
spec
storagehandler_uri
system_version
unset
user_defined_fn
validate
zorder

Change-Id: I0ae58a4730c2e3d8d82cccdff23c1fff36117522
Reviewed-on: http://gerrit.cloudera.org:8080/20605
Reviewed-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
Tested-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
2023-11-24 01:14:11 +00:00
Shajini Thayasingh
43051237d3 IMPALA-11967: [DOCS] Update Compute Incremental Stats syntax
Updated "compute incremental stats" syntax to support a list of columns.

Change-Id: Id5ad3bdf26572a1d0510df9b41ee1f12ae2cf747
Reviewed-on: http://gerrit.cloudera.org:8080/19602
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2023-11-14 01:15:34 +00:00
Shajini Thayasingh
a01ad35566 IMPALA-12491: [DOCS] Add a note on the cache item
Described how the scan request will access the cache when there is
no change in the mtime in the file metadata.

Change-Id: I508ce667181d635c17373c7336ea9f83984d7641
Reviewed-on: http://gerrit.cloudera.org:8080/20611
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
2023-11-06 18:57:18 +00:00
Tamas Mate
eadd35f6d5 IMPALA-11853: Fix formatted docs query options CSS
The recent documentation formatting changes introduced the navigation
panel on the left. However, due to the length of the query options
navigation title these could overlap with the documentation paragraphs.

This commit removes the underscores from the navigation titles of the
query options, so browsers can break them into multiple lines.
Additionally, the "SET" and "Query Options for the SET Statement" pages
are merged to save some more space for the query option navigation
titles.

Testing:
 - Built the documentation and tested manually

Change-Id: Icec787d7a2af848aaaff65be2ecf311a5ce8fe7f
Reviewed-on: http://gerrit.cloudera.org:8080/20556
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Jason Fehr <jfehr@cloudera.com>
Reviewed-by: Peter Rozsa <prozsa@cloudera.com>
Reviewed-by: Tamas Mate <tmater@apache.org>
2023-10-18 10:22:05 +00:00
Abhishek Rawat
05bc485851 IMPALA-10860: Allow setting mem_limit for coordinators
Added support for MEM_LIMIT_COORDINATORS query option. This is
similar to exisiting MEM_LIMIT_EXECUTORS, but applies to coordinators.
There are cases where Planner generates inaccurate estimates for
coordinator fragments and would be good to be able to set mem limit
just for the coordinator, since a query's memory requirement on
coordinator tends to be much lower compared to that on executors.

If MEM_LIMIT is set, then MEM_LIMIT_COORDINATORS is ignored.

Also updated the documentation for the new query option.

Testing:
- Added new custom cluster tests which validates MEM_LIMIT_COORDINATORS
applies only on coordinator. The test also validates that both
MEM_LIMIT_EXECUTORS and MEM_LIMIT_COORDINATORS can be set together.
- Built docs and made sure that the new changes have proper formatting.

Change-Id: I2dfc9a735e82dce2fd903bdaf6bc2e46e982ef8c
Reviewed-on: http://gerrit.cloudera.org:8080/20378
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-23 01:55:00 +00:00
Zoltan Borok-Nagy
12276c79f9 IMPALA-12335: [DOCS] Add documentation about the DELETE statement
IMPALA-11877 added support for the DELETE statement for Iceberg
tables. This patch documents this feature.

Change-Id: If111a7ecd20bda2d4928332ef2ccd905814cb203
Reviewed-on: http://gerrit.cloudera.org:8080/20361
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-17 15:44:06 +00:00
Zoltan Borok-Nagy
240a9cddc2 [docs] Add note about REFRESH for tables in custom catalog
This patch adds a small note about Iceberg tables in custom catalog.

Change-Id: I909603dff20fc4fefde48539d301844d40167a57
Reviewed-on: http://gerrit.cloudera.org:8080/20328
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Andrew Sherman <asherman@cloudera.com>
2023-08-15 20:27:21 +00:00
Andrew Sherman
a24897269d IMPALA-12332: Undo IMPALA-8615 as the corresponding configs have been removed
IMPALA-8615 documented the changes made in IMPALA-8536, but the configs
were subsequently removed in IMPALA-9077. Rollback IMPALA-8615 to bring
the docs up to date.

Revert "IMPALA-8615: [DOCS] Document the scalable admission control parameters"
This was a clean revert, and there were no overlapping changes to this file.

TESTING:
- built docs and reviewed the file.

This reverts commit b2136c39fc.

Change-Id: Ibc856c62babb4b305b6a7c286a0f4c86e6e418cc
Reviewed-on: http://gerrit.cloudera.org:8080/20308
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-03 16:08:03 +00:00
whsyourdaddy
bb237660a0 IMPALA-12047: [DOCS] Fix a URL mistake when using "git clone" in docs/README.md
There's a URL mistake when using "git clone" in docs/README.md:
  git clone https://gitbox.apache.org/repos/asf/impala.git/docs

This doesn't work and it will prompt "repository not found".

This change corrects the description, providing two ways to download
the docs - either by downloading the whole repository and going to the
docs/ directory or by downloading only the docs using git sparse-checkout.

Change-Id: Ib00c37e28e67cca5b3630742b4c366dea4e967b7
Reviewed-on: http://gerrit.cloudera.org:8080/19634
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Reviewed-by: Yingchun Lai <laiyingchun@apache.org>
2023-04-17 08:55:03 +00:00
Noemi Pap-Takacs
b8915c9a4d IMPALA-11878: Document that AVRO file format is supported in Iceberg tables
Change-Id: I0e3fdade3a12570e488a3a58819697134dbb4c15
Reviewed-on: http://gerrit.cloudera.org:8080/19680
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Tamas Mate <tmater@apache.org>
2023-04-17 08:26:34 +00:00
Shajini Thayasingh
c353d69cbd IMPALA-11985: [DOCS] Support for Kudu's multi-rows transaction
Fixed some typos and made final changes.
Clarified some questions that were raised as comments.
Incorporated some minor comments.
Documented the support for Kudu's multi-rows transaction.

Change-Id: Ic226679d83d7221f843994ead11cb2bc9e971882
Reviewed-on: http://gerrit.cloudera.org:8080/19651
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Alexey Serbin <alexey@apache.org>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-31 23:29:23 +00:00
Shajini Thayasingh
19b5d781f4 IMPALA-11987: [DOCS] Document the mem_limit_executors query option
Document the difference between mem_limit and mem_limit_executors
query options.

Change-Id: Ibce8347d564afde0f9b9e83df560454f4b8b5c9d
Reviewed-on: http://gerrit.cloudera.org:8080/19604
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-22 15:46:22 +00:00
Shajini Thayasingh
8edfab000e IMPALA-12010: [DOCS] Document the support for non-unique primary key
Incorporated new comments.
Made changes to address new comments posted after the patch was merged.

Change-Id: Ifaea9645cf771b76fda3083d73b14524f53eb186
Reviewed-on: http://gerrit.cloudera.org:8080/19636
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Abhishek Chennaka <achennaka@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-21 20:56:27 +00:00
Andrew Sherman
b969a1fc6a IMPALA-11596: [DOCS] Document EXECUTE ROLLBACK functionality
Since IMPALA-11482 Impala supports the 'Alter Table Execute Rollback'
statement for Iceberg tables. Update the docs to cover this change.

The section on DESCRIBE HSTORY is expanded to include the output
columns, as this information is relevant to EXECUTE ROLLBACK.
The section on Cloning Iceberg tables is moved so that the sections
concerned with table history are adjacent.

TESTING:
- Built docs locally.

Change-Id: I0e1690378e560197263c49f468618b1ded922df3
Reviewed-on: http://gerrit.cloudera.org:8080/19606
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2023-03-21 14:28:03 +00:00
Michael Smith
39fea06f2b IMPALA-11990: Make actual failures clearer
A hack to cleanup after Hbase fails when services haven't been started
yet (which is always at least once in a CI run) with a large error
message. That error isn't useful and can be misleading for people
reviewing test logs. Suppress it.

Guards data load for Ozone as a usable snapshot is required. Also fixes
a typo in fixed issues.

Change-Id: Idc37d03780fca35427b977524b2b97a6892c88f7
Reviewed-on: http://gerrit.cloudera.org:8080/19459
Reviewed-by: Gabor Kaszab <gaborkaszab@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
2023-03-10 22:23:18 +00:00
Shajini Thayasingh
27e238c8cf IMPALA-11906: [DOCS] Document the support for non-unique primary key
Incorporated the comments received.
Added a new sub-section.
Change-Id: I7b5a452f2199d097077150c012497aa4a3ecf7d9
Reviewed-on: http://gerrit.cloudera.org:8080/19587
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Abhishek Chennaka <achennaka@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-08 17:35:52 +00:00
Michael Smith
1321b5ce54 IMPALA-11920: [DOCS] Cleanup and update spill examples
Updates documentation to include examples with service identifier. Also
fixes inconsistent use of ASCII quotes for example text, highlighting
code and variable names, and normalizes descriptions between
S3/HDFS/Ozone. Removes "priority" from remote descriptions as it is
optional and does nothing.

Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Reviewed-on: http://gerrit.cloudera.org:8080/19504
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
2023-03-02 17:25:55 +00:00
Riza Suminto
fa64be7cc7 IMPALA-11940: [DOCS] Document manifest caching settings for Iceberg
IMPALA-11658 implements Iceberg manifest caching for Impala. This patch
adds documentation for configuring the cache(s).

Testing:
- Built docs locally

Change-Id: Idd761a81f5c81a25a5ec0889402f85157c23e9fe
Reviewed-on: http://gerrit.cloudera.org:8080/19530
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2023-02-24 17:16:34 +00:00
Shajini Thayasingh
7dcf80b32e IMPALA-10804: [DOCS] Document spill to remote storage
Spill to HDFS, S3, and Ozone.

Change-Id: I3efb2ffcc06cdbe69845c6dc4cf03d9f2e3dcabc
Reviewed-on: http://gerrit.cloudera.org:8080/19472
Reviewed-by: Yida Wu <wydbaggio000@gmail.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-02-07 22:04:51 +00:00
Michael Smith
88d49b6919 IMPALA-11693: Enable allow_erasure_coded_files by default
Enables allow_erasure_coded_files by default as we've now completed all
planned work to support it.

Testing
- Ran HDFS+EC test suite
- Ran Ozone+EC test suite

Change-Id: I0cfef087f2a7ae0889f47e85c5fab61a795d8fd4
Reviewed-on: http://gerrit.cloudera.org:8080/19362
Reviewed-by: Joe McDonnell <joemcdonnell@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-31 16:53:46 +00:00
Andrew Sherman
0617f2e66f IMPALA-11862: [DOCS] Document the default value of --ssl_cipher_list.
Since IMPALA-11240 the default value of ssl_cipher_list is not empty.
Update the docs to cover this change.

TESTING:
- Built docs locally.

Change-Id: I000fbb5bd37f52b85afe3855852875360b55ccfa
Reviewed-on: http://gerrit.cloudera.org:8080/19447
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-31 00:32:54 +00:00
Michael Smith
23cdc7edeb IMPALA-11867: [DOCS] Document Ozone support
Adds a topic documenting Apache Ozone support, and recommends using the
ofs protocol.

Change-Id: I724a40c086fe0466646e7e108645fd8dbaee5f1d
Reviewed-on: http://gerrit.cloudera.org:8080/19448
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-30 21:50:32 +00:00
Andrew Sherman
52bdbe1f7e IMPALA-11835: [DOCS] Document that Iceberg Time Travel queries use the old schema.
Since IMPALA-10153, time travel queries use the schema of the table from
the point specified by the time travel parameters. Update the docs to
cover this change.

TESTING:
- Built docs locally

Change-Id: I8ba0b604733c14b697d268b9ea2067a829c0e351
Reviewed-on: http://gerrit.cloudera.org:8080/19420
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-30 18:26:29 +00:00
wzhou-code
43928b190b IMPALA-11617: Pool service should be made aware of cpu core limit
IMPALA-11604 enables the planner to compute CPU usage for certain
queries and to select suitable executor groups to run. The CPU usage is
expressed as the CPU cores required to process a query.

This patch added the CPU core limit, which is the maximum CPU core
available per node and coordinator for each executor group, to the pool
service.

Testing:
 - Passed core run.
 - Verified that CPU cores were shown on the admission and
   metrics pages of the Impala debug web server.

Change-Id: Id4c5ee519ce7c329b06ac821283e215a3560f525
Reviewed-on: http://gerrit.cloudera.org:8080/19366
Reviewed-by: Andrew Sherman <asherman@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-28 01:34:14 +00:00
Tamas Mate
cf6165aca8 IMPALA-11819: [DOCS] Add Iceberg LOAD DATA information
This commit adds information on how LOAD DATA statement can be used with
Iceberg tables.

Testing:
 - Built docs locally

Change-Id: Iec242781a4551aa04e4e920e3f3a1010c7ab808e
Reviewed-on: http://gerrit.cloudera.org:8080/19396
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Gergely Fürnstáhl <gfurnstahl@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Tamas Mate <tmater@apache.org>
2023-01-09 12:04:46 +00:00
Michael Smith
8b83726b8e IMPALA-9487: Add erasure coding policy to SHOW, DESCRIBE
Adds erasure coding policy to introspection commands SHOW FILES, SHOW
PARTITIONS, SHOW TABLE STATS, and DESCRIBE EXTENDED.

Remove `throws IOException` for methods that don't throw. Removes null
check for getSd because getStorageDescriptorInfo - which is called right
after getTableMetaDataInformation - uses it without checking for null.

Adds '$ERASURECODE_POLICY' for runtime test substitution. The test suite
replaces this with the current erasure code policy - from
HDFS_ERASURECODE_POLICY - or NONE to match expected output.

Testing:
- ran backend, end-to-end, and custom cluster tests with erasure coding
- ran backend, end-to-end, and custom cluster tests with exhaustive
  strategy

Change-Id: Idd95f2d18b3980581788c92993b6d2f53504b5e0
Reviewed-on: http://gerrit.cloudera.org:8080/19268
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
2022-12-14 22:37:14 +00:00
Daniel Becker
c8eb2692fb IMPALA-11790: [DOCS] Document the IMPALA-10660 shell option
IMPALA-10660 introduced the "hs2_fp_format" shell option which is
documented in this change.

Change-Id: I402ea08cb3482a84234127c125539d96b4b1ed14
Reviewed-on: http://gerrit.cloudera.org:8080/19358
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Daniel Becker <daniel.becker@cloudera.com>
2022-12-14 15:49:30 +00:00
Daniel Becker
7f43afc277 IMPALA-11791: [DOCS] Document IMPALA-9499 query option
IMPALA-9499 introduced the EXPAND_COMPLEX_TYPES query option which is
documented in this change.

Also updates docs/topics/impala_complex_types.xml - complex types are
now allowed in the select list with the exceptions of collections
embedded in structs and structs embedded in collections.

Change-Id: I1f0a6b402de1ed9bb6aa05987a6ff8e6d62accb5
Reviewed-on: http://gerrit.cloudera.org:8080/19348
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
2022-12-13 12:53:45 +00:00
Gergely Fürnstáhl
644a6890d7 IMPALA-11758: Fixed error detection for usage of reserved words.
Fixed the control flow of checking whether the parsing failed on the
usage of a reserved word where an identifier was expected. Added a hint
to the error message on how to fix the query.

Query: create database iceberg
ERROR: ParseException: Syntax error in line 1:
create database iceberg
                ^
Encountered: ICEBERG
Expected: DEFAULT, EXTENDED, FORMATTED, IF, IDENTIFIER

Hint: reserved words have to be escaped when used as an identifier, e.g. `iceberg`

CAUSED BY: Exception: Syntax error

Testing:
 - Added new tests to test_reserved_words_version.py
 - Adjusted ParserTests
 - Built docs

Change-Id: I9605d80a25ff878d12f6ca2ebb99ac26298d8efa
Reviewed-on: http://gerrit.cloudera.org:8080/19290
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-12-06 15:14:00 +00:00
Tamas Mate
e1e92da796 IMPALA-11676: Prettify asf-site docs
This commit refactors and adds a new build option to the docs build
script/Makefile, these options are:
 - plain-html: the plain html docs, without css and navigation bar, this
was "the" html build before this change.
 - asf-site-html: html docs, with css and navigation bar.
 - pdf

The css is comming from DITA project's documentation.

Testing:
 - Built the docs and tested the pages manually.

Change-Id: Ic9621cb0abaa7fd9bf445da08440c0f6a9788180
Reviewed-on: http://gerrit.cloudera.org:8080/19242
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-11-16 20:26:31 +00:00
xiabaike
6a1a871fb7 IMPALA-11420: [DOCS] Document CREATE/ALTER VIEW SET/UNSET TBLPROPERTIES syntax
Update document for
    [ CREATE VIEW ... TBLPROPERTIES ('key' = 'value', ...) ]
and
    [ ALTER VIEW view_name SET/UNSET TBLPROPERTIES... ] syntax.

Change-Id: Ief1d6bb525ba85a58b8123a0cb712d83523daaec
Reviewed-on: http://gerrit.cloudera.org:8080/19143
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2022-10-18 00:13:15 +00:00
Wenzhe Zhou
1b59d32eff Revert "IMPALA-11617: Pool service should be made aware of processing cost limit"
This reverts commit 1d62bddb84.

Change-Id: I1ebf5ff9685072079e18497d869d06b2c55153fe
Reviewed-on: http://gerrit.cloudera.org:8080/19139
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2022-10-14 04:01:06 +00:00
wzhou-code
1d62bddb84 IMPALA-11617: Pool service should be made aware of processing cost limit
IMPALA-11604 enables the planner to compute CPU usage for certain
queries and to select suitable executor groups to run. The CPU usage is
expressed as the total amount of data to be processed per query.

This patch added the processing cost limit, which is the total amount of
data that each executor group can handle, to the pool service.

Testing:
 - Passed core run.
 - Verified that processing costs were shown on the admission and
   metrics pages of the Impala debug web server.

Change-Id: I9bd2a7284eda47a969ef91e4be19f96d2af53449
Reviewed-on: http://gerrit.cloudera.org:8080/19121
Reviewed-by: Qifan Chen <qchen@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-10-12 23:46:57 +00:00
Tamas Mate
876829951d IMPALA-11483: [DOCS] Document EXPIRE SNAPSHOTS functionality
This commit documents the Iceberg expire snapshot functionality.

Testing:
 - Built the docs locally and verified the generated html.

Change-Id: Iddd927dd1b6ee3b99dd29982018a610389e64836
Reviewed-on: http://gerrit.cloudera.org:8080/18838
Reviewed-by: Tamas Mate <tmater@apache.org>
Tested-by: Tamas Mate <tmater@apache.org>
2022-08-30 10:17:26 +00:00
Shajini Thayasingh
2f55f85519 IMPALA-11519: [DOCS] add UTF-8 requirements
added a note about Glibc version and en_US.UTF-8 locale
updated the notes in both topics
Change-Id: I4d7a21c787c66868219c7bd64aa31f772de2f850
Reviewed-on: http://gerrit.cloudera.org:8080/18897
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-08-23 22:34:42 +00:00
Tamas Mate
51126259be IMPALA-11436: Change search bind authentication parameters
Impala's search bind authentication intends to mimic Spring's behaviour.
However, the login username and user dn paremeters were swapped for
group searches compared to Spring. This change intends to align these
parameters.

For user search, Spring uses {0} to replace the login username.
Meanwhile, during group search {0} is used to replace the login user dn
and {1} is used to replace the login username.

Testing:
 - Ran LdapSearchBindImpalaShellTest frontend tests

Change-Id: I9808566a348f7c6200b0571fbc05e67f720f2075
Reviewed-on: http://gerrit.cloudera.org:8080/18819
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-08-19 14:40:56 +00:00
yx91490
d029ae5367 IMPALA-11426: [DOCS] Fix quotes not paired in doc 'impala_alter_table.xml'
Change-Id: Ic29292cd3c6247fbf7c604d0618e5fbae53b2f01
Reviewed-on: http://gerrit.cloudera.org:8080/18723
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-07-17 13:46:33 +00:00
yx91490
5e4696799a IMPALA-11310: [Doc] the code's tag didn't closed in impala_admission_config.xml
Change-Id: I31d68dd1163bba371da502024b631aeca1ff9b5b
Reviewed-on: http://gerrit.cloudera.org:8080/18552
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-07-15 13:00:23 +00:00
LPL
995887ae4e IMPALA-11287: [DOCS] Implement cloning between Iceberg tables
This commit adds documents related to cloning Iceberg Table.

Testing:
 - Built the docs and checked the contents.

Change-Id: Ie93cbfe6db40b4e41d1445f19801b4c53407c836
Reviewed-on: http://gerrit.cloudera.org:8080/18706
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2022-07-11 14:56:47 +00:00
yx91490
c7784bde55 IMPALA-1682: Support printing the output of a query (rows) vertically.
In vertical mode, impala-shell will print each row in the format:
firstly print a line contains line number, then print this row's columns
line by line, each column line started with it's name and a colon.

To enable it: use shell option '-E' or '--vertical', or 'set VERTICAL=
true' in interactive mode. to disable it in interactive mode: 'set
VERTICAL=false'. NOTICE: it will be disabled if '-B' option or 'set
WRITE_DELIMITED=true' is specified.

Tests:
add methods in test_shell_interactive.py and test_shell_commandline.py.

Change-Id: I5cee48d5a239d6b7c0f51331275524a25130fadf
Reviewed-on: http://gerrit.cloudera.org:8080/18549
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2022-06-13 15:41:07 +00:00