Commit Graph

675 Commits

Author SHA1 Message Date
Peter Rozsa
ba17491bc0 IMPALA-11889: Docs for ESRI geospatial functions
This change adds documentation for geospatial functions added in
IMPALA-11745.

Change-Id: I5f765927a0856e3034968462514536fd1fffcea5
Reviewed-on: http://gerrit.cloudera.org:8080/22076
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
2024-11-20 15:03:06 +00:00
m-sanjana19
c83e5d9769 IMPALA-13030: [DOCS] Documentation of AI built-in function (ai_generate_text)
Change-Id: Iae921f6554c7010f9568ee4a42b4abcb3534d4a6
Reviewed-on: http://gerrit.cloudera.org:8080/21629
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Yida Wu <wydbaggio000@gmail.com>
2024-10-23 05:27:45 +00:00
Daniel Becker
64e43ad469 IMPALA-13410: Document reading Puffin files
IMPALA-13247 introduced support for reading Puffin files belonging to
the current snapshot. This change documents it.

Change-Id: Ib2975a67aadd948d9451f44a1c884349161c19d2
Reviewed-on: http://gerrit.cloudera.org:8080/21870
Reviewed-by: Peter Rozsa <prozsa@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
2024-10-21 09:34:04 +00:00
Peter Rozsa
1f16919172 IMPALA-12732: Docs for MERGE statement
This change adds documentation for MERGE statement.

Change-Id: Ifadbae34ba802c4d4bd2feeec74f637607f108d7
Reviewed-on: http://gerrit.cloudera.org:8080/21834
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2024-10-09 11:32:18 +00:00
Peter Rozsa
39cab9adee IMPALA-13220: Docs for Iceberg DROP PARTITION
This patch adds a new section to the Iceberg topic about DROP PARTITION.

Change-Id: I45ea95d94ff9785309911c71b5dcf7c13c05b3c4
Reviewed-on: http://gerrit.cloudera.org:8080/21833
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-10-02 11:01:47 +00:00
Noemi Pap-Takacs
2dded92093 IMPALA-13392: Document File Filtering in OPTIMIZE Statement
Document the feature added in 'IMPALA-12867: Filter files to
OPTIMIZE based on file size'.

Change-Id: I73f88adedaf48909784baaf42488cb96defddfc3
Reviewed-on: http://gerrit.cloudera.org:8080/21852
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-10-01 09:59:58 +00:00
Noemi Pap-Takacs
bcba81a1de IMPALA-11663: Update documentation for MT_DOP
The MT_DOP documentation was outdated stating that MT_DOP values
greater than zero are not supported for DML statements.
However, IMPALA-10351 introduced this feature and now DML statements
do not produce an error if MT_DOP is set to a non-zero value.

Change-Id: Id34ccdaa8e1738756f4f12f7074e9f076b9209b4
Reviewed-on: http://gerrit.cloudera.org:8080/21846
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-09-25 12:14:52 +00:00
Riza Suminto
93c64e7e9a IMPALA-13376: Add docs for AGG_MEM_CORRELATION_FACTOR etc
This patch adds documentation for AGG_MEM_CORRELATION_FACTOR and
LARGE_AGG_MEM_THRESHOLD option introduced in Apache Impala 4.4.0.

IMPALA-12548 fix behavior of AGG_MEM_CORRELATION_FACTOR. Higher value
will lower memory estimation, while lower value will result in higher
memory estimation. The documentation in ImpalaService.thrift, however,
says the opposite. This patch fix documentation in thrift file as well.

Testing:
- Run "make plain-html" in docs/ dir and confirm the output.
- Manually check with comments in
  PlannerTest.testAggNodeMaxMemEstimate()

Change-Id: I00956a50fb7616ca3c3ea2fd75fd11239a6bcd90
Reviewed-on: http://gerrit.cloudera.org:8080/21793
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
2024-09-24 17:10:34 +00:00
m-sanjana19
10a380bcbb IMPALA-13257: [DOCS] Documentation for unnest() and querying arrays
Currently, the two topics, Querying Arrays and Zipping Unnest on
Arrays from Views, were missing.

The documentation has been added, and the parent topic has been
updated with references to the child topics.

Change-Id: I3ad29153bf6ed3939fb1d87d6220bd22f8f7fa1b
Reviewed-on: http://gerrit.cloudera.org:8080/21651
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2024-08-13 21:38:30 +00:00
Fang-Yu Rao
589dbd6f1a IMPALA-13276: Revise the documentation of 'RUNTIME_FILTER_WAIT_TIME_MS'
This patch revises the documentation of the query option
'RUNTIME_FILTER_WAIT_TIME_MS' as well as the code comment for the same
query option to make its meaning clearer.

Change-Id: Ic98e23a902a65e4fa41a628d4a3edb1894660fb4
Reviewed-on: http://gerrit.cloudera.org:8080/21644
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Csaba Ringhofer <csringhofer@cloudera.com>
2024-08-09 17:49:03 +00:00
Fang-Yu Rao
13a3d19a2c IMPALA-13250: [DOCS] Document ENABLED_RUNTIME_FILTER_TYPES query option
This patch documents the ENABLED_RUNTIME_FILTER_TYPES query option based
on the respective code comments in ImpalaService.thrift and
query-options.cc.

Change-Id: Ib7a34782bed6f812fedf717d8a076e2706f0bba9
Reviewed-on: http://gerrit.cloudera.org:8080/21645
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2024-08-08 22:07:48 +00:00
m-sanjana19
b1941c8f17 IMPALA-13071: Update the doc of Impala components
Change-Id: I83192110d29c4d44529d1276a17c9da4a91435aa
Reviewed-on: http://gerrit.cloudera.org:8080/21621
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2024-08-07 02:31:36 +00:00
m-sanjana19
7d72a0c17d IMPALA-13271: Correct the documentation with respect to granting privileges on URI
Currently, when an administrator grants a privilege on a URI to
a grantee via impala-shell, the created policy in Ranger's policy 
repository is non-recursive.

That is, the policy does not apply for any directory under the URI.
This patch corrects this in the documentation.

Change-Id: Ife9f07294fb0f0b24acb1c8d0199c64ec7d73e9a
Reviewed-on: http://gerrit.cloudera.org:8080/21633
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Fang-Yu Rao <fangyu.rao@cloudera.com>
2024-08-05 16:19:45 +00:00
m-sanjana19
db6ead8136 IMPALA-13142: [DOCS] Documentation for Impala StateStore & Catalogd HA
Change-Id: I8927c9cd61f0274ad91111d6ac4a079f7a563197
Reviewed-on: http://gerrit.cloudera.org:8080/21615
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Yida Wu <wydbaggio000@gmail.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2024-08-01 02:36:25 +00:00
jankiram84
6632fd00e1 IMPALA-12754: [DOCS] External JDBC table support
Created the docs for Impala external JDBC table support

Change-Id: I5360389037ae9ee675ab406d87617d55d476bf8f
Reviewed-on: http://gerrit.cloudera.org:8080/21539
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: gaurav singh <gsingh@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2024-06-20 18:05:29 +00:00
Michael Smith
4681666e93 IMPALA-12800: Add cache for isTrueWithNullSlots() evaluation
isTrueWithNullSlots() can be expensive when it has to query the backend.
Many of the expressions will look similar, especially in large
auto-generated expressions. Adds a cache based on the nullified
expression to avoid querying the backend for expressions with identical
structure.

With DEBUG logging enabled for the Analyzer, computes and logs stats
about the null slots cache.

Adds 'use_null_slots_cache' query option to disable caching. Documents
the new option.

Change-Id: Ib63f5553284f21f775d2097b6c5d6bbb63699acd
Reviewed-on: http://gerrit.cloudera.org:8080/21484
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-06-12 12:27:05 +00:00
Riza Suminto
98739a8455 IMPALA-13083: Clarify REASON_MEM_LIMIT_TOO_LOW_FOR_RESERVATION
This patch improves REASON_MEM_LIMIT_TOO_LOW_FOR_RESERVATION error
message by saying the specific configuration that must be adjusted such
that the query can pass the Admission Control. New fields
'per_backend_mem_to_admit_source' and
'coord_backend_mem_to_admit_source' of type MemLimitSourcePB are added
into QuerySchedulePB. These fields explain what limiting factor drives
final numbers at 'per_backend_mem_to_admit' and
'coord_backend_mem_to_admit' respectively. In turn, Admission Control
will use this information to compose a more informative error message
that the user can act upon. The new error message pattern also
explicitly mentions "Per Host Min Memory Reservation" as a place to look
at to investigate memory reservations scheduled for each backend node.

Updated documentation with examples of query rejection by Admission
Control and how to read the error message.

Testing:
- Add BE tests at admission-controller-test.cc
- Adjust and pass affected EE tests

Change-Id: I1ef7fb7e7a194b2036c2948639a06c392590bf66
Reviewed-on: http://gerrit.cloudera.org:8080/21436
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-05-23 03:54:00 +00:00
Daniel Becker
aba27edc33 IMPALA-13036: Document Iceberg metadata tables
This change adds documentation on how Iceberg metadata tables can be
used.

Testing:
 - built docs locally

Change-Id: Ic453f567b814cb4363a155e2008029e94efb6ed1
Reviewed-on: http://gerrit.cloudera.org:8080/21387
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Peter Rozsa <prozsa@cloudera.com>
2024-05-10 12:40:16 +00:00
m-sanjana19
aac7f527da IMPALA-11328: [DOCS] Fix incorrect default value for max_errors
Change-Id: I442cd3ff51520c12376a13d7c78565542793d908
Reviewed-on: http://gerrit.cloudera.org:8080/21419
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-05-10 11:20:41 +00:00
Noemi Pap-Takacs
9b05a205fe IMPALA-13000: Document OPTIMIZE TABLE
Document OPTIMIZE TABLE syntax and behaviour.

Testing:
 - built docs locally

Change-Id: I851669686ed4da610dcac97c9b88ff23b0a4a647
Reviewed-on: http://gerrit.cloudera.org:8080/21320
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-04-22 10:40:44 +00:00
Michael Smith
f05eac6476 IMPALA-12602: Unregister queries on idle timeout
Queries cancelled due to idle_query_timeout/QUERY_TIMEOUT_S are now also
Unregistered to free any remaining memory, as you cannot fetch results
from a cancelled query.

Adds a new structure - idle_query_statuses_ - to retain Status messages
for queries closed this way so that we can continue to return a clear
error message if the client returns and requests query status or
attempts to fetch results. This structure must be global because HS2
server can only identify a session ID from a query handle, and the query
handle no longer exists. SessionState tracks queries added to
idle_query_statuses_ so they can be cleared when the session is closed.

Also ensures MarkInactive is called in ClientRequestState when Wait()
completes. Previously WaitInternal would only MarkInactive on success,
leaving any failed requests in an active state until explicitly closed
or the session ended.

The beeswax get_log RPC will not return the preserved error message or
any warnings for these queries. It's also possible the summary and
profile are rotated out of query log as the query is no longer inflight.
This is an acceptable outcome as a client will likely not look for a
log/summary/profile after it times out.

Testing:
- updates test_query_expiration to verify number of waiting queries is
  only non-zero for queries cancelled by EXEC_TIME_LIMIT_S and not yet
  closed as an idle query
- modified test_retry_query_timeout to use exec_time_limit_s because
  queries closed by idle_timeout_s don't work with get_exec_summary

Change-Id: Iacfc285ed3587892c7ec6f7df3b5f71c9e41baf0
Reviewed-on: http://gerrit.cloudera.org:8080/21074
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-04-03 03:25:10 +00:00
jasonmfehr
3e4fdeece1 IMPALA-12824: Removes the prettyprint_duration Built-in Function
The prettyprint_duration function was originally
implemented in IMPALA-12824 to work with the workload
management tables which stored durations in integer
nanoseconds. These tables have changed to store decimal
seconds.

The prettyprint_duration function would have required a
large investment of time to make it work with decimal
values, and since the new format is more human readable
anyways, this function has been removed.

Change-Id: If2154c2ed9a7217ed4b7587adeae87df55ff03dc
Reviewed-on: http://gerrit.cloudera.org:8080/21208
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-03-28 06:58:56 +00:00
Saurabh Katiyal
eb2939245f IMPALA-12693: [DOCS] Typo in link for ltrim in string functions docs
Fixed documentation typo for LTRIM string function, from LTRI to LTRIM.

Change-Id: If4345fc6d19f04d0c0c6feef3e0c8598271224fe
Reviewed-on: http://gerrit.cloudera.org:8080/21123
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-03-13 09:20:16 +00:00
Noemi Pap-Takacs
70c35425d3 IMPALA-12774: [DOCS] Document ALTER TABLE SORT BY syntax
Extended the ALTER TABLE documentation with the SORT BY clause.
Also added more information about the available and the deafult
sort orders to the CREATE TABLE description.

Testing: Built docs locally.

Change-Id: Ieb348d8395a6140f0be200d73e2f22fded9a5116
Reviewed-on: http://gerrit.cloudera.org:8080/21083
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
2024-03-11 10:10:43 +00:00
Anshula Jain
ca3fe6d6af IMPALA-12692 : [DOCS] Typo in docs about random() function
Changed name of random fucntion in impala_math_functions.xml
from "RANDOME(), RANDOME(BIGINT seed)" to "RANDOM(), RANDOM(BIGINT seed)"

Change-Id: I4844eb8d155326081c385d88b98a591dbbde7369
Reviewed-on: http://gerrit.cloudera.org:8080/21126
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
2024-03-08 16:34:50 +00:00
Michael Smith
b81368a225 IMPALA-12858: [DOCS] Correct idle_client_poll_period_s docs
Correctly refer to idle_client_poll_period_s in documentation.

Change-Id: Ib89c8e3877bed508f6ba18483e48b0a4b4bd5cce
Reviewed-on: http://gerrit.cloudera.org:8080/21092
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
2024-02-29 19:40:17 +00:00
Riza Suminto
f5c12c65db IMPALA-12801: Increase query_log_ default size and bound its memory.
Coordinator's /queries page is useful to show information about recently
run and completed queries. Having more entries will be helpful to
inspect queries that completed further back. The maximum entry of this
table is controlled by 'query_log_size' flag. Higher value means more
queries to keep, but it also cost more memory overhead in coordinator.

This patch increase 'query_log_size' default value from 100 to 200. This
patch also add flag 'query_log_size_in_bytes' (default to 2GB) as an
additional safeguard to evict entry from query_log_ when this limit
exceeded, preventing query_log_ total memory to grow prohibitively
large. 'query_log_size_in_bytes' is used in combination with
'query_log_size' to limit the number of QueryStateRecord to retain in
query_log_, whichever is less.

Testing:
- Pass exhaustive tests.

Change-Id: I107e2c2c7f2b239557be37360e8eecf5479e8602
Reviewed-on: http://gerrit.cloudera.org:8080/21020
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-02-23 21:01:08 +00:00
jasonmfehr
d03ffc70f2 IMPALA-12824: Adds built-in functions prettyprint_duration and prettyprint_bytes.
The prettyprint_duration function takes an integer input containing a
number of nanoseconds and returns a human readable value breaking down
the input by hours, minutes, seconds, milliseconds, microseconds, and
nanoseconds.

The prettyprint_bytes function takes an integer input containing a
number of bytes and returns a human readable values breaking down the
input by gigabytes, megabytes, kilobytes, and bytes.

Functionality tests were added to the existing expr-test suite that
tests built-in functions.

Functional-query workloads were added in two new .test files under the
testdata directory to exercise these two new functions. Corresponding
pytests were added to run the tests in these new .test files.

Benchmarks were added to expr-benchmark, and new benchmarks were
generated with a release build running on a machine with the cpu
Intel(R) Core(TM) i7-10700 CPU @ 2.90GHz.

Documentation was added to the built-in string functions docs.

Change-Id: I3e76632ce21ad2ca5df474160338699a542a6913
Reviewed-on: http://gerrit.cloudera.org:8080/21038
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-02-21 04:23:28 +00:00
pranavyl
ab445195b0 IMPALA-12756: [DOCS] Unicode column name support documentation
The patch focuses on documenting that Impala supports unicode
column names, consistent with Hive's current support (as we use
Hive MetaStore to store table metadata).

Change-Id: I3d43d942a3ea069020f06adab6ea77e62ad5ffbe
Reviewed-on: http://gerrit.cloudera.org:8080/20950
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2024-01-30 14:00:31 +00:00
Zoltan Borok-Nagy
dea8546d80 IMPALA-12653: Update documentation about the UPDATE statement
This patch adds documentation about the UPDATE statement.

Change-Id: I2a4f3dcdba5faaa7dffda60b8590d09e6a92a165
Reviewed-on: http://gerrit.cloudera.org:8080/20818
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Andrew Sherman <asherman@cloudera.com>
2024-01-02 10:49:13 +00:00
Riddhi Jain
d01d028b07 IMPALA-11762: [DOCS] Reserved words documentation lags behind the code
Crosschecked keywordMap from:
https://github.com/apache/impala/blob/master/fe/src/main/jflex/sql-scanner.flex
with upstream docs:
https://impala.apache.org/docs/build/html/topics/impala_reserved_words.html#reserved_words

Added following Keywords missing from docs:
buckets
disable
enable
hudiparquet
jsonfile
lexical
managedlocation
minus
non
norely
novalidate
optimize
orc
rely
rwstorage
selectivity
sets
spec
storagehandler_uri
system_version
unset
user_defined_fn
validate
zorder

Change-Id: I0ae58a4730c2e3d8d82cccdff23c1fff36117522
Reviewed-on: http://gerrit.cloudera.org:8080/20605
Reviewed-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
Tested-by: Laszlo Gaal <laszlo.gaal@cloudera.com>
2023-11-24 01:14:11 +00:00
Shajini Thayasingh
43051237d3 IMPALA-11967: [DOCS] Update Compute Incremental Stats syntax
Updated "compute incremental stats" syntax to support a list of columns.

Change-Id: Id5ad3bdf26572a1d0510df9b41ee1f12ae2cf747
Reviewed-on: http://gerrit.cloudera.org:8080/19602
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
2023-11-14 01:15:34 +00:00
Shajini Thayasingh
a01ad35566 IMPALA-12491: [DOCS] Add a note on the cache item
Described how the scan request will access the cache when there is
no change in the mtime in the file metadata.

Change-Id: I508ce667181d635c17373c7336ea9f83984d7641
Reviewed-on: http://gerrit.cloudera.org:8080/20611
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
2023-11-06 18:57:18 +00:00
Tamas Mate
eadd35f6d5 IMPALA-11853: Fix formatted docs query options CSS
The recent documentation formatting changes introduced the navigation
panel on the left. However, due to the length of the query options
navigation title these could overlap with the documentation paragraphs.

This commit removes the underscores from the navigation titles of the
query options, so browsers can break them into multiple lines.
Additionally, the "SET" and "Query Options for the SET Statement" pages
are merged to save some more space for the query option navigation
titles.

Testing:
 - Built the documentation and tested manually

Change-Id: Icec787d7a2af848aaaff65be2ecf311a5ce8fe7f
Reviewed-on: http://gerrit.cloudera.org:8080/20556
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Jason Fehr <jfehr@cloudera.com>
Reviewed-by: Peter Rozsa <prozsa@cloudera.com>
Reviewed-by: Tamas Mate <tmater@apache.org>
2023-10-18 10:22:05 +00:00
Abhishek Rawat
05bc485851 IMPALA-10860: Allow setting mem_limit for coordinators
Added support for MEM_LIMIT_COORDINATORS query option. This is
similar to exisiting MEM_LIMIT_EXECUTORS, but applies to coordinators.
There are cases where Planner generates inaccurate estimates for
coordinator fragments and would be good to be able to set mem limit
just for the coordinator, since a query's memory requirement on
coordinator tends to be much lower compared to that on executors.

If MEM_LIMIT is set, then MEM_LIMIT_COORDINATORS is ignored.

Also updated the documentation for the new query option.

Testing:
- Added new custom cluster tests which validates MEM_LIMIT_COORDINATORS
applies only on coordinator. The test also validates that both
MEM_LIMIT_EXECUTORS and MEM_LIMIT_COORDINATORS can be set together.
- Built docs and made sure that the new changes have proper formatting.

Change-Id: I2dfc9a735e82dce2fd903bdaf6bc2e46e982ef8c
Reviewed-on: http://gerrit.cloudera.org:8080/20378
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-23 01:55:00 +00:00
Zoltan Borok-Nagy
12276c79f9 IMPALA-12335: [DOCS] Add documentation about the DELETE statement
IMPALA-11877 added support for the DELETE statement for Iceberg
tables. This patch documents this feature.

Change-Id: If111a7ecd20bda2d4928332ef2ccd905814cb203
Reviewed-on: http://gerrit.cloudera.org:8080/20361
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-17 15:44:06 +00:00
Zoltan Borok-Nagy
240a9cddc2 [docs] Add note about REFRESH for tables in custom catalog
This patch adds a small note about Iceberg tables in custom catalog.

Change-Id: I909603dff20fc4fefde48539d301844d40167a57
Reviewed-on: http://gerrit.cloudera.org:8080/20328
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Andrew Sherman <asherman@cloudera.com>
2023-08-15 20:27:21 +00:00
Andrew Sherman
a24897269d IMPALA-12332: Undo IMPALA-8615 as the corresponding configs have been removed
IMPALA-8615 documented the changes made in IMPALA-8536, but the configs
were subsequently removed in IMPALA-9077. Rollback IMPALA-8615 to bring
the docs up to date.

Revert "IMPALA-8615: [DOCS] Document the scalable admission control parameters"
This was a clean revert, and there were no overlapping changes to this file.

TESTING:
- built docs and reviewed the file.

This reverts commit b2136c39fc.

Change-Id: Ibc856c62babb4b305b6a7c286a0f4c86e6e418cc
Reviewed-on: http://gerrit.cloudera.org:8080/20308
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-08-03 16:08:03 +00:00
whsyourdaddy
bb237660a0 IMPALA-12047: [DOCS] Fix a URL mistake when using "git clone" in docs/README.md
There's a URL mistake when using "git clone" in docs/README.md:
  git clone https://gitbox.apache.org/repos/asf/impala.git/docs

This doesn't work and it will prompt "repository not found".

This change corrects the description, providing two ways to download
the docs - either by downloading the whole repository and going to the
docs/ directory or by downloading only the docs using git sparse-checkout.

Change-Id: Ib00c37e28e67cca5b3630742b4c366dea4e967b7
Reviewed-on: http://gerrit.cloudera.org:8080/19634
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Reviewed-by: Yingchun Lai <laiyingchun@apache.org>
2023-04-17 08:55:03 +00:00
Noemi Pap-Takacs
b8915c9a4d IMPALA-11878: Document that AVRO file format is supported in Iceberg tables
Change-Id: I0e3fdade3a12570e488a3a58819697134dbb4c15
Reviewed-on: http://gerrit.cloudera.org:8080/19680
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Tamas Mate <tmater@apache.org>
2023-04-17 08:26:34 +00:00
Shajini Thayasingh
c353d69cbd IMPALA-11985: [DOCS] Support for Kudu's multi-rows transaction
Fixed some typos and made final changes.
Clarified some questions that were raised as comments.
Incorporated some minor comments.
Documented the support for Kudu's multi-rows transaction.

Change-Id: Ic226679d83d7221f843994ead11cb2bc9e971882
Reviewed-on: http://gerrit.cloudera.org:8080/19651
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Alexey Serbin <alexey@apache.org>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-31 23:29:23 +00:00
Shajini Thayasingh
19b5d781f4 IMPALA-11987: [DOCS] Document the mem_limit_executors query option
Document the difference between mem_limit and mem_limit_executors
query options.

Change-Id: Ibce8347d564afde0f9b9e83df560454f4b8b5c9d
Reviewed-on: http://gerrit.cloudera.org:8080/19604
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-22 15:46:22 +00:00
Shajini Thayasingh
8edfab000e IMPALA-12010: [DOCS] Document the support for non-unique primary key
Incorporated new comments.
Made changes to address new comments posted after the patch was merged.

Change-Id: Ifaea9645cf771b76fda3083d73b14524f53eb186
Reviewed-on: http://gerrit.cloudera.org:8080/19636
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Abhishek Chennaka <achennaka@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-21 20:56:27 +00:00
Andrew Sherman
b969a1fc6a IMPALA-11596: [DOCS] Document EXECUTE ROLLBACK functionality
Since IMPALA-11482 Impala supports the 'Alter Table Execute Rollback'
statement for Iceberg tables. Update the docs to cover this change.

The section on DESCRIBE HSTORY is expanded to include the output
columns, as this information is relevant to EXECUTE ROLLBACK.
The section on Cloning Iceberg tables is moved so that the sections
concerned with table history are adjacent.

TESTING:
- Built docs locally.

Change-Id: I0e1690378e560197263c49f468618b1ded922df3
Reviewed-on: http://gerrit.cloudera.org:8080/19606
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Noemi Pap-Takacs <npaptakacs@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2023-03-21 14:28:03 +00:00
Michael Smith
39fea06f2b IMPALA-11990: Make actual failures clearer
A hack to cleanup after Hbase fails when services haven't been started
yet (which is always at least once in a CI run) with a large error
message. That error isn't useful and can be misleading for people
reviewing test logs. Suppress it.

Guards data load for Ozone as a usable snapshot is required. Also fixes
a typo in fixed issues.

Change-Id: Idc37d03780fca35427b977524b2b97a6892c88f7
Reviewed-on: http://gerrit.cloudera.org:8080/19459
Reviewed-by: Gabor Kaszab <gaborkaszab@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
2023-03-10 22:23:18 +00:00
Shajini Thayasingh
27e238c8cf IMPALA-11906: [DOCS] Document the support for non-unique primary key
Incorporated the comments received.
Added a new sub-section.
Change-Id: I7b5a452f2199d097077150c012497aa4a3ecf7d9
Reviewed-on: http://gerrit.cloudera.org:8080/19587
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Abhishek Chennaka <achennaka@cloudera.com>
Reviewed-by: Wenzhe Zhou <wzhou@cloudera.com>
2023-03-08 17:35:52 +00:00
Michael Smith
1321b5ce54 IMPALA-11920: [DOCS] Cleanup and update spill examples
Updates documentation to include examples with service identifier. Also
fixes inconsistent use of ASCII quotes for example text, highlighting
code and variable names, and normalizes descriptions between
S3/HDFS/Ozone. Removes "priority" from remote descriptions as it is
optional and does nothing.

Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Reviewed-on: http://gerrit.cloudera.org:8080/19504
Reviewed-by: Michael Smith <michael.smith@cloudera.com>
Tested-by: Michael Smith <michael.smith@cloudera.com>
2023-03-02 17:25:55 +00:00
Riza Suminto
fa64be7cc7 IMPALA-11940: [DOCS] Document manifest caching settings for Iceberg
IMPALA-11658 implements Iceberg manifest caching for Impala. This patch
adds documentation for configuring the cache(s).

Testing:
- Built docs locally

Change-Id: Idd761a81f5c81a25a5ec0889402f85157c23e9fe
Reviewed-on: http://gerrit.cloudera.org:8080/19530
Reviewed-by: Daniel Becker <daniel.becker@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Zoltan Borok-Nagy <boroknagyz@cloudera.com>
2023-02-24 17:16:34 +00:00
Shajini Thayasingh
7dcf80b32e IMPALA-10804: [DOCS] Document spill to remote storage
Spill to HDFS, S3, and Ozone.

Change-Id: I3efb2ffcc06cdbe69845c6dc4cf03d9f2e3dcabc
Reviewed-on: http://gerrit.cloudera.org:8080/19472
Reviewed-by: Yida Wu <wydbaggio000@gmail.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-02-07 22:04:51 +00:00
Michael Smith
88d49b6919 IMPALA-11693: Enable allow_erasure_coded_files by default
Enables allow_erasure_coded_files by default as we've now completed all
planned work to support it.

Testing
- Ran HDFS+EC test suite
- Ran Ozone+EC test suite

Change-Id: I0cfef087f2a7ae0889f47e85c5fab61a795d8fd4
Reviewed-on: http://gerrit.cloudera.org:8080/19362
Reviewed-by: Joe McDonnell <joemcdonnell@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
2023-01-31 16:53:46 +00:00