This document is wrote refering to RCFile and Parquet's docs.
The orc-support patch was merged in impala-2.12 and impala-3.0,
so we start to support ORC format as an experimental feature
since impala-2.12.
Change-Id: Ib1ee23ed844653c274babdce5a332dbe5c79b630
Reviewed-on: http://gerrit.cloudera.org:8080/10525
Reviewed-by: Balazs Jeszenszky <jeszyb@gmail.com>
Reviewed-by: Alex Rodoni <arodoni@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Change-Id: I91d9f4f039a603382ff4415d1dd22a351279cbfa
IMPALA-6723 Hints for CTAS
Change-Id: I201a4e1ddaf62164e1f6b636c4e1e60af60e1af7
IMPALA-6723: [DOCS] Hints for CTAS
Optimizer hints were move out of SELECT section.
Hints for CTAS were added to the same section as INSERT.
Change-Id: I91d9f4f039a603382ff4415d1dd22a351279cbfa
Reviewed-on: http://gerrit.cloudera.org:8080/9993
Reviewed-by: Alex Rodoni <arodoni@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
The alias and ordinal substitution logic has been
changed by IMPALA-5191. This commit updates the
documentation regarding to the new behavior.
Change-Id: I558230d07212da62d2cd12e07a52ceba03e980a8
Cherry-picks: not for 2.x.
Reviewed-on: http://gerrit.cloudera.org:8080/9211
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
Removed the Llama options file.
Removed impala_sqlref.ditamap that is not used.
Removed the reference to impala_sqlref.ditamap in README.md
Change-Id: If0c2b8160af9c95ec1e1b744b558d9537dd2550d
Cherry-picks: not for 2.x
Reviewed-on: http://gerrit.cloudera.org:8080/9219
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Modify both char_length() and length() usage notes to say when they
return the same or different results.
Include the same example, showing both STRING and CHAR types,
under both functions.
Change-Id: I18cabfce66351bb890bfbfc26b93466204a82625
Reviewed-on: http://gerrit.cloudera.org:8080/9014
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Tested-by: Impala Public Jenkins
In a new subtopic:
IS [NOT] TRUE
IS [NOT] FALSE
Folded into IS [NOT] NULL:
IS [NOT] UNKNOWN
Change-Id: Iefebf210418ec2d47b154bd37166b76720f085bb
Reviewed-on: http://gerrit.cloudera.org:8080/8942
Reviewed-by: Vuk Ercegovac <vercegovac@cloudera.com>
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
Add some boilerplate wording and reuse the same text
in several likely spots where readers might expect it
(under the pages for performance tuning with joins, hints,
views, and subqueries).
Change-Id: Ibd8e4d8e5260f5a2b7b5c52dc3269bb373b9c76f
Reviewed-on: http://gerrit.cloudera.org:8080/8336
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
In particular, the new query options:
BUFFER_POOL_LIMIT
MAX_ROW_SIZE
MIN_SPILLABLE_BUFFER_SIZE
DEFAULT_SPILLABLE_BUFFER_SIZE
Change-Id: I49323f8ffbff3e195058e88762eedbb1fcb1bc0e
Reviewed-on: http://gerrit.cloudera.org:8080/8003
Tested-by: Impala Public Jenkins
Reviewed-by: John Russell <jrussell@cloudera.com>
Explain how doing COMPUTE INCREMENTAL STATS for the first time
starts over and discards any previous stats from COMPUTE STATS.
As a consequence, moved some wording and examples into
impala_common.xml so that content could be used in
multiple places. Also made a new subtopic on the "Partitioning"
page because I saw COMPUTE INCREMENTAL STATS wasn't mentioned
there.
Change-Id: Ia53a6518ce5541e5c9a2cd896856ce042a599b03
Reviewed-on: http://gerrit.cloudera.org:8080/7999
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
"immediately after the SELECT keyword" was mentioned in a
few places for STRAIGHT_JOIN. I reworded all instances to
mention that [DISTINCT | ALL] can also come before the
hint name.
Change-Id: I3cac1afccc132f389b2017ad217fdf7e7b04513a
Reviewed-on: http://gerrit.cloudera.org:8080/8031
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
Added information in the "DOUBLE Data Type" (impala_double.html)
and the "FLOAT Data Type" (impala_float.html) topics about
how Impala handles NaN values.
Change-Id: Id9485b6790d58fafdae32332d2634cbe893d7fb0
Reviewed-on: http://gerrit.cloudera.org:8080/7098
Reviewed-by: Michael Brown <mikeb@cloudera.com>
Tested-by: Impala Public Jenkins
Reusing the same advice under "Known Issues", scalability
considerations, and in the Impala + Kerberos section.
Change-Id: Icbfa755e2c9769a8458fd93362769856cf32e301
Reviewed-on: http://gerrit.cloudera.org:8080/7349
Reviewed-by: Mostafa Mokhtar <mmokhtar@cloudera.com>
Tested-by: Impala Public Jenkins
The text from impala_common.xml is reused verbatim under
the REFRESH page and in the UDFs page by a #include-like
mechanism.
Change-Id: Ic41fec781396b69e6df06b8de0b29c42ad51ce8f
Reviewed-on: http://gerrit.cloudera.org:8080/7044
Reviewed-by: Taras Bobrovytsky <tbobrovytsky@cloudera.com>
Tested-by: Impala Public Jenkins
Switched the example for Infinity and NaN in the
impala.common file so it will be correct every-
where this example is used in the Impala
documentation. Instead of saying, "pow(-1, 0.5)
returns infinity and 1/0 returns NaN" it has
been corrected to "1/0 returns infinity and
pow(-1, 0.5) returns NaN."
Change-Id: I0e5d950b250c2e4425bde7d9e0bccbb068a73e12
Reviewed-on: http://gerrit.cloudera.org:8080/7083
Reviewed-by: Todd Lipcon <todd@apache.org>
Reviewed-by: Michael Brown <mikeb@cloudera.com>
Tested-by: Impala Public Jenkins
Add CLUSTERED hint.
Update hint syntax in INSERT topic.
Also modernize the hint syntax as shown under INSERT
to include the -- and /* */ formats also. List
the [] style last since it is the least-preferred
option.
Switch to preferring /* */ syntax for hints
instead of using the [ ] notation by default.
Finally, take out references to the SORTBY hint because
it didn't actually make it in. Intent for future is to have a way
to get this behavior without using a hint.
Change-Id: Id3c1da9a87ace361b096fa73d8504b2f54e75bed
Reviewed-on: http://gerrit.cloudera.org:8080/5655
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Link to Apache Kudu docs for details of new security features.
Change-Id: I1266ad38468ef2e987aff5aaaa4db35e6cafdacc
Reviewed-on: http://gerrit.cloudera.org:8080/6634
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: Impala Public Jenkins
Update all URLs in *keydefs* file that point to
issues.cloudera.org.
Update all <xref> tags in the doc source that point
to individual JIRA issues on issues.cloudera.org.
(Leave alone links pointing to JIRA reports; those
don't seem to be redirected currently.)
Change-Id: I9ad2725e4afedddedf8ba65963b217803f532dea
Reviewed-on: http://gerrit.cloudera.org:8080/6452
Reviewed-by: Laurel Hale <laurel@cloudera.com>
Reviewed-by: Michael Brown <mikeb@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Remove last few 'Cloudera recommends' plus
some obsolete/unused Llama text that mentioned CDH.
Change-Id: Ia8544830ad564e59c5eae9844ff3fc255f4ae533
Reviewed-on: http://gerrit.cloudera.org:8080/6381
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Get rid of cloudera.com URLs within the topics/*.xml source files.
Abstract any that need to remain (e.g. blog posts) into impala_keydefs
file for easy examining and editing.
After this change, the number of source/artifact references to CDH and
Cloudera is small enough that we can enumerate exceptions and start the endgame:
Cleanup items remaining in XML source files:
grep -EiI "[^a-zA-Z]cm[^a-zA-Z]|cdh|cloudera" *.xml |
grep -v issues.cloudera.org | wc -l
282
Cleanup items remaining in HTML output files:
grep -EiI "[^a-zA-Z]cm[^a-zA-Z]|cdh|cloudera" ../build/html/topics/*.html |
grep -v issues.cloudera.org | wc -l
148
(These numbers will go down further when the 'installing' and 'updating'
edits land in master.)
Change-Id: I9e29c0feec7bd8e974d8a3d1eb84abe757514be7
Reviewed-on: http://gerrit.cloudera.org:8080/6345
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Followup from Laurel's code reviews, to physically
remove references to Cloudera Manager that were hidden.
Remove a few stray instances of Cloudera Manager that I found
still remaining in the source.
Fix up trailing spaces introduced during earlier
Cloudera Manager-related edits.
Also remove stray 'Cloudera' references, or stale/commented
Cloudera-specific info, noticed near other spots being edited.
Change-Id: Ifc4a84527ae42c39b3717190b6cf669e17fff04b
Reviewed-on: http://gerrit.cloudera.org:8080/6325
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Genericize 3-part version numbers in "known issues".
Genericize CDH version numbers in 'ports' topic.
Genericize 'Cloudera' and hostnames in 'Tables' topic.
Genericize the version numbers in 'added in' blurbs.
Remove lots of CDH / Impala notices from release notes.
Remove obsolete conref'able elements that weren't
actually being called from anywhere, that contained
CDH version number wording.
Reword 'Cloudera recommends'.
Remove more hidden or commented material with
Cloudera-specific wording.
Remove obsolete CDH references from 'incompatible changes'.
Change 'cloudera' HDFS username for LOAD DATA examples.
Remove material related to big lists of CDH fixed JIRAs.
Genericize some CDH-related language.
Change-Id: Iaa5db6c20f4d010972ade4945a3ea59b32ef95de
Reviewed-on: http://gerrit.cloudera.org:8080/6267
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Most of these fixes involved hiding the paragraphs with
the DITA attribute 'audience="hidden"' and then inserting
a paragraph suitable for upstream documentation. This
hides the mention of Cloudera Manager in the rendered
documentation. In a subsequent cleanup project, the
"Cloudera Manager" mentions will be removed from the
XML.
Change-Id: I3c3c2177e0b9c4c81f1541820013c66a59c0c7b1
Reviewed-on: http://gerrit.cloudera.org:8080/6069
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Removed the topic "impala_howto_rm.xml" from the build
because it was focused on resource management by using
Cloudera Manager. Commented it out of impala.ditamap,
impala_keydefs.ditamap, and from a reference to it
in shared/impala_common.xml. This removes it
completely from the rendered upstream docs. A later
project will remove the source XML.
Change-Id: I26fc661144ba906828d5f6b5b2ea2eca02693369
Reviewed-on: http://gerrit.cloudera.org:8080/6108
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins