First cut at 'new features' topic.
Includes the Incompatible Changes subtopic for Impala 2.8.
Also did some cleanup throughout the Incompatible
Changes page:
- Took out references to Cloudera release numbers
from titles.
- Suppressed the display of ancient subtopics from
the Impala beta days, which are intertwined with
things like what version of Cloudera Manager was
supported.
Patch set 3:
More on MT_DOP for COMPUTE STATS.
Address comments from Greg and MJ.
Added incompatible change for IMPALA-4160 - Llama removal.
Also pulled out hidden discussion and obsolete details of no-op
query options.
Change-Id: I7c47f422e509cec6d3eb8aaa82294b584f393aed
Reviewed-on: http://gerrit.cloudera.org:8080/5668
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
There was one tutorial that actually ran under the
'cloudera' user and so repeated that name over and
over in directory and HDFS paths. I switched that to
'username'.
I suppressed some <note> and <li> tags with Cloudera
Manager-specific details. Will physically remove those
from the source in a subsequent iteration.
I left several instances of audience="Cloudera" because
those will be changed to audience="hidden" as part of
a separate change request.
I marked with rev="upstream" some <codeblock> tags
containing impala-shell banners with a Cloudera copyright
statement. Will decide on a convention to handle those
(elide those lines, or use a conref to consistently
substitute the generic equivalent) and do that in a
followup patch set.
Change-Id: I44245b65ce6f247ae8771f582f4b33c3712145ae
Reviewed-on: http://gerrit.cloudera.org:8080/5663
Reviewed-by: Laurel Hale <laurel@cloudera.com>
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
This set of edits removes references and links to Cloudera Navigator
and Cloudera Manager from the auditing and lineage topics. Those
were either marked as 'hidden' or replaced with a generic suggestion
to use cluster management software with a focus on governance.
Some paragraphs with overflowing lines were also fixed.
Change-Id: I192bc2d1de89e55418c045d1a0e5433cf02cf782
Reviewed-on: http://gerrit.cloudera.org:8080/5957
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Rewrote sections to eliminate 'Cloudera Manager' from topics.
Look for subsequent phases to remove remaining instances of CM.
Change-Id: I02ff6c3fc74e2e59b5d130226bd38c23c9c094b7
Reviewed-on: http://gerrit.cloudera.org:8080/6049
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Almost all of the original Impala FAQ material was
Cloudera-themed or commercially oriented. Lots of
answers about the QuickStart VM, Cloudera discussion
forums, CDH-based recommendations, etc. IMO it is
not worth trying to adapt each FAQ entry to be generic.
Better to start over from the ground up.
Phase 1 of making an Apache-friendly FAQ is to strip
the original page "down to the studs" so new FAQ
entries can be added with more of a developer theme,
based on questions people have in the community.
Change-Id: Ib81242f0981c04fff99e2c27e06a8d9f4da34c9f
Reviewed-on: http://gerrit.cloudera.org:8080/6003
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Upgrade with details of latest syntax.
Fine-tune discussion of PK and other Kudu
notions.
The impala_kudu diff looks larger than actual changes
to the page, because subtopics got moved
around and promoted/demoted (which changes the
indentation). Best to review that page start-to-finish.
CREATE TABLE details for Impala + Kudu.
ALTER TABLE details for Impala + Kudu.
Unhide the Impala partitioning + Kudu topic.
Mainly a brief intro then a link to delegate
details to the main Kudu page, which already
has a partitioning subtopic.
Include changes to reserved words. Entirely
from Kudu integration work.
Add Kudu considerations for misc SQL statements.
Addressed Todd's and Dimitris's comments for certain files.
(Up to the beginning of the "Partitioning" section in
impala_kudu.xml.)
Added Kudu blurbs to data type topics:
- Some aren't supported.
- Others are supported but can't go in the primary key.
Added walkthrough of renaming internal/external tables.
Split out Kudu CREATE TABLE syntax from other file formats.
Correct info about CTAS for Kudu tables.
Add examples of basic Kudu, external Kudu, and Kudu CTAS.
Change-Id: I76dcb948dab08532fe41326b22ef78d73282db2c
Reviewed-on: http://gerrit.cloudera.org:8080/5649
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: Impala Public Jenkins
This is part 1 of the changes being made to the Impala authorization
topics. References to CDH and Cloudera Manager docs/products have been
either 'hidden' or removed completely.
Examples with Sentry have been made more generic. Instances of
Cloudera-specific folders or filenames have been removed.
Change-Id: Ie5c4431f3236b18fc282343ed98513f0e578130e
Reviewed-on: http://gerrit.cloudera.org:8080/5931
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Make keydefs corresponding to most of the links
(especially external links) throughout the doc source.
Make keydefs for all possible IMPALA- JIRA issues, up to
IMPALA-9999.
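
A minimal sketch of how a block of JIRA keydefs like this could be generated
rather than typed by hand. The JIRA base URL, the attribute set on each
<keydef>, and the function name jira_keydefs are assumptions for illustration,
not necessarily what the doc source uses.

```python
from xml.sax.saxutils import quoteattr

# Assumed base URL for Impala JIRA issues; adjust to match the real doc source.
JIRA_BASE = "https://issues.apache.org/jira/browse/"


def jira_keydefs(first=1, last=9999, project="IMPALA"):
    """Return one <keydef> line per JIRA issue number in [first, last]."""
    lines = []
    for n in range(first, last + 1):
        key = f"{project}-{n}"
        href = JIRA_BASE + key
        # quoteattr() adds the surrounding quotes and escapes special characters.
        lines.append(
            f"<keydef keys={quoteattr(key)} href={quoteattr(href)} "
            'scope="external" format="html"/>'
        )
    return lines
```

Writing the returned lines into a separate keydef file keeps the generated
entries apart from the hand-maintained ones.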
Change-Id: If57c5730f80fd32ee77b31849e4e75afd53fab38
Reviewed-on: http://gerrit.cloudera.org:8080/5923
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Tested-by: Impala Public Jenkins
Reviewed-by: John Russell <jrussell@cloudera.com>
Two links pointed to the archive.cloudera.com mirror of the HBase
docs; point them to the Apache HBase doc site instead.
Remove a little Cloudera-specific verbiage, esp. in banners of
hive / impala-shell output.
Change-Id: I1b832d23be7182d88c99ee169976ad9aeed746b1
Reviewed-on: http://gerrit.cloudera.org:8080/5900
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
The "Known Issue" link to IMPALA-1480 rendered as the full
URL followed by the text "IMPALA-1480", instead of
"IMPALA-1480" being the link text, because "IMPALA-1480"
accidentally ended up outside the </xref> end tag for
the link.
Also fill in Known Issues title for IMPALA-3441.
Change-Id: I87b6c7b9f4c6db50c63aed7f6248dc8e808e4f6f
Reviewed-on: http://gerrit.cloudera.org:8080/5881
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Keep the location and cache options with ADD PARTITION,
and PURGE with DROP PARTITION.
Change-Id: I07458d9851bb2ab48311001f696353f834c4fe80
Reviewed-on: http://gerrit.cloudera.org:8080/5729
Reviewed-by: Dimitris Tsirogiannis <dtsirogiannis@cloudera.com>
Tested-by: Impala Public Jenkins
Fill in syntax, usage notes, examples for
UPDATE, DELETE, UPSERT. Take out IGNORE from
INSERT.
Add 2nd syntax form and examples for DELETE.
Add join syntax to UPDATE.
Change-Id: I60512b7957fb53d86d3123a4f1d46fbb355f4665
Reviewed-on: http://gerrit.cloudera.org:8080/5646
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: Impala Public Jenkins
Also touch up the signatures of the related quotient() function,
which have been looking a bit stale since the very early days.
Change-Id: I85093383ca6f32ba69f46fdf0220a1026e36f37f
Reviewed-on: http://gerrit.cloudera.org:8080/5634
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
Syntax and usage notes for ALTER TABLE,
COMPUTE STATS, and SHOW FILES.
Mixed in a little bit with new Kudu syntax for
ALTER TABLE. Didn't include all new Kudu info
in this CR, the better to minimize merge conflicts.
Added note about performance/scalability of IMPALA-1654.
Added new Known Issue item for IMPALA-4106 under Performance category.
Change-Id: I2060552d5081e5f93b1b1f398414c52fa03f215b
Reviewed-on: http://gerrit.cloudera.org:8080/5726
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
Fix the version numbers that show up in titles
and in lead-ins, e.g. "____ or higher only", to
reflect purely Impala release numbers.
Substitute the most prevalent forms of reference
to CDH and Cloudera support with generic equivalents.
Resolved merge conflict with the code review to reorg
all the keydefs into a separate file.
Change-Id: Ia23b96a8344e6430528701b81bea7eaaccc72b18
Reviewed-on: http://gerrit.cloudera.org:8080/5661
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
For this change to land in master, the audience="hidden" code review
needs to be completed first. Otherwise, the doc build would still work
but the audience="hidden" content would be visible rather than hidden as
desired.
Some work happening in parallel might introduce additional instances of
audience="Cloudera". I suggest addressing those in a followup CR so this
global change can land quickly.
Since the changes apply across so many different files, but are so
narrow in scope, I suggest that the way to validate (check that no
extraneous changes were introduced accidentally) is to diff just the
changed lines:
git diff -U0 HEAD^ HEAD
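
A rough sketch of how that check could be automated, assuming the only
intended change is audience="Cloudera" becoming audience="hidden". The
function name unexpected_changes is hypothetical, and the header detection
is deliberately simple.

```python
OLD = 'audience="Cloudera"'
NEW = 'audience="hidden"'


def unexpected_changes(diff_text):
    """Return changed lines from a -U0 diff that are not the expected swap."""
    problems = []
    removed, added = [], []

    def flush():
        # Within one hunk, every removed line with OLD swapped for NEW
        # should match the added lines, in order.
        expected = [r.replace(OLD, NEW) for r in removed]
        if expected != added:
            problems.extend(removed + added)
        removed.clear()
        added.clear()

    for line in diff_text.splitlines():
        # Skip per-file headers; this assumes no content line looks like one.
        if line.startswith(("diff ", "index ", "--- ", "+++ ")):
            continue
        if line.startswith("@@"):
            flush()
        elif line.startswith("-"):
            removed.append(line[1:])
        elif line.startswith("+"):
            added.append(line[1:])
    flush()
    return problems
```

Passing it the text output of the git diff command above should return an
empty list if the change is as narrow as intended.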
In patch set 2, I updated other topics marked audience="Cloudera"
by CRs that were pushed in the meantime.
Change-Id: Ic93d89da77e1f51bbf548a522d98d0c4e2fb31c8
Reviewed-on: http://gerrit.cloudera.org:8080/5613
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
Patch set #2 to bring this gerrit review
up to date with subsequent changes to the .ditamap
file in master.
Also incorporate <topicref> and corresponding
<keydef> for SCRATCH_LIMIT query option.
Change-Id: I4d3098356e1b112ba08bfaf7386c3a1f30306223
Reviewed-on: http://gerrit.cloudera.org:8080/5599
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
One query option was missing a <navtitle> tag.
The other had a <navtitle> tag but it was blank.
The result could have been inconsistent or missing
link text in auto-generated navigational links
such as "next page" and "previous page".
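
A small sketch of a check that would catch both cases. It assumes the
<navtitle> element lives somewhere inside the topic file and that the file
parses without external entity definitions. The function name
navtitle_problem is hypothetical.

```python
import xml.etree.ElementTree as ET


def navtitle_problem(topic_xml):
    """Return 'missing', 'blank', or None for a topic's <navtitle>."""
    root = ET.fromstring(topic_xml)
    nav = root.find(".//navtitle")
    if nav is None:
        return "missing"
    # Join nested text so inline markup inside <navtitle> still counts.
    if not "".join(nav.itertext()).strip():
        return "blank"
    return None
```

Running this over each query option topic would flag the two pages fixed
here before they reach the generated navigation.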
Change-Id: I2350baffeb216655380d4055bff8bdc09457ec3f
Reviewed-on: http://gerrit.cloudera.org:8080/5682
Reviewed-by: Laurel Hale <laurel@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
That's an alias for audience="Cloudera",
which will be removed in a subsequent CR.
This 2nd patch set is just for practice amending the
commit message without touching any files.
Change-Id: I0771ccf912d8112194ad52a7fa76b092ea6cff72
Reviewed-on: http://gerrit.cloudera.org:8080/5591
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
A 'category' metatag with the value 'CDH'.
There's one other instance of this metatag,
but it's in impala_cdh.xml which is getting
deleted in a different code review.
Change-Id: Ic0001cd027435f1a8a4fd02ad28e169313e5a6f4
Reviewed-on: http://gerrit.cloudera.org:8080/5627
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
The errata page is in an early experimental form and not expected
to be ready for prime time anytime soon. Depends on some
automation that's not practical to put in place during the
transition to Apache infra. That page is chock full of references
to CDH version numbers, so best to make sure it can't possibly
appear in the output.
Using 'Cloudera' condition instead of 'pre-release' to clarify that
the errata topic is experimental and will remain invisible for the
foreseeable future.
Also applying the same attribute to the corresponding <keydef> tag
so the hidden topic doesn't get processed in some unforeseen way.
Will change those conditions to 'hidden' after the other changes
to audience= attributes land.
Change-Id: Id254192b325ba18c415a2cf17e6357800d81c437
Reviewed-on: http://gerrit.cloudera.org:8080/5625
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
Some instances of 'CDH' in the doc metadata might show up
in variable substitutions or unexpected places generally,
such as the PDF title page or inside cover. Switching them
all to Impala-only numbers. Filling in Impala 2.8 as the
'current' number to reflect what will be the next release.
Also remove another instance of 'Cloudera' in the title.
Change-Id: I69e855aa958a45b3286a27fa69bcf8df315fc174
Reviewed-on: http://gerrit.cloudera.org:8080/5626
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
Initial experiment to see how the doc looks & builds when the biggest,
most obvious pieces of Cloudera Manager-related content are suppressed:
- Primarily entire subtopics.
- In some cases, individual notes, paragraphs, and list items.
- Also the CM reference in the reused "cancel" text which showed up
many times throughout the SQL syntax section.
Just hiding in the output for now, rather than removing from source,
because we might find there's generic material we need to hoist out of
the CM-related subtopics.
Was planning to use audience="hidden" to hide, but since that CR hasn't
landed in master yet, still using audience="Cloudera" for the moment,
and will switch attribute values later.
A little rewording of titles and suppressing smaller elements such as
paragraphs where the CM aspect didn't apply to the entire subtopic.
Change-Id: Ic799f77bc758a1cf40c53fa412bf02d852901a69
Reviewed-on: http://gerrit.cloudera.org:8080/5607
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
While we are working on excising it, we don't want users to be
confused about what the manual is intended to describe.
Change-Id: I7740189fd7ff7f22d8471f037e190d9923521936
Reviewed-on: http://gerrit.cloudera.org:8080/5610
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Tested-by: Impala Public Jenkins
Removes some of the Cloudera-specific references that can't be removed
with widely-applicable automation.
Change-Id: Ib3f63fb309e0617d7fe014231bb0ab0ad67c8474
Reviewed-on: http://gerrit.cloudera.org:8080/5239
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
This commit also removes the `DISTRIBUTE`, `SPLIT`, and `BUCKETS`
keywords, which were going to be newly released in Impala 2.6 but are
now unused. Additionally, a few remaining uses of the
`DISTRIBUTE BY` syntax have been switched to `PARTITION BY`.
Change-Id: I32fdd5ef26c532f7a30220db52bdfbf228165922
Reviewed-on: http://gerrit.cloudera.org:8080/5382
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: Internal Jenkins
This now gives a clean RAT check with bin/check-rat-report.py, which
is one way for the Impala community to check compliance with ASF rules
on intellectual property.
Change-Id: I2ad06435f84a65ba126759e42a18fdaf52cd7036
Reviewed-on: http://gerrit.cloudera.org:8080/5232
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Reviewed-by: John Russell <jrussell@cloudera.com>
The docs can be built by running "make" from the docs directory. This
does not hook into buildall.sh for now, as users who run buildall.sh
do not usually edit docs/.
Change-Id: I9551b75268cb7cb29a58367a3ef03b127dccbfca
Reviewed-on: http://gerrit.cloudera.org:8080/5238
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Reviewed-by: Greg Rahn <grahn@cloudera.com>
Tested-by: Jim Apple <jbapple-impala@apache.org>
These are refugees from doc_prototype. They can be rendered with the
DITA Open Toolkit version 2.3.3 by:
  /tmp/dita-ot-2.3.3/bin/dita \
    -i impala.ditamap \
    -f html5 \
    -o $(mktemp -d) \
    -filter impala_html.ditaval
Change-Id: I8861e99adc446f659a04463ca78c79200669484f
Reviewed-on: http://gerrit.cloudera.org:8080/5014
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>