21 Commits

Author SHA1 Message Date
John Russell
eaefbb90ce IMPALA-1654: [DOCS] DDL for multiple partitions
Syntax and usage notes for ALTER TABLE,
COMPUTE STATS, and SHOW FILES.

Mixed in a little bit with new Kudu syntax for
ALTER TABLE. Didn't include all new Kudu info
in this CR, the better to minimize merge conflicts.

Added note about performance/scalability of IMPALA-1654.

Added new Known Issue item for IMPALA-4106 under Performance category.

Change-Id: I2060552d5081e5f93b1b1f398414c52fa03f215b
Reviewed-on: http://gerrit.cloudera.org:8080/5726
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-25 18:39:12 +00:00
John Russell
7d9e766207 Improve substitution variables.
Fix the version numbers that show up in titles
and in lead-ins, e.g. "____ or higher only", to
reflect purely Impala release numbers.

Substitute the most prevalent forms of reference
to CDH and Cloudera support with generic equivalents.

Resolved merge conflict with the code review to reorg
all the keydefs into a separate file.

Change-Id: Ia23b96a8344e6430528701b81bea7eaaccc72b18
Reviewed-on: http://gerrit.cloudera.org:8080/5661
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-18 20:05:53 +00:00
John Russell
8377b9949c Global search/replace: audience="Cloudera" -> audience="hidden".
For this change to land in master, the audience="hidden" code review
needs to be completed first. Otherwise, the doc build would still work
but the audience="hidden" content would be visible rather than hidden as
desired.

Some work happening in parallel might introduce additional instances of
audience="Cloudera". I suggest addressing those in a followup CR so this
global change can land quickly.

Since the changes apply across so many different files, but are so
narrow in scope, I suggest that the way to validate (check that no
extraneous changes were introduced accidentally) is to diff just the
changed lines:

git diff -U0 HEAD^ HEAD

In patch set 2, I updated other topics marked audience="Cloudera"
by CRs that were pushed in the meantime.
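
The validation suggested above could be automated by filtering the
zero-context diff for changed lines that mention neither audience value.
This is only a sketch, not part of the original change: the function name
extraneous_changes is hypothetical, and the filter assumes every legitimate
changed line carries the audience attribute itself.

```shell
# Read a unified diff on stdin and print the added/removed lines that do
# NOT mention audience="Cloudera" or audience="hidden".
# Empty output suggests no extraneous edits slipped in.
extraneous_changes() {
  grep -E '^[+-]' \
    | grep -Ev '^(\+\+\+|---) ' \
    | grep -Ev 'audience="(Cloudera|hidden)"' \
    || true   # grep exits non-zero when nothing is left; that is the good case
}

# Usage inside a checkout of the change:
#   git diff -U0 HEAD^ HEAD | extraneous_changes
```

The second grep drops the "--- a/file" and "+++ b/file" headers, so a
removed line whose own content begins with "-- " would be hidden as well.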

Change-Id: Ic93d89da77e1f51bbf548a522d98d0c4e2fb31c8
Reviewed-on: http://gerrit.cloudera.org:8080/5613
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-18 19:31:57 +00:00
John Russell
2665866c16 IMPALA-4643: Centralize <keydef> tags in separate .ditamap
Patch set #2 to bring this gerrit review
up to date with subsequent changes to .ditamap
file in master.

Also incorporate <topicref> and corresponding
<keydef> for SCRATCH_LIMIT query option.

Change-Id: I4d3098356e1b112ba08bfaf7386c3a1f30306223
Reviewed-on: http://gerrit.cloudera.org:8080/5599
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-13 22:28:28 +00:00
John Russell
09c77a879e Clean up <navtitle> tags for 2 query options
One query option was missing a <navtitle> tag.

The other had a <navtitle> tag but it was blank.

The result could have been inconsistent or missing
link text in auto-generated navigational links
such as "next page" and "previous page".

Change-Id: I2350baffeb216655380d4055bff8bdc09457ec3f
Reviewed-on: http://gerrit.cloudera.org:8080/5682
Reviewed-by: Laurel Hale <laurel@cloudera.com>
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-13 19:58:46 +00:00
John Russell
6a2c9049ff IMPALA-3671: Add SCRATCH_LIMIT query option.
Clarify comment about # of DataNodes.

Change-Id: I662a59a59da8a170a2710d4a5245363ae1e3f754
Reviewed-on: http://gerrit.cloudera.org:8080/5651
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-11 23:38:00 +00:00
Ambreen Kazi
63a9f3e194 Change one instance of namenode to NameNode
Change-Id: Ibf5f387fa07cf988a99b9ede4066bfd2b27afe16
Reviewed-on: http://gerrit.cloudera.org:8080/5594
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
2017-01-10 01:12:58 +00:00
John Russell
7b1913afbc Apply attribute to hide main data source topic in keydef tag.
Use the same audience= attr as the corresponding topicref tag.

Change-Id: I5c7d3bf9a2a2c3fe73873fea95d025d838e987bb
Reviewed-on: http://gerrit.cloudera.org:8080/5592
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:38:24 +00:00
John Russell
62ec6de367 Add audience="hidden" to DITAVal.
That's an alias for audience="Cloudera",
which will be removed in a subsequent CR.

This 2nd patch set is just for practice amending the
commit message without touching any files.

Change-Id: I0771ccf912d8112194ad52a7fa76b092ea6cff72
Reviewed-on: http://gerrit.cloudera.org:8080/5591
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:30:46 +00:00
John Russell
f3de156966 Remove one instance of invisible metatag.
A 'category' metatag with the value 'CDH'.

There's one other instance of this metatag,
but it's in impala_cdh.xml which is getting
deleted in a different code review.

Change-Id: Ic0001cd027435f1a8a4fd02ad28e169313e5a6f4
Reviewed-on: http://gerrit.cloudera.org:8080/5627
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:23:53 +00:00
John Russell
1072d076e4 Make sure 'errata' page is hidden.
The errata page is in an early experimental form and not expected
to be ready for prime time anytime soon. Depends on some
automation that's not practical to put in place during the
transition to Apache infra. That page is chock full of references
to CDH version numbers, so best to make sure it can't possibly
appear in the output.

Using 'Cloudera' condition instead of 'pre-release' to clarify that
the errata topic is experimental and will remain invisible for the
foreseeable future.

Also applying the same attribute to the corresponding <keydef> tag
so the hidden topic doesn't get processed in some unforeseen way.

Will change those conditions to 'hidden' after the other changes
to audience= attributes land.

Change-Id: Id254192b325ba18c415a2cf17e6357800d81c437
Reviewed-on: http://gerrit.cloudera.org:8080/5625
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:23:39 +00:00
John Russell
1184b33cd4 IMPALA-3398: Remove references to a particular Cloudera server.
Sometimes the .cloudera.com hostname of a test machine leaked into code
examples or output. Replacing with a .example.com equivalent.
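
The commit message does not say how the replacement was made. As an
illustration only, a substitution of this kind could be sketched with sed;
the function name scrub_hostnames and the hostname pattern are assumptions,
not the method actually used.

```shell
# Rewrite any hostname ending in .cloudera.com to the same name under
# .example.com. Reads stdin, writes stdout.
# Addresses such as user@cloudera.com are left alone because the pattern
# requires a host label and a dot before "cloudera.com".
scrub_hostnames() {
  sed -E 's/([A-Za-z0-9-]+)\.cloudera\.com/\1.example.com/g'
}

# Usage (hypothetical file name):
#   scrub_hostnames < some_topic.xml > some_topic.xml.new
```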

Change-Id: I7a641e11668fbe45af3e37cbd55f4955a996f3cf
Reviewed-on: http://gerrit.cloudera.org:8080/5621
Reviewed-by: Ambreen Kazi <ambreen.kazi@cloudera.com>
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:19:21 +00:00
John Russell
37827ed17b Take out (stale) architecture diagram w/ Cloudera-specific terminology.
Change-Id: Iab3786f113a5492f3aeac7480bfbfdb030cedf5b
Reviewed-on: http://gerrit.cloudera.org:8080/5596
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:17:54 +00:00
John Russell
e98a65c65f IMPALA-3402: Change version numbers in shared variables file.
Some instances of 'CDH' in the doc metadata might show up
in variable substitutions or unexpected places generally,
such as the PDF title page or inside cover. Switching them
all to Impala-only numbers. Filling in Impala 2.8 as the
'current' number to reflect what will be the next release.

Also remove another instance of 'Cloudera' in the title.

Change-Id: I69e855aa958a45b3286a27fa69bcf8df315fc174
Reviewed-on: http://gerrit.cloudera.org:8080/5626
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-07 00:12:58 +00:00
John Russell
c4ee03a7e5 IMPALA-3401: Suppress blocks of content devoted to Cloudera Manager.
Initial experiment to see how the doc looks & builds when the biggest,
most obvious pieces of Cloudera Manager-related content are suppressed:
- Primarily entire subtopics.
- In some cases, individual notes, paragraphs, and list items.
- Also the CM reference in the reused "cancel" text which showed up
  many times throughout the SQL syntax section.

Just hiding in the output for now, rather than removing from source,
because we might find there's generic material we need to hoist out of
the CM-related subtopics.

Was planning to use audience="hidden" to hide, but since that CR hasn't
landed in master yet, still using audience="Cloudera" for the moment,
and will switch attribute values later.

Also reworded a few titles and suppressed smaller elements such as
paragraphs where the CM aspect didn't apply to the entire subtopic.

Change-Id: Ic799f77bc758a1cf40c53fa412bf02d852901a69
Reviewed-on: http://gerrit.cloudera.org:8080/5607
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: John Russell <jrussell@cloudera.com>
2017-01-05 23:55:45 +00:00
Jim Apple
81c3653b96 Add disclaimer to docs: Cloudera-specific info still present.
While we are working on excising it, we don't want users to be
confused about what the manual is intended to describe.

Change-Id: I7740189fd7ff7f22d8471f037e190d9923521936
Reviewed-on: http://gerrit.cloudera.org:8080/5610
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Tested-by: Impala Public Jenkins
2017-01-05 22:57:37 +00:00
Jim Apple
b222d90bd8 IMPALA-3398: Rework Impala documentation to be non-Cloudera-specific
Removes some of the Cloudera-specific references that can't be removed
with widely-applicable automation.

Change-Id: Ib3f63fb309e0617d7fe014231bb0ab0ad67c8474
Reviewed-on: http://gerrit.cloudera.org:8080/5239
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
2016-12-13 22:02:32 +00:00
Dan Burkert
f83652c1da Replace INTO N BUCKETS with PARTITIONS N in CREATE TABLE
This commit also removes the `DISTRIBUTE`, `SPLIT`, and `BUCKETS`
keywords, which were going to be newly released in Impala 2.6 but are
now unused. Additionally, a few remaining uses of the `DISTRIBUTE BY`
syntax have been switched to `PARTITION BY`.

Change-Id: I32fdd5ef26c532f7a30220db52bdfbf228165922
Reviewed-on: http://gerrit.cloudera.org:8080/5382
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: Internal Jenkins
2016-12-07 07:31:16 +00:00
Jim Apple
d484d2f684 Add Apache license header to files in doc directory
This now gives a clean RAT check with bin/check-rat-report.py, which
is one way for the Impala community to check compliance with ASF rules
on intellectual property.

Change-Id: I2ad06435f84a65ba126759e42a18fdaf52cd7036
Reviewed-on: http://gerrit.cloudera.org:8080/5232
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Reviewed-by: John Russell <jrussell@cloudera.com>
2016-12-02 23:54:32 +00:00
Jim Apple
83f777da71 Start a docs build system.
The docs can be built by running "make" from the docs directory. This
does not hook into buildall.sh for now, as users who run buildall.sh
do not usually edit docs/.

Change-Id: I9551b75268cb7cb29a58367a3ef03b127dccbfca
Reviewed-on: http://gerrit.cloudera.org:8080/5238
Reviewed-by: Tim Armstrong <tarmstrong@cloudera.com>
Reviewed-by: Greg Rahn <grahn@cloudera.com>
Tested-by: Jim Apple <jbapple-impala@apache.org>
2016-12-01 19:34:29 +00:00
Jim Apple
3be0f122a5 IMPALA-3398: Add docs to main Impala branch.
These are refugees from doc_prototype. They can be rendered with the
DITA Open Toolkit version 2.3.3 by:

/tmp/dita-ot-2.3.3/bin/dita \
  -i impala.ditamap \
  -f html5 \
  -o $(mktemp -d) \
  -filter impala_html.ditaval
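
Because the command above sends output to an unnamed temporary directory,
it may help to keep that path in a variable. The variant below is a sketch
under assumptions: DITA_OT_DIR and out_dir are hypothetical names, and the
dita flags are copied unchanged from the command above.

```shell
# Assumption: DITA-OT 2.3.3 is unpacked at this location, as in the
# command above; override DITA_OT_DIR for a different install.
DITA_OT_DIR="${DITA_OT_DIR:-/tmp/dita-ot-2.3.3}"

# Remember the temporary output directory so the HTML can be found later.
out_dir="$(mktemp -d)"

if [ -x "$DITA_OT_DIR/bin/dita" ]; then
  # Run from the directory that contains impala.ditamap.
  "$DITA_OT_DIR/bin/dita" \
    -i impala.ditamap \
    -f html5 \
    -o "$out_dir" \
    -filter impala_html.ditaval
  echo "HTML written to $out_dir"
else
  echo "dita not found under $DITA_OT_DIR; nothing rendered" >&2
fi
```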

Change-Id: I8861e99adc446f659a04463ca78c79200669484f
Reviewed-on: http://gerrit.cloudera.org:8080/5014
Reviewed-by: John Russell <jrussell@cloudera.com>
Tested-by: John Russell <jrussell@cloudera.com>
2016-11-17 22:38:44 +00:00