mirror of
https://github.com/apache/impala.git
synced 2025-12-19 09:58:28 -05:00
IMPALA-13071: Update the doc of Impala components
Change-Id: I83192110d29c4d44529d1276a17c9da4a91435aa
Reviewed-on: http://gerrit.cloudera.org:8080/21621
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Reviewed-by: Quanlong Huang <huangquanlong@gmail.com>
committed by Quanlong Huang
parent 72e38f0e75
commit b1941c8f17
@@ -141,20 +141,25 @@ under the License.
<conbody>
<p> The Impala component known as the Catalog Service relays the metadata
changes from Impala SQL statements to all the Impala daemons in a
cluster. It is physically represented by a daemon process named
<codeph>catalogd</codeph>. You only need such a process on one host in
a cluster. Because the requests are passed through the StateStore
daemon, it makes sense to run the <cmdname>statestored</cmdname> and
<cmdname>catalogd</cmdname> services on the same host. </p>
<p> The Impala component known as the Catalog Service relays the metadata changes from Impala
SQL statements to all the Impala coordinators in a cluster. It is physically represented by
a daemon process named <codeph>catalogd</codeph>. You only need such a process on one host
in a cluster. Because the requests are passed through the StateStore daemon, it makes sense
to run the <cmdname>statestored</cmdname> and <cmdname>catalogd</cmdname> services on the
same host. </p>
<p> The catalog service avoids the need to issue <codeph>REFRESH</codeph>
and <codeph>INVALIDATE METADATA</codeph> statements when the metadata
changes are performed by statements issued through Impala. When you
create a table, load data, and so on through Hive, you do need to issue
<codeph>REFRESH</codeph> or <codeph>INVALIDATE METADATA</codeph> on an
Impala daemon before executing a query there. </p>
<p> The catalog service avoids the need to issue <codeph>REFRESH</codeph> and
<codeph>INVALIDATE METADATA</codeph> statements when the metadata changes are performed by
statements issued through Impala.
</p>
<p> When you create a table, load data, and so on through Hive, you do need to issue
<codeph>REFRESH</codeph> or <codeph>INVALIDATE METADATA</codeph> on an Impala daemon
before executing a query. Performing <codeph>REFRESH</codeph> or <codeph>INVALIDATE
METADATA</codeph> is not required when <cite>Automatic Invalidation/Refresh of
Metadata</cite> is enabled. See <xref href="impala_metadata.xml#impala_metadata">Automatic
Invalidation/Refresh of Metadata</xref> also known as the Hive Metastore (HMS) event
processor.<note id="note_eyx_qcp_fcc" type="note">From Impala 4.1, Automatic
Invalidation/Refresh of Metadata is enabled by default.</note></p>
<p>
This feature touches a number of aspects of Impala: