The Impala server is a distributed, massively parallel processing (MPP) database engine. It consists of
different daemon processes that run on specific hosts within your
The core Impala component is a daemon process that runs on each DataNode of the cluster, physically represented
by the
You can submit a query to the Impala daemon running on any DataNode, and that instance of the daemon serves as the
The Impala daemons are in constant communication with the
They also receive broadcast messages from the
In
Related information:
The Impala component known as the
Because the statestore's purpose is to help when things go wrong, it is not critical to the normal operation of an Impala cluster. If the statestore is not running or becomes unreachable, the Impala daemons continue running and distributing work among themselves as usual; the cluster just becomes less robust if other Impala daemons fail while the statestore is offline. When the statestore comes back online, it re-establishes communication with the Impala daemons and resumes its monitoring function.
Related information:
The Impala component known as the
The catalog service avoids the need to issue
This feature touches a number of aspects of Impala:
See
The
Related information: