IMPALA-13829: Postpone catalog deleteLog GC for waitForHmsEvent requests

mirror of https://github.com/apache/impala.git synced 2025-12-19 18:12:08 -05:00

When a db/table is removed in the catalog cache, catalogd assigns it a
new catalog version and put it into the deleteLog. This is used for the
catalog update thread to collect deletion updates. Once the catalog
update thread collects a range of updates, it triggers GC in the
deleteLog to clear items older than the last sent catalog version. The
deletions will be broadcasted by statestore to all the coordinators
eventually.

However, waitForHmsEvent requests is also a consumer of the deleteLog
and could be impacted by these GCs. waitForHmsEvent is a catalogd RPC
used by coordinators when a query wants to wait until the related
metadata is in synced with HMS. The response of waitForHmsEvent returns
the latest metadata including the deletions on related dbs/tables.
If the related deletions in deleteLog is GCed just before the
waitForHmsEvent request collects the results, they will be missing in
the response. Coordinator might keep using stale metadata of
non-existing dbs/tables.

This is a quick fix for the issue by postponing deleteLog GC in a
configurable number of topic updates, similar to what we have done on
the TopicUpdateLog. A thorough fix might need to carefully choose the
version to GC or let impalad waits for the deletions from statestore to
arrive.

A new flag, catalog_delete_log_ttl, is added for this. The deleteLog
items can survive for catalog_delete_log_ttl catalog updates. The
default is 60 so a deletion can survive for at least 120s. It should be
safe enough, i.e. the GCed deletions must have arrived in the impalad
side after 60 rounds of catalog updates, otherwise that's an abnormal
impalad and already has other more severe issues, e.g. lots of stale
tables due to metadata out of sync with catalogd.

Note that postponing deleteLog GCs might increase the memory
consumption. But since most of its memory is used by db/table/partition
names, the memory usage might still be trivial comparing to other
metadata like file descriptors and incremental stats in lived catalog
objects.

This patch also removed some unused imports.

Tests:
 - Added e2e test with a debug action to reproduce the issue. Ran the
   test 100 times. Without the fix, it consistently fails when runs for
   2-3 times.

Change-Id: I2441440bca2b928205dd514047ba742a5e8bf05e
Reviewed-on: http://gerrit.cloudera.org:8080/22816
Reviewed-by: Riza Suminto <riza.suminto@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>

This commit is contained in:

stiga-huang

2025-04-25 09:58:12 +08:00

committed by

Impala Public Jenkins

parent 0e3ae5c339

commit 56b465d91f

11 changed files with 131 additions and 19 deletions

									
										2

common/thrift/BackendGflags.thrift
									
												View File
												
				@@ -325,4 +325,6 @@ struct TBackendGflags {

				  146: required string catalog_config_dir

				  147: required i32 catalog_partial_fetch_max_files

				  148: required i32 catalog_delete_log_ttl

				}

									
										3

common/thrift/CatalogService.thrift
									
												View File
												
				@@ -794,6 +794,9 @@ struct TWaitForHmsEventRequest {

				  // Whether to check tables used by views.

				  7: optional bool should_expand_views = false

				  // Passes the debug actions to catalogd if the query option is set.

				  8: optional string debug_action

				}

				struct TWaitForHmsEventResponse {

IMPALA-13829: Postpone catalog deleteLog GC for waitForHmsEvent requests

2 common/thrift/BackendGflags.thrift Unescape Escape View File

3 common/thrift/CatalogService.thrift Unescape Escape View File

2

common/thrift/BackendGflags.thrift

View File

3

common/thrift/CatalogService.thrift

View File