diff --git a/docs/topics/impala_iceberg.xml b/docs/topics/impala_iceberg.xml index 71d4d2745..2921e2b03 100644 --- a/docs/topics/impala_iceberg.xml +++ b/docs/topics/impala_iceberg.xml @@ -879,6 +879,12 @@ ORDER BY made_current_at; values in the HMS may be stale.

+ Some engines, e.g. Trino, also write the NDV as a property (with key "ndv") in the + "statistics" section of the metadata.json file for each blob, in addition to the + Puffin file. If such a property is present for a blob, Impala will read the value + from the metadata.json file instead of the Puffin file to reduce file I/O. +

+

Note that it is currently not possible to drop Puffin stats from Impala. For this reason, it is possible to disable reading Puffin stats in two ways: