impala

mirror of https://github.com/apache/impala.git synced 2025-12-19 18:12:08 -05:00

Files

Gabriella Gyorgyevics c4c9adf592 IMPALA-14386: Add benchmarks for Byte Stream Split encoding

This patch adds benchmarks to the Byte Stream Split encoding. It
compares different ways to use the decoder.

I added benchmarks for the following comparisons:
  * Compile VS Runtime initialized decoder
  * Float VS Int VS Double VS Long VS 6 and 11 byte size types
  * Repeating VS Sequential VS Random ordered data
  * Decoding one by one VS in batch VS with stride (!= byte_size)
  * Small VS Medium (10x small) VS Large (100x small) stride

Conclusions:
  * Passing the byte size as a template parameter is almost 5 times
    as fast as passing it in the constructor.
  * The size of the type heavily influences the speed
  * The data variation doesn't influence the speed at all
  * Reading values in batch is much faster than one-by-one
  * The stride sizes have a small influence on the speed

For more details and graphs, go to
https://docs.google.com/spreadsheets/d/129LwvR6gpZInlRhlVWktn6Haugwo_fnloAAYfI0Qp2s

Change-Id: I708af625348b0643aa3f37525b8a6e74f0c47057
Reviewed-on: http://gerrit.cloudera.org:8080/23401
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>

2025-11-19 17:49:42 +00:00

generated-sources

…

src

IMPALA-14386: Add benchmarks for Byte Stream Split encoding

2025-11-19 17:49:42 +00:00

.gitignore

…

.impala.doxy

…

CMakeLists.txt

IMPALA-13237: [Patch 8] - OpenTelemetry Traces for DML/DDL Queries and Handle Leading Comments

2025-09-03 04:38:36 +00:00