impala/testdata/workloads/functional-query/queries/QueryTest/aggregation.test at e94de02469a3dfeda7d9358d7a78cb54c7a67159

mirror of https://github.com/apache/impala.git synced 2026-01-01 00:00:20 -05:00

Files

Matthew Jacobs b83aa4984b Add compute histograms aggregate function

Adds an aggregate function to compute equi-depth histograms. The UDA
creates a sample of the column values using weighted reservoir sampling
and computes the histogram from the sorted sample.

TODO:
* Extract highly frequent values into separate buckets (i.e. 'compressed
  histogram').
* Expose separate finalize fn to produce samples and histogram data for stats

Change-Id: I314ce5fb8c73b935c4d61ea5bbd6816c59b3b41e
Reviewed-on: http://gerrit.ent.cloudera.com:8080/3552
Reviewed-by: Matthew Jacobs <mj@cloudera.com>
Tested-by: jenkins
(cherry picked from commit c5c475712f88244e15160befaf4e99d6e165a148)
Reviewed-on: http://gerrit.ent.cloudera.com:8080/3608

2014-07-25 00:21:10 -07:00

24 KiB

Raw Blame History

View Raw

24 KiB Raw Blame History

24 KiB

Raw Blame History