impala

mirror of https://github.com/apache/impala.git synced 2026-01-23 21:00:25 -05:00

Files

Attila Jeges d3da875684 IMPALA-9498: Allow returning arrays in select list

Until now ARRAYs had to be unnested in queries. This patch adds
support to return ARRAYs as STRINGs (JSON arrays) in select list,
for example:
select id, int_array from functional_parquet.complextypestbl where id = 1;
returns: 1, [1,2,3]

Returning ARRAYs from inline or HMS views is also supported -
these arrays can be used both in the select list or as relative
table references. Using them as non-relative table reference is
not supported (IMPALA-11052).

Though STRUCTs are already supported, ARRAYs and STRUCTs nested in
each other are not supported yet.

Things intentionally postponed for later commits:
- Add MAP suppport too - this shouldn't be too tricky after
  ARRAY support, but I don't want to make this patch even more
  complex.
- Unify HS2 / Beeswax logic with the way STRUCTs are handled.
  This could be done in a "final" logic that can handle
  STRUCTS/ARRAYS nested to each other
- Implement "deep copy" and "deep serialize" for ARRAYs in BE.
  This would enable all operators, e.g. ORDER BY and UNION.

Testing:
- FE tests were added for analyses and authorization
- EE tests were added
- core tests were ran

Change-Id: Ibb1e42ffb21c7ddc033aba0f754b0108e46f34d0
Reviewed-on: http://gerrit.cloudera.org:8080/17811
Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>

2022-02-17 18:51:06 +00:00

functional-planner

IMPALA-9498: Allow returning arrays in select list

2022-02-17 18:51:06 +00:00

functional-query

IMPALA-9498: Allow returning arrays in select list

2022-02-17 18:51:06 +00:00

perf-regression

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

targeted-perf

IMPALA-2581: LIMIT can be propagated down into some aggregations

2021-09-22 20:42:10 +00:00

targeted-stress

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

tpcds

IMPALA-10034: Add remaining TPC-DS queries to workload.

2020-08-24 16:02:45 +00:00

tpcds-insert

IMPALA-10384: Make partition names consistent between BE and FE

2020-12-11 19:51:28 +00:00

tpcds-unmodified

IMPALA-9709: Remove Impala-lzo from the development environment

2020-06-15 23:42:12 +00:00

tpch

IMPALA-10821 Fix TestTPCHJoinQueries.test_outer_joins failed

2021-07-26 22:22:58 +00:00

tpch_nested

IMPALA-9604: Add TPCH-nested tests for column masking

2020-06-17 06:54:50 +00:00

README

Move functional data loading to new framework + initial changes for workload directory structure

2014-01-08 10:44:18 -08:00

README

This directory contains Impala test workloads. The directory layout for the workloads should follow:

workloads/
   <data set name>/<data set name>_dimensions.csv  <- The test dimension file
   <data set name>/<data set name>_core.csv  <- A test vector file
   <data set name>/<data set name>_pairwise.csv
   <data set name>/<data set name>_exhaustive.csv
   <data set name>/queries/<query test>.test <- The queries for this workload