mirror of
https://github.com/apache/impala.git
synced 2026-01-23 21:00:25 -05:00
Until now ARRAYs had to be unnested in queries. This patch adds support to return ARRAYs as STRINGs (JSON arrays) in select list, for example: select id, int_array from functional_parquet.complextypestbl where id = 1; returns: 1, [1,2,3] Returning ARRAYs from inline or HMS views is also supported - these arrays can be used both in the select list or as relative table references. Using them as non-relative table reference is not supported (IMPALA-11052). Though STRUCTs are already supported, ARRAYs and STRUCTs nested in each other are not supported yet. Things intentionally postponed for later commits: - Add MAP suppport too - this shouldn't be too tricky after ARRAY support, but I don't want to make this patch even more complex. - Unify HS2 / Beeswax logic with the way STRUCTs are handled. This could be done in a "final" logic that can handle STRUCTS/ARRAYS nested to each other - Implement "deep copy" and "deep serialize" for ARRAYs in BE. This would enable all operators, e.g. ORDER BY and UNION. Testing: - FE tests were added for analyses and authorization - EE tests were added - core tests were ran Change-Id: Ibb1e42ffb21c7ddc033aba0f754b0108e46f34d0 Reviewed-on: http://gerrit.cloudera.org:8080/17811 Reviewed-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenkins@cloudera.com>
This directory contains Impala test workloads. The directory layout for the workloads should follow: workloads/ <data set name>/<data set name>_dimensions.csv <- The test dimension file <data set name>/<data set name>_core.csv <- A test vector file <data set name>/<data set name>_pairwise.csv <data set name>/<data set name>_exhaustive.csv <data set name>/queries/<query test>.test <- The queries for this workload