impala

mirror of https://github.com/apache/impala.git synced 2026-01-20 12:01:06 -05:00

Author	SHA1	Message	Date
ishaan	1beb8cc36d	Increase Hive's heap size while writing nested tpch. Recently, the full data load started failing because Hive ran out of heap space while writing the nested tpch tables. This patch simply bumps up the heap space, and the query is now successfull. Change-Id: I92d0029659c41417d76a15f703df1d42e5187d5e Reviewed-on: http://gerrit.cloudera.org:8080/776 Reviewed-by: Alex Behm <alex.behm@cloudera.com> Tested-by: Internal Jenkins	2015-09-09 10:32:06 +00:00
ishaan	4b007666eb	IMPALA-2302: Use a permitted value for parquet.block.size while loading nested tpch. Due to a possible change in behaviour in Hive/MR, it is no longer possible to use arbitrarily large values for parquet.block.size. This breaks the loading of nested tpch data on newer Hive. This patch addresses the problem by using a permissble value. Change-Id: Ib5b14651fb579cec6aa8d45bd2253cecb4346eb9 Reviewed-on: http://gerrit.cloudera.org:8080/755 Reviewed-by: Martin Grund <mgrund@cloudera.com> Tested-by: Internal Jenkins	2015-09-05 02:05:11 +00:00
Taras Bobrovytsky	704e3fa6bf	Add loading by partitions option to the loaded_nested script When loading a large nested table using the GROUP_CONCAT function, Impala runs out of memory. We prevent this from happening by adding an option to partition the table and load one partition at a time. Change-Id: I8d517f94ef97e98d36eb8ebc8180865023655114 Reviewed-on: http://gerrit.cloudera.org:8080/448 Reviewed-by: Taras Bobrovytsky <tbobrovytsky@cloudera.com> Tested-by: Internal Jenkins	2015-07-02 03:34:53 +00:00
Casey Ching	060f08ef69	Add tpch_nested_parquet database The database will be used for testing in the future. Change-Id: I60b54b36db9493a5bea308151b4027cd47d73047 Reviewed-on: http://gerrit.cloudera.org:8080/400 Reviewed-by: Ishaan Joshi <ishaan@cloudera.com> Tested-by: Internal Jenkins	2015-06-04 21:18:36 +00:00

4 Commits