A failed test case inside a test file leaves the rest of the test
cases in that file unexecuted. Some test cases modify query options
such as the memory limit and rely on subsequent test cases in the
same file to restore them. When such a test case fails, the query
options are left modified, causing cascading failures in other test
cases that are not expected to run with the modified query options
(e.g. a lowered memory limit). This problem can lead to broken
builds, as recorded in IMPALA-2724 and IMPALA-2824.
This change fixes the problem by checking whether a test case
modified any query options and, if so, restoring those options to
their default values. It assumes that a test does not modify an
option specified in its test vector, so it is safe to restore the
modified query options to their defaults.
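The fix is in the test framework; purely to illustrate the
restore-to-default idea, here is a minimal sketch with hypothetical
names (it is not the actual change):

    #include <map>
    #include <string>

    // Hypothetical sketch of the restore-to-default idea described
    // above; the actual change is in the test framework and these
    // names are made up.
    class QueryOptionGuard {
     public:
      // Snapshot the default option values before the test case runs.
      explicit QueryOptionGuard(std::map<std::string, std::string>* options)
          : options_(options), defaults_(*options) {}

      // Runs after the test case, even if it failed: any option the
      // test modified is put back to its default value.
      ~QueryOptionGuard() {
        for (const auto& entry : defaults_) {
          auto it = options_->find(entry.first);
          if (it == options_->end() || it->second != entry.second) {
            (*options_)[entry.first] = entry.second;
          }
        }
      }

     private:
      std::map<std::string, std::string>* options_;
      const std::map<std::string, std::string> defaults_;
    };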
Change-Id: Ib88d1dcb6a65183e1afc8eef0c764179a9f6a8ce
Reviewed-on: http://gerrit.cloudera.org:8080/1774
Reviewed-by: Michael Ho <kwho@cloudera.com>
Tested-by: Internal Jenkins
This patch makes the ownership of the memory backing the tuple pointers of
a RowBatch dependent on whether the legacy joins and aggs are enabled:
By default, the memory is malloc'd and owned by the RowBatch:
If enable_partitioned_hash_join=true and enable_partitioned_aggregation=true,
then the memory is owned by the RowBatch and is freed upon its
destruction. This mode is more performant, especially with
SubplanNodes in the ExecNode tree, because the tuple pointers are not
transferred and do not have to be re-created in every Reset().
Memory is allocated from MemPool:
Otherwise, the memory is allocated from the RowBatch's tuple pool. As
a result, the pointer memory is transferred just like tuple data, and
must be re-created in Reset(). This mode is required for the legacy
join and agg, which rely on the tuple pointers being allocated from
the RowBatch's tuple pool so that they can acquire ownership of them.
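To make the two modes concrete, here is a simplified sketch; it is
not the actual RowBatch implementation, and the class, field and flag
names below are stand-ins for the real ones:

    #include <cstdint>
    #include <cstdlib>
    #include <vector>

    // Minimal stand-ins for the real classes; illustrative only.
    struct Tuple;

    class MemPool {
     public:
      uint8_t* Allocate(int64_t size) {
        chunks_.emplace_back(size);
        return chunks_.back().data();
      }
     private:
      std::vector<std::vector<uint8_t>> chunks_;
    };

    class RowBatchSketch {
     public:
      RowBatchSketch(int capacity, int tuples_per_row,
                     MemPool* tuple_data_pool, bool partitioned_join_and_agg)
          : tuple_ptrs_size_(capacity * tuples_per_row * sizeof(Tuple*)),
            owns_tuple_ptrs_(partitioned_join_and_agg),
            tuple_data_pool_(tuple_data_pool) {
        if (owns_tuple_ptrs_) {
          // Default mode: malloc'd and owned by the batch, reused
          // across Reset() and freed in the destructor.
          tuple_ptrs_ = reinterpret_cast<Tuple**>(malloc(tuple_ptrs_size_));
        } else {
          // Legacy mode: allocated from the tuple pool, so the
          // pointers can be transferred along with the tuple data.
          tuple_ptrs_ = reinterpret_cast<Tuple**>(
              tuple_data_pool_->Allocate(tuple_ptrs_size_));
        }
      }

      void Reset() {
        if (!owns_tuple_ptrs_) {
          // In the real batch the pool's memory is transferred away;
          // the pointer array therefore has to be re-created here.
          tuple_ptrs_ = reinterpret_cast<Tuple**>(
              tuple_data_pool_->Allocate(tuple_ptrs_size_));
        }
        // Otherwise the malloc'd array is simply reused.
      }

      ~RowBatchSketch() {
        if (owns_tuple_ptrs_) free(tuple_ptrs_);
      }

     private:
      const int64_t tuple_ptrs_size_;
      const bool owns_tuple_ptrs_;
      MemPool* tuple_data_pool_;
      Tuple** tuple_ptrs_ = nullptr;
    };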
Performance impact for nested types:
Initial cluster runs and profiling on nested TPCH identified
excessive malloc/free calls as a major performance bottleneck. This
change paves the way for further optimizations that yielded a 2x
improvement in response time for most nested TPCH queries.
Change-Id: I4ac58b18058ce46b4db89fbe117b0bcad19e9ee7
Reviewed-on: http://gerrit.cloudera.org:8080/807
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Internal Jenkins
The value of PARQUET_FILE_SIZE overflows when RoundUp() is called
because that function returns an int32. Even with this change, the
value will still overflow when calling the HDFS API, since it is
passed to hdfsOpenFile() as the blocksize, which is an int32
parameter (see HDFS-8949).
Changes:
- Return an error if PARQUET_FILE_SIZE is set to a value greater than or equal to 2GB.
- If PARQUET_FILE_SIZE is set in an Impala session to a value greater than or equal to
2GB, then every query will fail with an error message.
- If PARQUET_FILE_SIZE is set to a value greater than or equal to 2GB
via an impalad argument, impalad will not start and will log an error.
- Ceil(), RoundUp() and RoundDown() now return int64.
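To illustrate the overflow, here is a hedged sketch of the before and
after rounding helpers; these are simplified stand-ins, not the
actual Impala functions:

    #include <cstdint>
    #include <iostream>

    // Simplified stand-ins for the rounding helpers, illustrative only.
    // Before: the result was returned as int32, silently truncating
    // values of 2GB and above.
    int32_t RoundUpOld(int64_t value, int64_t factor) {
      return ((value + (factor - 1)) / factor) * factor;  // narrowed to int32
    }

    // After: the result is returned as int64, so a 2GB file size
    // survives the rounding.
    int64_t RoundUpNew(int64_t value, int64_t factor) {
      return ((value + (factor - 1)) / factor) * factor;
    }

    int main() {
      const int64_t two_gb = 2LL * 1024 * 1024 * 1024;  // 2147483648
      std::cout << RoundUpOld(two_gb, 1024) << "\n";  // typically wraps negative
      std::cout << RoundUpNew(two_gb, 1024) << "\n";  // 2147483648
      return 0;
    }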
Change-Id: Ie4f2551b72954e2a57db5594e4789e3f7434d578
Reviewed-on: http://gerrit.cloudera.org:8080/678
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Reviewed-by: Vlad Berindei <vlad.berindei@cloudera.com>
Tested-by: Internal Jenkins
Our .test file parser did not abort a test when it contained a
malformed test case or section. This patch changes that behavior:
the parser now reports an error and treats the test as failed.
Quite a few tests were not well-formed, and were not executed
as a result. This patch fixes those tests.
Arguably, the test file parser should be more flexible about where it
accepts comments, but this patch does not address that problem.
Change-Id: If53358eb0cb958b68e51940b071e64c1d6c3ec6f
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/5468
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: jenkins
This supports both uncompressed and block-compressed formats.
Row-compressed formats are not supported. The type of compression is
specified using the query parameter COMPRESSION_CODEC, with the
values NONE, GZIP, BZIP2, and SNAPPY.
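As a rough illustration of the codec handling (hypothetical names,
not the actual writer code), the option value might be mapped to an
enum like this:

    #include <stdexcept>
    #include <string>

    // Hypothetical mapping from the COMPRESSION_CODEC value to a codec
    // enum; the real option handling lives elsewhere in the backend.
    enum class AvroCodec { NONE, GZIP, BZIP2, SNAPPY };

    AvroCodec ParseCompressionCodec(const std::string& value) {
      if (value == "NONE") return AvroCodec::NONE;
      if (value == "GZIP") return AvroCodec::GZIP;
      if (value == "BZIP2") return AvroCodec::BZIP2;
      if (value == "SNAPPY") return AvroCodec::SNAPPY;
      throw std::invalid_argument("Unsupported COMPRESSION_CODEC: " + value);
    }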
Note: this patch only has basic testing. More extensive testing will
be done when this Avro writer is used in data loading.
Change-Id: Id284bd4f3a28e27e49d56b1127cdc83c736feb61
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/3541
Reviewed-by: Victor Bittorf <victor.bittorf@cloudera.com>
Tested-by: jenkins
It needs to handle the "SET" case. Also add some missing test cases
for "SET" and clean up test_set/set.test.
Change-Id: I34f6005ef17e196d94366e5301251a2987746fbf
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/3620
Reviewed-by: Daniel Hecht <dhecht@cloudera.com>
Tested-by: jenkins
(cherry picked from commit 41890b5a13f9429f058fb12453c78323df11fc7d)
Reviewed-on: http://gerrit.sjc.cloudera.com:8080/3655
Also add support for "SET", which returns a table of query options and
their respective values.
The frontend parses the option into a (key, value) pair, and the
existing backend logic is then used to set the option or to return
the result set.
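As a rough sketch of the (key, value) split (a hypothetical helper;
the real parsing happens in the frontend grammar):

    #include <stdexcept>
    #include <string>
    #include <utility>

    // Hypothetical helper splitting a SET argument such as
    // "mem_limit=2g" into a (key, value) pair; a bare SET with no
    // argument instead returns the table of options and their values.
    std::pair<std::string, std::string> ParseSetOption(const std::string& arg) {
      const std::string::size_type eq = arg.find('=');
      if (eq == std::string::npos || eq == 0) {
        throw std::invalid_argument("Expected <option>=<value>, got: " + arg);
      }
      return {arg.substr(0, eq), arg.substr(eq + 1)};
    }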
Change-Id: I40dbd98537e2a73bdd5b27d8b2575a2fe6f8295b
Reviewed-on: http://gerrit.ent.cloudera.com:8080/3582
Reviewed-by: Daniel Hecht <dhecht@cloudera.com>
Tested-by: jenkins
(cherry picked from commit aa0f6a2fc1d3fe21f22cc7bc56887e1fdb02250b)
Reviewed-on: http://gerrit.ent.cloudera.com:8080/3614