The hash join and tuple stream code did not correctly handle joins
whose right side has very high cardinality but whose tuples have zero
footprint. Any such join with more than 16M tuples on the right side
would crash. When the tuple footprint is zero, an infinite number of
rows fits in one block, but the old way of iterating over the rows of
the stream incremented the row index by 1 to get the next "row",
eventually overflowing the index and hitting a DCHECK.
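
For illustration, a minimal C++ sketch of the overflow follows. This is
not the actual Impala code; the packed row index and its 24-bit
within-block offset are assumptions chosen to match the ~16M threshold.

  #include <cassert>
  #include <cstdint>

  struct RowIdx {
    // Hypothetical packed layout: the within-block offset gets 24 bits,
    // so at most (1 << 24) ~= 16M rows are addressable inside one block.
    static constexpr uint64_t OFFSET_BITS = 24;
    static constexpr uint64_t MAX_OFFSET = (1ULL << OFFSET_BITS) - 1;

    uint64_t block = 0;   // block the row lives in
    uint64_t offset = 0;  // row offset within that block

    void Advance(uint64_t rows_per_block) {
      // Old behaviour: step to the next offset in the current block.
      // With zero-footprint tuples rows_per_block is effectively
      // unbounded, so the offset never wraps to a new block and
      // eventually exceeds what the packed index can hold, tripping
      // the DCHECK-style assert.
      ++offset;
      assert(offset <= MAX_OFFSET && "row offset overflows packed index");
      if (offset >= rows_per_block) {
        ++block;
        offset = 0;
      }
    }
  };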
A second problem was the calculation of the hash table size when the
tuple footprint is zero. In that case a hash table of minimum size
would suffice, but we would instead try to create a very large hash
table to fit the large number of tuples, resulting in OOM errors.
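
A minimal sketch of the intended sizing behaviour is below. The helper
name, the minimum bucket count, and the power-of-two doubling are
illustrative assumptions, not Impala's actual policy.

  #include <cstdint>

  // Hypothetical sizing helper for the hash table build side.
  int64_t HashTableBuckets(int64_t num_tuples, int64_t tuple_footprint) {
    const int64_t MIN_NUM_BUCKETS = 1024;  // assumed minimum size
    if (tuple_footprint == 0) {
      // Per the fix described above: with zero-footprint tuples a
      // minimum-sized table suffices; sizing for num_tuples would
      // allocate a huge table and risk OOM.
      return MIN_NUM_BUCKETS;
    }
    int64_t buckets = MIN_NUM_BUCKETS;
    while (buckets < num_tuples) buckets *= 2;  // next power of two
    return buckets;
  }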
This patch fixes both problems with dedicated calculations of the next
row index in the stream and of the hash table size when the stream
contains tuples with zero footprint.
Change-Id: I12469b9c63581fcbc78c87200de7797eac3428c9
Reviewed-on: http://gerrit.cloudera.org:8080/811
Reviewed-by: Ippokratis Pandis <ipandis@cloudera.com>
Tested-by: Internal Jenkins