Commit Graph

606 Commits

Author SHA1 Message Date
Alex Behm
c9040aee22 IMPALA-111: COUNT(DISTINCT col) returns wrong results -- does not ignore NULLs. 2014-01-08 10:50:09 -08:00
Alex Behm
14557c7bab IMPALA-297: Remove distinction between value_expr and expr in parser. 2014-01-08 10:50:08 -08:00
Marcel Kornacker
21ec49e810 IMPALA-150: Performing dynamic partition insert via Impala on "large" table fails and takes down HDFS
This is solved by repartitioning the input to the hdfs table sinks on the partition key columns of the hdfs
table, so that each partition is only written by a single node.
2014-01-08 10:50:07 -08:00
Alex Behm
558590140c IMPALA-238: Problems inserting into tables with TIMESTAMP partition columns ... 2014-01-08 10:50:05 -08:00
Skye Wanderman-Milne
0c343913fa IMPALA-266: Round() does not output the right precision 2014-01-08 10:50:02 -08:00
Henry Robinson
018028f8e3 IMPALA-269: Throw exception if serde unrecognised 2014-01-08 10:50:01 -08:00
Marcel Kornacker
5bfc477ccc IMPALA-291: Plans should explicitly mention the join strategy 2014-01-08 10:49:59 -08:00
Alex Behm
132513f98c IMPALA-75: 'at least one equality predicate' error message needs improvement 2014-01-08 10:49:58 -08:00
Henry Robinson
7d2c47ad72 IMPALA-258: Make partition key string encoding Hive-compatible 2014-01-08 10:49:54 -08:00
Alex Behm
abafcf81ff IMPALA-287: Full outer join is missing results. 2014-01-08 10:49:54 -08:00
Alex Behm
4c45bc06c4 IMPALA-84: Predicates not evaluated if select exprs are constant. 2014-01-08 10:49:53 -08:00
Alex Behm
dbe3127383 IMPALA-285: Multiple outer joins with nesting crash impalad 2014-01-08 10:49:53 -08:00
Skye Wanderman-Milne
cc6007cf9e IMPALA-262: Querying text/lzo table that is not indexed causes an impalad segfault 2014-01-08 10:49:52 -08:00
Alex Behm
21685d4f8f Fixed a failed Preconditions check if a join predicate has constants. 2014-01-08 10:49:52 -08:00
Alan Choi
7aadac236d IMPALA-231 fix hbase perf issue by deleteing local java ref 2014-01-08 10:49:51 -08:00
Marcel Kornacker
7bf87a4b54 fix for IMPALA-90/IMPALA-221 2014-01-08 10:49:50 -08:00
Alex Behm
5db3f2cdf5 IMPALA-227: SELECT * on partitioned table returns columns in different order than Hive. 2014-01-08 10:49:48 -08:00
Alex Behm
805fa50d6f IMPALA-67: Constant SELECT clauses do not work in subqueries. 2014-01-08 10:49:48 -08:00
Skye Wanderman-Milne
04bee45af5 Update query test to use dayofyear() 2014-01-08 10:49:47 -08:00
Lenni Kuff
c74b7e41dd Enable insert tests to run against parquet 2014-01-08 10:49:47 -08:00
Nong Li
58631d9ce0 Fix parquet insert .test files. 2014-01-08 10:49:46 -08:00
Alex Behm
2277386d4d IMPALA-225: Compound predicate ranges on partition keys crash impalad. 2014-01-08 10:49:45 -08:00
Nong Li
563cbfa3a8 Enable parquet testing 2014-01-08 10:49:40 -08:00
Lenni Kuff
3e70594484 Fix unstable top-n test case 2014-01-08 10:49:37 -08:00
Lenni Kuff
cba9cd00dd Fix full data load build break due to constructing incorrect HDFS paths 2014-01-08 10:49:34 -08:00
Marcel Kornacker
398e725a23 make broadcast joins the default join strategy 2014-01-08 10:49:34 -08:00
Alex Behm
1b2e8280d4 Fix NULL issues. 2014-01-08 10:49:32 -08:00
Lenni Kuff
e218721386 IMPALA-198: Support setting file format, table comment in CREATE TABLE LIKE statements 2014-01-08 10:49:31 -08:00
Marcel Kornacker
d7e22f44bb Partitioned hash joins
- added PlanNode.numNodes, PlanNode.avgRowSize and PlanNode.computeStats()
- fixing up some cardinality estimates
- Planner now tries to do a cost-based decision between broadcast join and join with full repartitioning (both inputs)
- ExchangeNode now distinguishes between its input and output row descriptor: the output potentially contains more tuples
- fixed problem related to cancellation and concurrent hash table builds.

Not included:
- partitioned joins that take advantage of existing partitions of the inputs; those will have to wait for a follow-on change
2014-01-08 10:49:29 -08:00
Lenni Kuff
558d5ce755 Data loading: Exec DDL statements via Impala and don't recreate metadata if it exists 2014-01-08 10:49:28 -08:00
Nong Li
1fcfb72bc4 IMPALA-145: Fix order by limit 0 crash. 2014-01-08 10:49:27 -08:00
Nong Li
ebab23841a Add back pressure in hdfs-scan-node to prevent excessive buffer queueing. 2014-01-08 10:49:25 -08:00
Skye Wanderman-Milne
a7e15b1417 Update Parquet scanner to only scan a file if assigned the first split.
Also re-enable Parquet tests.
2014-01-08 10:49:25 -08:00
Alex Behm
673d7b97cf IMPALA-190: Insert with NULL partition keys results in SIGSEGV. 2014-01-08 10:49:22 -08:00
Lenni Kuff
15f0313283 Add analysis checks for length of RowFormat strings, fix escaping of row format values 2014-01-08 10:49:21 -08:00
Lenni Kuff
018a72bfe2 IMPALA-189: Properly support NULL partition key values in ALTER .. PARTITION statements 2014-01-08 10:49:21 -08:00
Alan Choi
4a503a4e35 IMP-808 construct runtime state in fe-support to eval now() 2014-01-08 10:49:20 -08:00
Nong Li
329763e5ab Disable parquet tests. 2014-01-08 10:49:20 -08:00
Lenni Kuff
36e9fe1c1a Run compute table stats statements using Hive CLI
This works around a problem with computing table stats via the Hive Meta Store client
API. When executing these stements via the MetaStoreClient, all tables were getting a
num_rows=0 value returned from the ANALYZE TABLE query.
2014-01-08 10:49:19 -08:00
Nong Li
20fc700002 Fix precision issue in text table writer. 2014-01-08 10:49:19 -08:00
Alex Behm
0821e2f826 IMPALA-66: Support for UNION with constant SELECT clauses. 2014-01-08 10:49:18 -08:00
Lenni Kuff
5a0b1270c4 Add support for ALTER ... PARTITION (partitionSpec) SET FILEFORMAT/LOCATION
Adds support for:
* ALTER TABLE <table> PARTITION (partitionSpec) SET FILEFORMAT
* ALTER TABLE <table> PARTITION (partitionSpec) SET LOCATION

This enables setting the location and fileformat of specific partitions.
2014-01-08 10:49:17 -08:00
Lenni Kuff
f4a5c0628f Cleanup HDFS directories before and after running ALTER TABLE tests 2014-01-08 10:49:17 -08:00
Skye Wanderman-Milne
8ef36831f6 Update DataErrorsTest to reflect LZO_MAX_BLOCK_SIZE rename 2014-01-08 10:49:16 -08:00
Lenni Kuff
1fb72fbc73 IMPALA-156: Support core 'ALTER TABLE' DDL command
This patch adds support for
- ALTER TABLE ADD|REPLACE COLUMNS
- ALTER TABLE DROP COLUMN
- ALTER TABLE ADD/DROP PARTITION
- ALTER TABLE SET FILEFORMAT
- ALTER TABLE SET LOCATION
- ALTER TABLE RENAME
2014-01-08 10:49:14 -08:00
Elliott Clark
0e0c02b6bd Add the ability to Select into HBase table.
* Changed frontend analysis for HBase tables
* Changed Thrift messages to allow HBase as a sink type.
* JNI Wrapper around htable
* Create hbase-table-sink
* Create hbase-table-writer
* Static init lots of JNI related code for HBase.
* Cleaned up some cpplint issues.
* Changed junit analysis tests
* Create a new HBase test table.
* Added functional tests for HBase inserts.
2014-01-08 10:49:06 -08:00
Lenni Kuff
993da8fcba Fix bug in how insert tables are generated 2014-01-08 10:49:05 -08:00
ishaan
15658f384b Include targeted performance tests in experiments and add a new query 2014-01-08 10:49:02 -08:00
Lenni Kuff
5f81becd84 Create tables used by insert tests in a supported insert format 2014-01-08 10:49:00 -08:00
Marcel Kornacker
0c36c7f327 Partitioned merge aggregation. 2014-01-08 10:48:59 -08:00