Commit Graph

606 Commits

Author SHA1 Message Date
Alex Behm
52c9d26d16 IMPALA-475: Impala should avoid the use of c_# style autogenerated column aliases unless necessary.
Change-Id: I959e35bcee1698ebc35534dc4f390c5c2c7dc919
Reviewed-on: http://gerrit.ent.cloudera.com:8080/141
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: Alex Behm <alex.behm@cloudera.com>
2014-01-08 10:52:03 -08:00
Alex Behm
9754f5bf52 IMPALA-504: Right and full outer joins do not return row with NULL value for rhs table.
Change-Id: Ia3f8d474fb30189b36fb587b2920d7b9b224ea71
Reviewed-on: http://gerrit.ent.cloudera.com:8080/129
Tested-by: Alex Behm <alex.behm@cloudera.com>
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
2014-01-08 10:52:03 -08:00
Skye Wanderman-Milne
6e7406df8b IMPALA-502: Impala does not return NULL for case where table has extra string column and data does not (it returns an empty string)
Change-Id: I0cfe5ce5fc279d46610a3cc191a501ccbc335296
Reviewed-on: http://gerrit.ent.cloudera.com:8080/127
Reviewed-by: Skye Wanderman-Milne <skye@cloudera.com>
Tested-by: Skye Wanderman-Milne <skye@cloudera.com>
2014-01-08 10:52:02 -08:00
Nong Li
fd53edbbe4 Fix parquet writer bug with not setting dictionary metadata.
Change-Id: Ia5c0886497678d31b82cb5052e06df437bb201be
Reviewed-on: http://gerrit.ent.cloudera.com:8080/114
Reviewed-by: Marcel Kornacker <marcel@cloudera.com>
Tested-by: Marcel Kornacker <marcel@cloudera.com>
2014-01-08 10:52:02 -08:00
Lenni Kuff
faeb7f5fa3 Add scanner test case for scenario where data and table schema do not match
Change-Id: I16f007ad1cb2caac47506914512c5665fc3d5f56
Reviewed-on: http://gerrit.ent.cloudera.com:8080/98
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: Lenni Kuff <lskuff@cloudera.com>
2014-01-08 10:52:01 -08:00
Marcel Kornacker
d85b90cb22 SlotDescriptor.label plus repartitioning for inserts when column stats are missing. 2014-01-08 10:51:56 -08:00
Alan Choi
4c32278782 IMP-973 Reassign HBase regions after HBase restart 2014-01-08 10:51:54 -08:00
Marcel Kornacker
c8afd16bbb IMPALA-85: planner fails with "Join requires at least one equality predicate between the two tables" when "from" table order does not match "where" join order
This fix contains two parts:
- functionality inside the analyzer to compute a value transfer graph (from equality predicates
between slotrefs) and from that equivalence classes for all slots; this functionality is required for this fix
but will be generally useful when adding propagation of binding predicates in the future
- a "shortest path" implementation inside the planner of a fix for the problem at hand; this leaves a lot to be desired:
* correct handling of assigned predicates: the added test case shows that the planner will try to assign all predicates
  to some node in the tree, even if that predicate is superfluous because it was subsumed by an equality derived from
  equivalence class membership
* complete lack of propagation of binding predicates (e.g., propagate "col1 = 5" to all slotrefs that are in the same
  equivalence class as col1)

This is beyond what can be accomplished for 1.1 and therefore will have to wait for 1.2.
2014-01-08 10:51:51 -08:00
ishaan
2f7d24b35b Fix tpch-q18, to not use qualified table names. 2014-01-08 10:51:49 -08:00
ishaan
ece902d953 Fix tpch-q18 to inser into the database associated with its scale-factor. 2014-01-08 10:51:45 -08:00
Skye Wanderman-Milne
3fecdeb793 IMPALA-441: support default values for Avro tables 2014-01-08 10:51:39 -08:00
Alex Behm
8ad15fabcf IMPALA-372: Added CREATE/DROP/ALTER VIEW. 2014-01-08 10:51:35 -08:00
Alex Behm
3bba336bbf IMPALA-359: Return proper tuple id of inline view with distinct aggregation. 2014-01-08 10:51:26 -08:00
Skye Wanderman-Milne
c8a8308ece Avro schema resolution (minus default values) 2014-01-08 10:51:26 -08:00
ishaan
e7c6d57f9c IMP-773: Add better logging/error detection to start-impala-cluster.py 2014-01-08 10:51:25 -08:00
Alan Choi
254ee6ef89 IMPALA-434 Support binary hbase encoding 2014-01-08 10:51:18 -08:00
Skye Wanderman-Milne
e8344bb0d0 Dictionary encoding/decoding 2014-01-08 10:51:15 -08:00
Lenni Kuff
abdfae5b24 Update DESCRIBE FORMATTED results to match the Hive HS2 output 2014-01-08 10:51:14 -08:00
Alex Behm
ece9f76a0b IMP-967: Recognize predicates referring to more than two tuples as eq conjuncts. 2014-01-08 10:51:13 -08:00
Lenni Kuff
c2cfc7e2a3 IMPALA-373: Add support for 'LOAD DATA' statements
This change adds Impala support for LOAD DATA statements. This allows the user
to load one or more files into a table or partition from a given HDFS location. The
load operation only moves files, it does not convert data to match the target
table/partition's file format.
2014-01-08 10:51:02 -08:00
Alex Behm
045038e479 IMPALA-374: Added WITH clause without recursion. 2014-01-08 10:51:00 -08:00
Henry Robinson
79b36a5eb3 IMPALA-375: Add column permutation clause to INSERT statement 2014-01-08 10:50:59 -08:00
Alan Choi
15a3d92492 Qualify table with database 2014-01-08 10:50:57 -08:00
Alan Choi
58687d16b8 IMPALA-406 Raise an error when inserting into HBase table using a null row key. 2014-01-08 10:50:56 -08:00
Lenni Kuff
7ac88e1fa9 IMPALA-400: Add support for SQL statement authorization
This changes adds support for SQL statement authorization in Impala. The authorization
works by updating the Catalog API to require a User + Privilege when getting Table/Db
objects (and in the future can be extended to cover columns as well).
If the user doesn't have permission to access the object, an AuthorizationException is
thrown. The authorization checks are done during analysis as new Catalog objects are
encountered.

These changes build on top of the Hive Access code which handles the actually
processing of authorization requests.  The authorization is currently based
on a "policy file" which will be stored in HDFS. This policy file is read once
on startup and then reloaded every 5 minutes. It can also be reloaded on a
specific impalad by executing a "refresh" command.

Authorization is enabled by setting:
--server_name='server1'
and then pointing the impalad to the policy file using the flag:
--authorization_policy_file=/path/to/policy/file

any authorization configuration problems will result in impalad failing to
start.
2014-01-08 10:50:56 -08:00
Alan Choi
2bdba77f61 Perform HBase deterministic region assigment and enable HBase scan range location test in the planner test 2014-01-08 10:50:54 -08:00
Nong Li
ce092065be Fix bug with how exec sets if the conjuncts are thread safe. 2014-01-08 10:50:53 -08:00
Skye Wanderman-Milne
1ab189c789 Fix build 2014-01-08 10:50:52 -08:00
Skye Wanderman-Milne
c8fd4f8016 IMPALA-362: impalad hangs when read sequence file without contents 2014-01-08 10:50:49 -08:00
Alan Choi
bd59bbb07a IMPALA-300/356
Always reload region server info.
Clear keyRange.start/stopkey before setting it in setKeyRangeStart/End.
Split HBase tables into multiple regions.
I've to disable HBase scanrangelocations planner test because region assigment
is non-deterministic. I'll have a follow up patch to address that.
2014-01-08 10:50:48 -08:00
Lenni Kuff
2e19107496 Fixed TPCH planner test due to column stat changes in CDH4.3.0 hive 2014-01-08 10:50:46 -08:00
Lenni Kuff
2f7198292a Add support for auxiliary workloads, tests, and datasets
This change adds support for auxiliary worksloads, tests, and datasets. This is useful
to augment the regular test runs with some additional tests that do not belong in the
main Impala repo.
2014-01-08 10:50:32 -08:00
Alan Choi
b1de018298 IMPALA-31 Support EXPLAIN <query>
Hue is moving to HiveServer2 but HiveServer2 does not have an "explain" RPC
call. To support "explain", I added it to the language.

An "explain" statement will return a result set: one row per explain line.
2014-01-08 10:50:32 -08:00
Alex Behm
937a44f9f8 IMPALA-68: Support Values() statement. 2014-01-08 10:50:31 -08:00
Alex Behm
c7819f4db7 IMPALA-87: Support INSERT from SELECT without FROM. 2014-01-08 10:50:30 -08:00
ishaan
f026354721 IMP-912 Make force killing an option. Update run-all-tests to pre-emptively force kill. 2014-01-08 10:50:29 -08:00
Alex Behm
9ff09cd3f4 IMPALA-70: Respect tbl properties to allow empty strings to be treated as NULL 2014-01-08 10:50:28 -08:00
Lenni Kuff
627e74a068 Fix insert test failure by cleaning up table before executing query 2014-01-08 10:50:27 -08:00
Lenni Kuff
e0507e192b Fix unstable alter table test 2014-01-08 10:50:26 -08:00
Nong Li
261119b91f Forgot to update the test in previous commit. 2014-01-08 10:50:23 -08:00
Nong Li
8af35425e6 Fix unstable ordering with nans. 2014-01-08 10:50:22 -08:00
Nong Li
68e4c14527 Fix parquet incompatibilities. 2014-01-08 10:50:22 -08:00
Skye Wanderman-Milne
223b1a8e47 IMPALA-293: Impala is unable to query RCFile tables which describe less columns than the file's header. 2014-01-08 10:50:17 -08:00
Henry Robinson
ead69d377f IMPALA-249, IMPALA-252: Fixes for static partition keys. 2014-01-08 10:50:14 -08:00
Lenni Kuff
11556a1ad2 Add targeted perf regression test for IMPALA-288 2014-01-08 10:50:13 -08:00
Skye Wanderman-Milne
f4d8df7119 Don't suppress "incomplete read" bad status, revert DataErrorsTest accordingly. 2014-01-08 10:50:13 -08:00
Lenni Kuff
3f0252c9f3 Fix DataErrors test failures 2014-01-08 10:50:12 -08:00
Nong Li
4235bf5009 Fix planner test result. 2014-01-08 10:50:11 -08:00
Alan Choi
2d25f11ec3 IMPALA-91 new explain plan output 2014-01-08 10:50:10 -08:00
Alex Behm
861ba05989 IMPALA-197: Outer join on constant expressions returns incorrect results. 2014-01-08 10:50:09 -08:00