Commit Graph

1124 Commits

Author SHA1 Message Date
Nong Li
58631d9ce0 Fix parquet insert .test files. 2014-01-08 10:49:46 -08:00
Marcel Kornacker
49160d0ad8 join hints 2014-01-08 10:49:46 -08:00
Henry Robinson
829c0dc948 Remove line break from version metric 2014-01-08 10:49:45 -08:00
Alex Behm
2277386d4d IMPALA-225: Compound predicate ranges on partition keys crash impalad. 2014-01-08 10:49:45 -08:00
Nong Li
ab709cb517 Use fvn hash for data sender and different seeds for each step of the execution. 2014-01-08 10:49:45 -08:00
Henry Robinson
cd9debb792 Typo in HdfsTableSink::GetHashTblKey 2014-01-08 10:49:44 -08:00
Lenni Kuff
16504f2a27 Fix formatting of errors/exceptions returned from FE 2014-01-08 10:49:44 -08:00
Alan Choi
b282175461 IMPA-213 Disable DN server check; disable all checks if impala cannot detech cdh version 2014-01-08 10:49:44 -08:00
ishaan
a3ae4e3252 The shell should inform the user that a command is incomplete by changing its prompt. 2014-01-08 10:49:43 -08:00
Alex Behm
7ccb1b8194 IMPALA-229: The built-in function regexp_extract() return wrong results. 2014-01-08 10:49:43 -08:00
Lenni Kuff
1693ad4eb3 Force kill existing Impala processes when running start-impala-cluster 2014-01-08 10:49:43 -08:00
Nong Li
925223d437 Change query id for debug page urls to the same as all our other query id formats. 2014-01-08 10:49:42 -08:00
Nong Li
57aac373ae Add last refresh time metric. 2014-01-08 10:49:42 -08:00
Nong Li
2a4982ffd4 Remove <pre> tag for encoded profiles. 2014-01-08 10:49:41 -08:00
Nong Li
1f6481382e Fix parquet test setup. 2014-01-08 10:49:41 -08:00
Henry Robinson
189575f23f Version metric for statestored and impalad 2014-01-08 10:49:41 -08:00
Henry Robinson
711e3f1aed IMPALA-220: Type check static partition columns 2014-01-08 10:49:40 -08:00
Nong Li
741599dc2a Move compressed table test out of core. 2014-01-08 10:49:40 -08:00
Nong Li
563cbfa3a8 Enable parquet testing 2014-01-08 10:49:40 -08:00
Nong Li
0dcfbfafed Fix bugs in parquet scanner. 2014-01-08 10:49:39 -08:00
ishaan
3093006ca9 Non-interactive mode in the shell should work with multi-line queries enabled. 2014-01-08 10:49:39 -08:00
Nong Li
c998c30771 Compress encoded runtime profiles and persist them to log. 2014-01-08 10:49:39 -08:00
Alex Behm
2575a34fa3 Improved error reporting when using impala-shell. 2014-01-08 10:49:38 -08:00
Henry Robinson
14d29aa579 Add plan and number of fragment instances to profile 2014-01-08 10:49:38 -08:00
ishaan
3567935f11 IMPALA-78 The shell should support multiline commands. 2014-01-08 10:49:38 -08:00
Lenni Kuff
c0c2c73518 Modify impala-beeswax test library to get runtime profile before query handle is closed 2014-01-08 10:49:37 -08:00
Lenni Kuff
3e70594484 Fix unstable top-n test case 2014-01-08 10:49:37 -08:00
Lenni Kuff
d3375d4aec Amend commit: IMPALA-215: DDL commands stay in the in flight query log
Missed a line when pushing the earlier change.
2014-01-08 10:49:37 -08:00
Henry Robinson
e3a16903d5 IMPALA-224: Unknown target DB for INSERT should not raise an NPE 2014-01-08 10:49:36 -08:00
Henry Robinson
1f9f656247 Throughput counters in data-stream sender 2014-01-08 10:49:36 -08:00
Henry Robinson
b72b711bdb IMPALA-211: Fix excessive logging in state-store subscriber 2014-01-08 10:49:36 -08:00
Lenni Kuff
d0c08eb8d6 IMPALA-215: DDL commands stay in the in flight query log 2014-01-08 10:49:35 -08:00
Nong Li
0891179bbf Add load factor to hash table counters. 2014-01-08 10:49:35 -08:00
Alan Choi
51c47a94d2 Fix volume id in FE 2014-01-08 10:49:35 -08:00
Henry Robinson
1cc976819e Revert "IMPALA-206: Stop INSERT queries from always finishing in EXCEPTION state"
This reverts commit 40ea325b53d3154328686ea1152417b8abbcb2ac.
2014-01-08 10:49:35 -08:00
Lenni Kuff
cba9cd00dd Fix full data load build break due to constructing incorrect HDFS paths 2014-01-08 10:49:34 -08:00
Marcel Kornacker
398e725a23 make broadcast joins the default join strategy 2014-01-08 10:49:34 -08:00
Nong Li
32ee207de4 Fix data errors test. 2014-01-08 10:49:34 -08:00
Alan Choi
0b0367d82e Fix volume id for partitioned table 2014-01-08 10:49:33 -08:00
Alan Choi
612e1b22dc Fix impala-server.scan-ranges.num-missing-volume-id metrics
There's a bug in hdfs-scan-node.cc where we only increment this metrics once per query.
2014-01-08 10:49:33 -08:00
Henry Robinson
02183620d7 IMPALA-206: Stop INSERT queries from always finishing in EXCEPTION state 2014-01-08 10:49:33 -08:00
ishaan
2773f00c21 IMPALA-205 The shell should default to the FQDN of the localhost is no hostname is specified. 2014-01-08 10:49:32 -08:00
Alex Behm
1b2e8280d4 Fix NULL issues. 2014-01-08 10:49:32 -08:00
Nong Li
189f4313dc Fix disk-io-mgr-test. 2014-01-08 10:49:32 -08:00
Lenni Kuff
e218721386 IMPALA-198: Support setting file format, table comment in CREATE TABLE LIKE statements 2014-01-08 10:49:31 -08:00
Alan Choi
c419ae1891 Add 4.1 direct read configuration check
Impala detects the HDFS version by reading the Namenode web UI and run
the corresponding check.

On 4.1, Impala tries to check the datanode (server side) config by reading
the datanode web UI.
2014-01-08 10:49:31 -08:00
Nong Li
8c3287db82 Integrate io mgr and mem limits 2014-01-08 10:49:31 -08:00
Henry Robinson
bc63ac2461 Add destination plan node ID to data-stream sender profile 2014-01-08 10:49:30 -08:00
ishaan
c85acf424d IMPALA-209 desc <tablename> should be the same as describe <tablename> in the shell. 2014-01-08 10:49:30 -08:00
Alan Choi
5f9e26b4a8 Average Scanner Thread Concurrency is a new metrics in the profile that reports
the average number of active scanner thread (i.e. those that are not blocked by
IO).

In the hdfs-scan-node, whenever a thread is started, it will increment the
active_scanner_thread_counter_. When a scanner thread enter the
scan-range-context's GetRawBytes or GetBytes, the counter will be decremented.

A new sampling thread is created to sample the value of
active_scanner_thread_counter_ and compute the average.

A bucket couting of HdfsReadThreadConcurrent is also added.

The output of the hdfs-scan-node profile is also updated. Here's the new output
for hdfs-scan-node after running count(*) from tpch.lineitem.

      HDFS_SCAN_NODE (id=0):(10s254ms 99.75%)
        File Formats: TEXT/NONE:12
        Hdfs split stats (<volume id>:<# splits>/<split lengths>): 0:6/351.21M
(351208888) 1:6/402.65M (402653184)
         - AverageHdfsReadThreadConcurrency: 1.95
           - HdfsReadThreadConcurrencyCountPercentage=0: 0.00
           - HdfsReadThreadConcurrencyCountPercentage=1: 5.00
           - HdfsReadThreadConcurrencyCountPercentage=2: 95.00
           - HdfsReadThreadConcurrencyCountPercentage=3: 0.00
         - AverageScannerThreadConcurrency: 0.15
         - BytesRead: 718.94 MB
         - MemoryUsed: 0.00
         - NumDisksAccessed: 2
         - PerReadThreadRawHdfsThroughput: 36.75 MB/sec
         - RowsReturned: 6.00M (6001215)
         - RowsReturnedRate: 585.25 K/sec
         - ScanRangesComplete: 12
         - ScannerThreadsInvoluntaryContextSwitches: 168
         - ScannerThreadsTotalWallClockTime: 1m40s
           - DelimiterParseTime: 2s128ms
           - MaterializeTupleTime: 723.0us
           - ScannerThreadsSysTime: 10.0ms
           - ScannerThreadsUserTime: 2s090ms
         - ScannerThreadsVoluntaryContextSwitches: 99
         - TotalRawHdfsReadTime: 19s561ms
         - TotalReadThroughput: 68.69 MB/sec
2014-01-08 10:49:30 -08:00