Files
impala/docs/topics/impala_fixed_issues.xml
Jim Apple d484d2f684 Add Apache license header to files in doc directory
This now gives a clean RAT check with bin/check-rat-report.py, which
is one way for the Impala community to check compliance with ASF rules
on intellectual property.

Change-Id: I2ad06435f84a65ba126759e42a18fdaf52cd7036
Reviewed-on: http://gerrit.cloudera.org:8080/5232
Reviewed-by: Jim Apple <jbapple-impala@apache.org>
Tested-by: Impala Public Jenkins
Reviewed-by: John Russell <jrussell@cloudera.com>
2016-12-02 23:54:32 +00:00

7111 lines
262 KiB
XML

<?xml version="1.0" encoding="UTF-8"?><!--
Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
-->
<!DOCTYPE concept PUBLIC "-//OASIS//DTD DITA Concept//EN" "concept.dtd">
<concept rev="ver" id="fixed_issues">
<title><ph audience="standalone">Fixed Issues in Apache Impala (incubating)</ph><ph audience="integrated">Fixed Issues in Apache Impala (incubating)</ph></title>
<prolog>
<metadata>
<data name="Category" value="Impala"/>
<data name="Category" value="Release Notes"/>
<data name="Category" value="Fixed Issues"/>
<data name="Category" value="Troubleshooting"/>
<data name="Category" value="Upgrading"/>
<data name="Category" value="Data Analysts"/>
<data name="Category" value="Developers"/>
<data name="Category" value="Data Analysts"/>
</metadata>
</prolog>
<conbody>
<p>
The following sections describe the major issues fixed in each Impala release.
</p>
<p>
For known issues that are currently unresolved, see <xref href="impala_known_issues.xml#known_issues"/>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<!-- All 2.7.x / 5.9.x subsections go under here -->
<concept rev="2.7.0" id="fixed_issues_270">
<title>Issues Fixed in Impala for CDH 5.9.0</title>
<conbody>
<p>
<!--
The following list contains the most critical fixed issues
(<codeph>priority='Blocker'</codeph>) from the JIRA system.
-->
For the full list of Impala fixed issues in Impala 2.7.0, see
<xref href="https://issues.cloudera.org/issues/?jql=%20type%20%3D%20bug%20and%20project%20%3D%20IMPALA%20AND%20resolution%20%3D%20fixed%20AND%20affectedVersion%20!%3D%20%22Impala%202.7.0%22%20AND%20fixVersion%20%3D%20%22Impala%202.7.0%22%20and%20not%20labels%20%3D%20broken-build%20order%20by%20priority%20desc" scope="external" format="html">this report in the Impala JIRA tracker</xref>.
</p>
<p> For the full list of fixed issues for all CDH components in CDH 5.9.0, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_59.html"
format="html" scope="external">the CDH 5.9.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_59.xml"
/>. </p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_59.xml#ui_fixed_in_59/impala_start"
conrefend="cdh_rn_fixed_in_59.xml#ui_fixed_in_59/impala_end" />
</ul>
</conbody>
</concept>
<!-- All 2.6.x / 5.8.x subsections go under here -->
<concept rev="2.6.3" id="fixed_issues_263">
<title>Issues Fixed in Impala for CDH 5.8.3</title>
<conbody>
<p> For the full list of fixed issues for all CDH components, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_58.html"
format="html" scope="external">the CDH 5.8.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_58.xml#ui_fixed_in_583"
/>. </p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_58.xml#ui_fixed_in_583/impala_start"
conrefend="cdh_rn_fixed_in_58.xml#ui_fixed_in_583/impala_end" />
</ul>
</conbody>
</concept>
<concept rev="2.6.2" id="fixed_issues_262">
<title>Issues Fixed in Impala for CDH 5.8.2</title>
<conbody>
<p> For the full list of fixed issues for all CDH components, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_58.html"
format="html" scope="external">the CDH 5.8.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_58.xml#upstream582"
/>. </p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_58.xml#upstream582/impala_start"
conrefend="cdh_rn_fixed_in_58.xml#upstream582/impala_end" />
</ul>
</conbody>
</concept>
<concept rev="2.6.0" id="fixed_issues_260">
<title>Issues Fixed in Impala for CDH 5.8.0</title>
<conbody>
<p>
The following list contains the most critical fixed issues
(<codeph>priority='Blocker'</codeph>) from the JIRA system.
For the full list of fixed issues in CDH 5.8.0 / Impala 2.6.0, see
<xref href="https://issues.cloudera.org/issues/?jql=%20type%20%3D%20bug%20and%20project%20%3D%20IMPALA%20AND%20resolution%20%3D%20fixed%20AND%20affectedVersion%20!%3D%20%22Impala%202.6.0%22%20AND%20fixVersion%20%3D%20%22Impala%202.6.0%22%20and%20not%20labels%20%3D%20broken-build%20order%20by%20priority%20desc" scope="external" format="html">this report in the Impala JIRA tracker</xref>.
</p>
</conbody>
<concept id="IMPALA-3385">
<title>RuntimeState::error_log_ crashes</title>
<conbody>
<p>
A crash could occur, with stack trace pointing to <codeph>impala::RuntimeState::ErrorLog</codeph>.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3385">IMPALA-3385</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3378">
<title>HiveUdfCall::Open() produces unsynchronized access to JniUtil::global_refs_ vector</title>
<conbody>
<p>
A crash could occur because of contention between multiple calls to Java UDFs.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3378">IMPALA-3378</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3379">
<title>HBaseTableWriter::CreatePutList() produces unsynchronized access to JniUtil::global_refs_ vector</title>
<conbody>
<p>
A crash could occur because of contention between multiple concurrent statements writing to HBase.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3379">IMPALA-3379</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3317">
<title>Stress test failure: sorter.cc:745] Check failed: i == 0 (1 vs. 0) </title>
<conbody>
<p>
A crash or wrong results could occur if the spill-to-disk mechanism encountered a zero-length string at
the very end of a data block.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3317">IMPALA-3317</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3311">
<title>String data coming out of agg can be corrupted by blocking operators</title>
<conbody>
<p>
If a query plan contains an aggregation node producing string values anywhere within a subplan
(that is,if in the SQL statement, the aggregate function appears within an inline view over a collection column),
the results of the aggregation may be incorrect.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3311">IMPALA-3311</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3269">
<title>CTAS with subquery throws AuthzException</title>
<conbody>
<p>
A <codeph>CREATE TABLE AS SELECT</codeph> operation could fail with an authorization error,
due to a slight difference in the privilege checking for the CTAS operation.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3269">IMPALA-3269</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3237">
<title>Crash on inserting into table with binary and parquet</title>
<conbody>
<p>
Impala incorrectly allowed <codeph>BINARY</codeph> to be specified as a column type,
resulting in a crash during a write to a Parquet table with a column of that type.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3237">IMPALA-3237</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3105">
<title>RowBatch::MaxTupleBufferSize() calculation incorrect, may lead to memory corruption</title>
<conbody>
<p>
A crash could occur while querying tables with very large rows, for example wide tables with many
columns or very large string values. This problem was identified in Impala 2.3, but had low
reproducibility in subsequent releases. The fix ensures the memory allocation size is correct.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3105">IMPALA-3105</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3494">
<title>Thrift buffer overflows when serialize more than 3355443200 bytes in impala</title>
<conbody>
<p>
A very large memory allocation within the <cmdname>catalogd</cmdname> daemon could exceed an internal Thrift limit,
causing a crash.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3494">IMPALA-3494</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3314">
<title>Altering table partition's storage format is not working and crashing the daemon</title>
<conbody>
<p>
If a partitioned table used a file format other than Avro, and the file format of an individual partition
was changed to Avro, subsequent queries could encounter a crash.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3314">IMPALA-3314</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
<concept id="IMPALA-3798">
<title>Race condition may cause scanners to spin with runtime filters on Avro or Sequence files</title>
<conbody>
<p>
A timing problem during runtime filter processing could cause queries against Avro or SequenceFile tables
to hang.
</p>
<p><b>Bug:</b> <xref scope="external" format="html" href="https://issues.cloudera.org/browse/IMPALA-3798">IMPALA-3798</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
</concept>
<!-- All 2.5.x / 5.7.x subsections go under here -->
<concept rev="2.5.4" id="fixed_issues_254">
<title>Issues Fixed in Impala for CDH 5.7.4</title>
<conbody>
<p> For the full list of fixed issues for all CDH components, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_57.html"
format="html" scope="external">the CDH 5.7.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_57.xml#ui_fixed_in_574"
/>. </p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_57.xml#ui_fixed_in_574/impala_start"
conrefend="cdh_rn_fixed_in_57.xml#ui_fixed_in_574/impala_end" />
</ul>
</conbody>
</concept>
<!--
<concept rev="2.5.3" id="fixed_issues_253" audience="pre-release">
<title>Issues Fixed in Impala for CDH 5.7.3</title>
<conbody>
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_57.html#ui_fixed_in_573"
format="html" scope="external">the CDH 5.7.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_57.xml#ui_fixed_in_573"/>.
</p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_57.xml#ui_fixed_in_573/impala_start" conrefend="cdh_rn_fixed_in_57.xml#ui_fixed_in_573/impala_end"/>
</ul>
</conbody>
</concept>
-->
<concept rev="2.5.2" id="fixed_issues_252">
<title>Issues Fixed in Impala for CDH 5.7.2</title>
<conbody>
<p> For the full list of fixed issues for all CDH components, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_57.html#ui_fixed_in_572"
format="html" scope="external">the CDH 5.7.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_57.xml#ui_fixed_in_572"
/>. </p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_57.xml#ui_fixed_in_572/impala_start"
conrefend="cdh_rn_fixed_in_57.xml#ui_fixed_in_572/impala_end" />
</ul>
</conbody>
</concept>
<concept rev="2.5.1" id="fixed_issues_251">
<title>Issues Fixed in Impala for CDH 5.7.1</title>
<conbody>
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_57.html#ui_fixed_in_571"
format="html" scope="external">the CDH 5.7.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_57.xml#ui_fixed_in_571"/>.
</p>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_57.xml#ui_fixed_in_571/impala_start" conrefend="cdh_rn_fixed_in_57.xml#ui_fixed_in_571/impala_end"/>
</ul>
</conbody>
</concept>
<concept rev="2.5.0" id="fixed_issues_250">
<title>Issues Fixed in Impala for CDH 5.7.0</title>
<conbody>
<p>
The following list contains the most critical issues (<codeph>priority='Blocker'</codeph>) from the JIRA system.
For the full list of fixed issues in CDH 5.7.0 / Impala 2.5.0, see
<xref href="https://issues.cloudera.org/issues/?jql=%20type%20%3D%20bug%20and%20project%20%3D%20IMPALA%20AND%20resolution%20%3D%20fixed%20AND%20affectedVersion%20!%3D%20%22Impala%202.5.0%22%20AND%20fixVersion%20%3D%20%22Impala%202.5.0%22%20and%20not%20labels%20%3D%20broken-build%20order%20by%20priority%20desc" scope="external" format="html">this report in the Impala JIRA tracker</xref>.
</p>
</conbody>
<concept id="IMPALA-2683">
<title>Stress test hit assert in LLVM: external function could not be resolved</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2683" scope="external" format="html">IMPALA-2683</xref></p>
<p>The stress test was running a build with the TPC-H, TPC-DS, and TPC-H nested queries with scale factor 3.</p>
</conbody>
</concept>
<concept id="IMPALA-2365">
<title>Impalad is crashing if udf jar is not available in hdfs location for first time</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2365" scope="external" format="html">IMPALA-2365</xref></p>
<p>
If a UDF JAR was not available in the HDFS location specified in the <codeph>CREATE FUNCTION</codeph> statement,
the <cmdname>impalad</cmdname> daemon could crash.
</p>
</conbody>
</concept>
<concept id="IMPALA-2535-570">
<title>PAGG hits mem_limit when switching to I/O buffers</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2535" scope="external" format="html">IMPALA-2535</xref></p>
<p>
A join query could fail with an out-of-memory error despite the apparent presence of sufficient memory.
The cause was the internal ordering of operations that could cause a later phase of the query to
allocate memory required by an earlier phase of the query. The workaround was to either increase
or decrease the <codeph>MEM_LIMIT</codeph> query option, because the issue would only occur for a specific
combination of memory limit and data volume.
</p>
</conbody>
</concept>
<concept id="IMPALA-2643-570">
<title>Prevent migrating incorrectly inferred identity predicates into inline views</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2643" scope="external" format="html">IMPALA-2643</xref></p>
<p>
Referring to the same column twice in a view definition could cause the view to omit
rows where that column contained a <codeph>NULL</codeph> value. This could cause
incorrect results due to an inaccurate <codeph>COUNT(*)</codeph> value or rows missing
from the result set.
</p>
</conbody>
</concept>
<concept id="IMPALA-1459-570">
<title>Fix migration/assignment of On-clause predicates inside inline views</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1459" scope="external" format="html">IMPALA-1459</xref></p>
<p>
Some combinations of <codeph>ON</codeph> clauses in join queries could result in comparisons
being applied at the wrong stage of query processing, leading to incorrect results.
Wrong predicate assignment could happen under the following conditions:
</p>
<ul>
<li>
The query includes an inline view that contains an outer join.
</li>
<li>
That inline view is joined with another table in the enclosing query block.
</li>
<li>
That join has an <codeph>ON</codeph> clause containing a predicate that
only references columns originating from the outer-joined tables inside the inline view.
</li>
</ul>
</conbody>
</concept>
<concept id="IMPALA-2093">
<title>Wrong plan of NOT IN aggregate subquery when a constant is used in subquery predicate</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2093" scope="external" format="html">IMPALA-2093</xref></p>
<p>
<codeph>IN</codeph> subqueries might return wrong results if the left-hand side of the <codeph>IN</codeph> is a constant.
For example:
</p>
<codeblock>
select * from alltypestiny t1
where 10 not in (select sum(int_col) from alltypestiny);
</codeblock>
</conbody>
</concept>
<concept id="IMPALA-2940">
<title>Parquet DictDecoders accumulate throughout query</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2940" scope="external" format="html">IMPALA-2940</xref></p>
<p>
Parquet dictionary decoders can accumulate throughout query execution, leading to excessive memory usage. One decoder is created per-column per-split.
</p>
</conbody>
</concept>
<concept id="IMPALA-3056">
<title>Planner doesn't set the has_local_target field correctly</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-3056" scope="external" format="html">IMPALA-3056</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2742">
<title>MemPool allocation growth behavior</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2742" scope="external" format="html">IMPALA-2742</xref></p>
<p>
Currently, the MemPool would always double the size of the last allocation.
This can lead to bad behavior if the MemPool transferred the ownership of all its data
except the last chunk. In the next allocation, the next allocated chunk would double
the size of this large chunk, which can be undesirable.
</p>
</conbody>
</concept>
<concept id="IMPALA-3035">
<title>Drop partition operations don't follow the catalog's locking protocol</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-3035" scope="external" format="html">IMPALA-3035</xref></p>
<p>
The <codeph>CatalogOpExecutor.alterTableDropPartition()</codeph> function violates
the locking protocol used in the catalog that requires <codeph>catalogLock_</codeph>
to be acquired before any table-level lock. That may cause deadlocks when <codeph>ALTER TABLE DROP PARTITION</codeph>
is executed concurrently with other DDL operations.
</p>
</conbody>
</concept>
<concept id="IMPALA-2215">
<title>HAVING clause without aggregation not applied properly</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2215" scope="external" format="html">IMPALA-2215</xref></p>
<p>
A query with a <codeph>HAVING</codeph> clause but no <codeph>GROUP BY</codeph> clause was not being rejected,
despite being invalid syntax. For example:
</p>
<codeblock>
select case when 1=1 then 'didit' end as c1 from (select 1 as one) a having 1!=1;
</codeblock>
</conbody>
</concept>
<concept id="IMPALA-2914">
<title>Hit DCHECK Check failed: HasDateOrTime()</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2914" scope="external" format="html">IMPALA-2914</xref></p>
<p>
<codeph>TimestampValue::ToTimestampVal()</codeph> requires a valid <codeph>TimestampValue</codeph> as input.
This requirement was not enforced in some places, leading to serious errors.
</p>
</conbody>
</concept>
<concept id="IMPALA-2986">
<title>Aggregation spill loop gives up too early leading to mem limit exceeded errors</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2986" scope="external" format="html">IMPALA-2986</xref></p>
<p>
An aggregation query could fail with an out-of-memory error, despite sufficient memory being reported as available.
</p>
</conbody>
</concept>
<concept id="IMPALA-2592">
<title>DataStreamSender::Channel::CloseInternal() does not close the channel on an error.</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2592" scope="external" format="html">IMPALA-2592</xref></p>
<p>
Some queries do not close an internal communication channel on an error.
This will cause the node on the other side of the channel to wait indefinitely, causing the query to hang.
For example, this issue could happen on a Kerberos-enabled system if the credential cache was outdated.
Although the affected query hangs, the <cmdname>impalad</cmdname> daemons continue processing other queries.
</p>
</conbody>
</concept>
<concept id="IMPALA-2184">
<title>Codegen does not catch exceptions in FROM_UNIXTIME()</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2184" scope="external" format="html">IMPALA-2184</xref></p>
<p>
Querying for the min or max value of a timestamp cast from a bigint via <codeph>from_unixtime()</codeph>
fails silently and crashes instances of <cmdname>impalad</cmdname> when the input includes a value outside of the valid range.
</p>
<p><b>Workaround:</b> Disable native code generation with:</p>
<codeblock>
SET disable_codegen=true;
</codeblock>
</conbody>
</concept>
<concept id="IMPALA-2788">
<title>Impala returns wrong result for function 'conv(bigint, from_base, to_base)'</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2788" scope="external" format="html">IMPALA-2788</xref></p>
<p>
Impala returns wrong result for function <codeph>conv()</codeph>.
Function <codeph>conv(bigint, from_base, to_base)</codeph> returns an correct result,
while <codeph>conv(string, from_base, to_base)</codeph> returns the correct value.
For example:
</p>
<codeblock>
<![CDATA[
select 2061013007, conv(2061013007, 16, 10), conv('2061013007', 16, 10);
+------------+--------------------------+----------------------------+
| 2061013007 | conv(2061013007, 16, 10) | conv('2061013007', 16, 10) |
+------------+--------------------------+----------------------------+
| 2061013007 | 1627467783 | 139066421255 |
+------------+--------------------------+----------------------------+
Fetched 1 row(s) in 0.65s
select 2061013007, conv(cast(2061013007 as bigint), 16, 10), conv('2061013007', 16, 10);
+------------+------------------------------------------+----------------------------+
| 2061013007 | conv(cast(2061013007 as bigint), 16, 10) | conv('2061013007', 16, 10) |
+------------+------------------------------------------+----------------------------+
| 2061013007 | 1627467783 | 139066421255 |
+------------+------------------------------------------+----------------------------+
select 2061013007, conv(cast(2061013007 as string), 16, 10), conv('2061013007', 16, 10);
+------------+------------------------------------------+----------------------------+
| 2061013007 | conv(cast(2061013007 as string), 16, 10) | conv('2061013007', 16, 10) |
+------------+------------------------------------------+----------------------------+
| 2061013007 | 139066421255 | 139066421255 |
+------------+------------------------------------------+----------------------------+
select 2061013007, conv(cast(cast(2061013007 as decimal(20,0)) as bigint), 16, 10), conv('2061013007', 16, 10);
+------------+-----------------------------------------------------------------+----------------------------+
| 2061013007 | conv(cast(cast(2061013007 as decimal(20,0)) as bigint), 16, 10) | conv('2061013007', 16, 10) |
+------------+-----------------------------------------------------------------+----------------------------+
| 2061013007 | 1627467783 | 139066421255 |
+------------+-----------------------------------------------------------------+----------------------------+
]]>
</codeblock>
<p><b>Workaround:</b>
Cast the value to string and use <codeph>conv(string, from_base, to_base)</codeph> for conversion.
</p>
</conbody>
</concept>
<!-- I think this issue is marked incorrectly in JIRA. It arose during the 5.7 development cycle, it's not a
customer-facing issue.
<concept id="IMPALA-3039">
<title>Restrict the number of runtime filters generated</title>
<conbody>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-3039" scope="external" format="html">IMPALA-3039</xref></p>
<p>
Large queries with many runtime filters might fail for various reasons, such as memory for Thrift data structures
not being accounted for.
Having to disable all runtime filters to make a query run is not desirable.
The new <codeph>MAX_NUM_RUNTIME_FILTERS</codeph> query option specifies an upper bound
on the number of runtime filters generated for a query.
</p>
</conbody>
</concept>
-->
</concept>
<!-- All 2.4.x / 5.6.x subsections go under here -->
<concept rev="2.4.1" id="fixed_issues_241">
<title>Issues Fixed in Impala for CDH 5.6.1</title>
<conbody>
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_56.html#ui_fixed_in_561"
format="html" scope="external">the CDH 5.6.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_56.xml#ui_fixed_in_561"/>.
</p>
<!-- Maybe time to start dropping the 'CDH 5 only' notes... wasn't included for 5.6.0, so why start for 5.6.1...
<note conref="../shared/impala_common.xml#common/only_cdh5_24x"/> -->
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_56.xml#ui_fixed_in_561/impala_start" conrefend="cdh_rn_fixed_in_56.xml#ui_fixed_in_561/impala_end"/>
</ul>
</conbody>
</concept>
<concept rev="2.4.0" id="fixed_issues_240">
<title>Issues Fixed in Impala for CDH 5.6.0</title>
<conbody>
<p> The set of fixes for Impala in CDH 5.6.0 is the same as in CDH 5.5.2.
See <xref href="impala_fixed_issues.xml#fixed_issues_232"
audience="integrated"/> for details. </p>
<!--
<p>
This section lists the most serious or frequently encountered customer issues fixed in CDH 5.6.0 / Impala 2.4.0.
For the full list of fixed Impala issues, see
<xref audience="standalone" href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_560.html" format="html" scope="external">the CDH 5.6.x release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_560.xml"/>.
</p>
-->
</conbody>
</concept>
<!-- All 2.3.x / 5.5.x subsections go under here -->
<concept rev="2.3.4" id="fixed_issues_234">
<title>Issues Fixed in Impala for CDH 5.5.4</title>
<conbody>
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone" href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_55.html#fixed_issues_554" format="html" scope="external">the CDH 5.5.x release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_55.xml#fixed_issues_554"/>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_23x"/>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_55.xml#ui_fixed_in_554/impala_start" conrefend="cdh_rn_fixed_in_55.xml#ui_fixed_in_554/impala_end"/>
</ul>
</conbody>
</concept>
<concept rev="2.3.2" id="fixed_issues_232">
<title>Issues Fixed in Impala for CDH 5.5.2</title>
<conbody>
<p>
This section lists the most serious or frequently encountered customer
issues fixed in CDH 5.5.2 / Impala 2.3.2. For the full list of fixed Impala issues,
see <xref audience="standalone" href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_552.html" format="html" scope="external">the CDH 5.5.x release notes</xref><xref href="cdh_rn_fixed_in_55.xml#fixed_issues_552" audience="integrated"/>.
</p>
</conbody>
<!--
Fixed issues in 5.5.2 from Juan Yu spreadsheet (https://docs.google.com/spreadsheets/d/1Ia06xOLKQInKfsJzQfQ6Sf1fO3nYLOpZ-0OWfFNgNSc/edit#gid=2094272728):
IMPALA-2747: Thrift-client cleans openSSL state before using it in the case of the catalog
IMPALA-2829: SEGV in AnalyticEvalNode touching NULL input_stream_
IMPALA-2722: Free local allocations per row batch in non-partitioned AGG and HJ
IMPALA-2612: Free local allocations once for every row batch when building hash tables
IMPALA-2643: Prevent migrating incorrectly inferred identity predicates into inline views
IMPALA-2695: Fix GRANTs on URIs with uppercase letters
IMPALA-2664: Avoid sending large partition stats objects over thrift
IMPALA-2648: Ditto
IMPALA-2226: Throw AnalysisError if table properties are too large (for the Hive metastore)
IMPALA-2273: Make MAX_PAGE_HEADER_SIZE configurable
IMPALA-2473: reduce scanner memory usage
IMPALA-2113: Handle error when distinct and aggregates are used with a having clause
IMPALA-2225: Handle error when star based select item and aggregate are incorrectly used
IMPALA-2731: Refactor MemPool usage in HBase scan node
IMPALA-1459: Fix migration/assignment of On-clause predicates inside inline views
IMPALA-2558: DCHECK in parquet scanner after block read error
IMPALA-2535: PAGG hits mem_limit when switching to I/O buffers
IMPALA-2559: Fix check failed: sorter_runs_.back()->is_pinned_
IMPALA-2614: Don't ignore Status returned by DataStreamRecvr::CreateMerger()
IMPALA-2591: DataStreamSender::Send() does not return an error status if SendBatch() failed
IMPALA-2598: Re-enable SSL and Kerberos on server-server
These are changes to test cases and so ignored for release note purposes:
IMPALA-2103: Fix flaky test_impersonation test
IMPALA-2776: Remove escapechartesttable and associated tests.
IMPALA-2812: Remove additional test referencing escapecharstesttable.
IMPALA-2630: skip TestParquet.test_continue_on_error when using old aggs/joins
IMPALA-2624: Increase fs.trash.interval to 24 hours for test suite
-->
<concept id="IMPALA-2829">
<title>SEGV in AnalyticEvalNode touching NULL input_stream_</title>
<conbody>
<p>
A query involving an analytic function could encounter a serious error.
This issue was encountered infrequently, depending upon specific combinations
of queries and data.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2829" scope="external" format="html">IMPALA-2829</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2722">
<title>Free local allocations per row batch in non-partitioned AGG and HJ</title>
<conbody>
<p>
An outer join query could fail unexpectedly with an out-of-memory error
when the <q>spill to disk</q> mechanism was turned off.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2722" scope="external" format="html">IMPALA-2722</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2612">
<!-- In this case, the title from the spreadsheet is clearer than the original JIRA title. -->
<title>Free local allocations once for every row batch when building hash tables</title>
<conbody>
<p>
A join query could encounter a serious error due to an internal failure to allocate memory, which
resulted in dereferencing a <codeph>NULL</codeph> pointer.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2612" scope="external" format="html">IMPALA-2612</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2643">
<title>Prevent migrating incorrectly inferred identity predicates into inline views</title>
<conbody>
<p>
Referring to the same column twice in a view definition could cause the view to omit
rows where that column contained a <codeph>NULL</codeph> value. This could cause
incorrect results due to an inaccurate <codeph>COUNT(*)</codeph> value or rows missing
from the result set.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2643" scope="external" format="html">IMPALA-2643</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2695">
<title>Fix GRANTs on URIs with uppercase letters</title>
<conbody>
<p>
A <codeph>GRANT</codeph> statement for a URI could be ineffective if the URI
contained uppercase letters, for example in an uppercase directory name.
Subsequent statements, such as <codeph>CREATE EXTERNAL TABLE</codeph>
with a <codeph>LOCATION</codeph> clause, could fail with an authorization exception.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2695" scope="external" format="html">IMPALA-2695</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2664-552">
<title id="IMPALA-2648-552">Avoid sending large partition stats objects over thrift</title>
<conbody>
<p>
The <cmdname>catalogd</cmdname> daemon could encounter a serious error
when loading the incremental statistics metadata for tables with large
numbers of partitions and columns. The problem occurred when the
internal representation of metadata for the table exceeded 2
GB, for example in a table with 20K partitions and 77 columns. The fix causes a
<codeph>COMPUTE INCREMENTAL STATS</codeph> operation to fail if it
would produce metadata that exceeded the maximum size.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2664" scope="external" format="html">IMPALA-2664</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2648" scope="external" format="html">IMPALA-2648</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2226">
<title>Throw AnalysisError if table properties are too large (for the Hive metastore)</title>
<conbody>
<p>
<codeph>CREATE TABLE</codeph> or <codeph>ALTER TABLE</codeph> statements could fail with
metastore database errors due to length limits on the <codeph>SERDEPROPERTIES</codeph> and <codeph>TBLPROPERTIES</codeph> clauses.
(The limit on key size is 256, while the limit on value size is 4000.) The fix makes Impala handle these error conditions
more cleanly, by detecting too-long values rather than passing them to the metastore database.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2226" scope="external" format="html">IMPALA-2226</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2273-552">
<title>Make MAX_PAGE_HEADER_SIZE configurable</title>
<conbody>
<p>
Impala could fail to access Parquet data files with page headers larger than 8 MB, which could
occur, for example, if the minimum or maximum values for a column were long strings. The
fix adds a configuration setting <codeph>--max_page_header_size</codeph>, which you can use to
increase the Impala size limit to a value higher than 8 MB.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2273" scope="external" format="html">IMPALA-2273</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2473">
<title>reduce scanner memory usage</title>
<conbody>
<p>
Queries on Parquet tables could consume excessive memory (potentially multiple gigabytes) due to producing
large intermediate data values while evaluating groups of rows. The workaround was to reduce the size of
the <codeph>NUM_SCANNER_THREADS</codeph> query option, the <codeph>BATCH_SIZE</codeph> query option,
or both.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2473" scope="external" format="html">IMPALA-2473</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2113">
<title>Handle error when distinct and aggregates are used with a having clause</title>
<conbody>
<p>
A query that included a <codeph>DISTINCT</codeph> operator and a <codeph>HAVING</codeph> clause, but no
aggregate functions or <codeph>GROUP BY</codeph>, would fail with an uninformative error message.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2113" scope="external" format="html">IMPALA-2113</xref></p>
<!-- Don't think this is really a 'major' issue. Left a comment in the IMPALA JIRA. -->
</conbody>
</concept>
<concept id="IMPALA-2225">
<title>Handle error when star based select item and aggregate are incorrectly used</title>
<conbody>
<p>
A query that included <codeph>*</codeph> in the <codeph>SELECT</codeph> list, in addition to an
aggregate function call, would fail with an uninformative message if the query had no
<codeph>GROUP BY</codeph> clause.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2225" scope="external" format="html">IMPALA-2225</xref></p>
<!-- Don't think this is really a 'major' issue. Left a comment in the IMPALA JIRA. -->
</conbody>
</concept>
<concept id="IMPALA-2731-552">
<title>Refactor MemPool usage in HBase scan node</title>
<conbody>
<p>
Queries involving HBase tables used substantially more memory than in earlier Impala versions.
The problem occurred starting in Impala 2.2.8, as a result of the changes for IMPALA-2284.
The fix for this issue involves removing a separate memory work area for HBase queries
and reusing other memory that was already allocated.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2731" scope="external" format="html">IMPALA-2731</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1459-552">
<title>Fix migration/assignment of On-clause predicates inside inline views</title>
<conbody>
<p>
Some combinations of <codeph>ON</codeph> clauses in join queries could result in comparisons
being applied at the wrong stage of query processing, leading to incorrect results.
Wrong predicate assignment could happen under the following conditions:
</p>
<ul>
<li>
The query includes an inline view that contains an outer join.
</li>
<li>
That inline view is joined with another table in the enclosing query block.
</li>
<li>
That join has an <codeph>ON</codeph> clause containing a predicate that
only references columns originating from the outer-joined tables inside the inline view.
</li>
</ul>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1459" scope="external" format="html">IMPALA-1459</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2558">
<title>DCHECK in parquet scanner after block read error</title>
<conbody>
<p>
A debug build of Impala could encounter a serious error after encountering some kinds of I/O
errors for Parquet files. This issue only occurred in debug builds, not release builds.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2558" scope="external" format="html">IMPALA-2558</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2535">
<title>PAGG hits mem_limit when switching to I/O buffers</title>
<conbody>
<p>
A join query could fail with an out-of-memory error despite the apparent presence of sufficient memory.
The cause was the internal ordering of operations that could cause a later phase of the query to
allocate memory required by an earlier phase of the query. The workaround was to either increase
or decrease the <codeph>MEM_LIMIT</codeph> query option, because the issue would only occur for a specific
combination of memory limit and data volume.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2535" scope="external" format="html">IMPALA-2535</xref></p>
</conbody>
</concept>
<!-- This is a pretty incomprehensible JIRA title but there isn't any clearer title in the spreadsheet. -->
<!-- Merged with the following JIRA entry (which has a more understandable title) because it's part of the same root cause. -->
<concept id="IMPALA-2559">
<title>Fix check failed: sorter_runs_.back()->is_pinned_</title>
<conbody>
<p>
A query could fail with an internal error while calculating the memory limit.
This was an infrequent condition uncovered during stress testing.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2559" scope="external" format="html">IMPALA-2559</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2614">
<title>Don't ignore Status returned by DataStreamRecvr::CreateMerger()</title>
<conbody>
<p>
A query could fail with an internal error while calculating the memory limit.
This was an infrequent condition uncovered during stress testing.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2614" scope="external" format="html">IMPALA-2614</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2559" scope="external" format="html">IMPALA-2559</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2591">
<title>DataStreamSender::Send() does not return an error status if SendBatch() failed</title>
<conbody>
<!-- Symptoms listed in the JIRA are very vague. Left a note asking Sailesh for more detail. -->
<p>
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2591" scope="external" format="html">IMPALA-2591</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2598">
<title>Re-enable SSL and Kerberos on server-server</title>
<conbody>
<p>
These fixes lift the restriction on using SSL encryption and Kerberos authentication together
for internal communication between Impala components.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2598" scope="external" format="html">IMPALA-2598</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2747" scope="external" format="html">IMPALA-2747</xref></p>
</conbody>
</concept>
<!--
<concept id="IMPALA-2747">
<title>Thrift-client cleans openSSL state before using it in the case of the catalog</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2747" scope="external" format="html">IMPALA-2747</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
</concept>
<concept rev="2.3.1" id="fixed_issues_231">
<title>Issues Fixed in Impala for CDH 5.5.1</title>
<conbody>
<p conref="../shared/impala_common.xml#common/cdh551_noop"/>
</conbody>
</concept>
<concept rev="2.3.0" id="fixed_issues_230">
<title>Issues Fixed in Impala for CDH 5.5.0</title>
<conbody>
<p> This section lists the most serious or frequently encountered customer
issues fixed in CDH 5.5.0 / Impala 2.3.0. Any issues already fixed in
CDH 5.4 maintenance releases (up through CDH 5.4.8) are also included.
Those issues are listed under the respective CDH 5.4 sections and are
not repeated here. For the full list of fixed Impala issues, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_550.html"
format="html" scope="external">the CDH 5.5.x release notes</xref><xref
href="cdh_rn_fixed_in_55.xml#concept_hjb_1pf_x5" audience="integrated"
/>. </p>
<!-- Raw "Fixed Issue" info from Silvius' working doc: https://docs.google.com/document/d/1PCboHo2Em-fySAHr3YF1ZZxMEfK8_vFFpP1rkWzyRZM/edit#
Bug
IMPALA-2168
SEGV in BufferedTupleStream::num_rows() in a query with very large, spilling ROJ
crash, resource-management
Bug
IMPALA-2213
Parquet read can fail if file metadata is stale
Bug
IMPALA-2192
Wrong results on TPCH-Q11 during nightly data load
broken-build, correctness
Bug
IMPALA-2330
SwitchToIoBuffers may leave stream to wrong state
resource-management
Bug
IMPALA-2440
Full outer join using non-partitioned HJ can incorrectly produce extra row of nulls
correctness
Bug
IMPALA-2435
Sorter::AddBatch does not check the Status of unsorted_run_->Init()
Bug
IMPALA-2378
Impalad exceeded its mem limit
crash, resource-management
Bug
IMPALA-2369
Crash: impala::Sorter::Run::Init
crash, nested_types, query_generator
Bug
IMPALA-2366
DiskIoMgr error handling with fread()
Bug
IMPALA-2357
Crash: impala::BufferedBlockMgr::Block::BytesRemaining
crash, nested_types, query_generator
Bug
IMPALA-2348
The Catalog does not close the connection to the HMS after table invalidation
catalog-server, hive
Bug
IMPALA-2319
Crash with nested loop join + limit
Bug
IMPALA-2314
Crash in PHJ::PrepareNextPartition() in call to LargestSpilledPartition()
Bug
IMPALA-2165
Avoid estimating cardinality 0 in SCAN node
performance, planner, resource-management
Improvement
IMPALA-2143
Avoid impala-shell sending password in plain text with ssl enabled
Bug
IMPALA-2133
HBase filter doesn't unescape string values correctly
Bug
IMPALA-2090
Incorrect handling of leap years when adding 1 month interval to date
correctness, query_generator
Bug
IMPALA-2086
Incorrect handling of leap years when adding 1 year interval to date
correctness, query_generator
New Feature
IMPALA-2084
SPLIT_PART and REGEXP_LIKE functions for Tableau pushdown
New Feature
IMPALA-2081
Add PERCENT_RANK, NTILE, CUME_DIST analytic window functions
planner, ramp-up-planner
Bug
IMPALA-2016
Cancelling query with group_concat causes crash
crash, query-lifecycle
Improvement
IMPALA-1975
Automatically reconnect to Impala in the shell if the connection is lost
ramp-up-task, shell
Improvement - perf
IMPALA-1968
Bad plan choices due to incorrect number of estimated hosts.
performance
Bug
IMPALA-1947
Avro cols may load incorrectly if schema inconsistent with StorageDescriptor
catalog-server, correctness
Bug
IMPALA-1917
Query return empty result if it contains NullLiteral in inlineview
correctness, regression
(Possible new feature entry?)
Improvement - perf
IMPALA-1881
Maximize data locality when scanning Parquet files with multiple row groups.
parquet, performance
(Possible new feature entry?)
Improvement
IMPALA-1870
Support users containing commas in authorized_proxy_user_config
ramp-up-introductory, ramp-up-task
Bug
IMPALA-1859
INSERT OVERWRITE with empty result set leaves existing records in partitioned table
impala, insert, overwrite, partitions
Bug
IMPALA-1813
CREATE TABLE LIKE ... STORED AS AVRO does not work.
usability
Improvement
IMPALA-1136
Impala is unable to read hive tables created with the "STORED AS AVRO" clause
ramp-up-planner, ramp-up-task, usability
(Possible new feature entry?)
Improvement - perf
IMPALA-1588
Cache HDFS file handle to avoid repeated hdfs fopen call
Bug
IMPALA-1675
Timestamp: Adding/subtracting very large time intervals to timestamps produces incorrect result
Improvement
IMPALA-1982
parquet.hive.serde.ParquetHiveSerDe is deprecated
Skye Wanderman-Milne
Critical
supportability
Bug
IMPALA-1906
PARQUET_FILE_SIZE query option not always honored due to an internal miscalculation.
Vlad Berindei
Critical
parquet, ramp-up-introductory, ramp-up-task
-->
</conbody>
<concept id="serious_230">
<title>Fixes for Serious Errors</title>
<conbody>
<p>
A number of issues were resolved that could result in serious errors
when encountered. The most critical or commonly encountered are
listed here.
</p>
<p><b>Bugs:</b>
<!-- Bugs marked as 'crash'.
IMPALA-2168
IMPALA-2378
IMPALA-2369
IMPALA-2357
IMPALA-2319
IMPALA-2314
IMPALA-2016
-->
<xref href="https://issues.cloudera.org/browse/IMPALA-2168" scope="external" format="html">IMPALA-2168</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2378" scope="external" format="html">IMPALA-2378</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2369" scope="external" format="html">IMPALA-2369</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2357" scope="external" format="html">IMPALA-2357</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2319" scope="external" format="html">IMPALA-2319</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2314" scope="external" format="html">IMPALA-2314</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2016" scope="external" format="html">IMPALA-2016</xref>
</p>
</conbody>
</concept>
<concept id="correctness_230">
<title>Fixes for Correctness Errors</title>
<conbody>
<p>
A number of issues were resolved that could result in wrong results
when encountered. The most critical or commonly encountered are
listed here.
</p>
<p><b>Bugs:</b>
<!-- Bugs marked as 'correctness'.
IMPALA-2192
IMPALA-2440
IMPALA-2090
IMPALA-2086
IMPALA-1947
IMPALA-1917
-->
<xref href="https://issues.cloudera.org/browse/IMPALA-2192" scope="external" format="html">IMPALA-2192</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2440" scope="external" format="html">IMPALA-2440</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2090" scope="external" format="html">IMPALA-2090</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2086" scope="external" format="html">IMPALA-2086</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-1947" scope="external" format="html">IMPALA-1947</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-1917" scope="external" format="html">IMPALA-1917</xref>
</p>
</conbody>
</concept>
</concept>
<!-- All 2.2.x / 5.4.x subsections go under here -->
<concept rev="2.2.10" id="fixed_issues_2210">
<title>Issues Fixed in Impala for CDH 5.4.10</title>
<conbody>
<!--
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.10.
</p>
-->
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_54.html#fixed_issues_5410"
format="html" scope="external">the CDH 5.4.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_5410"/>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_54.xml#ui_fixed_in_5410/impala_start" conrefend="cdh_rn_fixed_in_54.xml#ui_fixed_in_5410/impala_end"/>
</ul>
</conbody>
</concept>
<concept rev="2.2.9" id="fixed_issues_229">
<title>Issues Fixed in Impala for CDH 5.4.9</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.9.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_549.html"
format="html" scope="external">the CDH 5.4.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_549"/>.
</p>
</conbody>
<!-- List of fixed issues from Juan Yu's spreadsheet: https://docs.google.com/spreadsheets/d/1Ia06xOLKQInKfsJzQfQ6Sf1fO3nYLOpZ-0OWfFNgNSc/edit#gid=1566007471
Asterisks indicate the ones that are considered critical and so get a writeup here.
* 5ff1062 IMPALA-1459: Fix migration/assignment of On-clause predicates inside inline views.
* bcea8cf IMPALA-2446: Fix wrong predicate assignment in outer joins
* 8e010af IMPALA-2648/IMPALA-2664: Avoid sending large partition stats objects over thrift
7e3a789 IMPALA-2260: Adding a large hour interval caused an interval overflow
* 9a86c94 IMPALA-1675: Avoid overflow when adding large intervals to TIMESTAMPs
1a83dfc IMPALA-2086, IMPALA-2090: Avoid boost year/month interval logic
* 1d0b017 IMPALA-1949: Analysis exception when a binary operator contain an IN operator with
0ae066f IMPALA-2141: UnionNode::GetNext() doesn't check for query errors
* 31824ad IMPALA-2273: Make MAX_PAGE_HEADER_SIZE configurable
75497ca IMPALA-2559: Fix check failed: sorter_runs_.back()->is_pinned_
ddfe66f IMPALA-2265: Sorter was not checking the returned Status of PrepareRead
* a39a2a5 IMPALA-2357: Fix spilling sorts with var-len slots that are NULL or empty.
04bcfb9 IMPALA-0000: Fix reporting of custom OOM error messages.
d9d2a10 IMPALA-0000: Log backtrace on all MEM_LIMIT_EXCEEDED errors
* 4210021 IMPALA-0000: Work-around IMPALA-2344: Fail query with OOM in case block->Pin() fails
29d7b42 IMPALA-2286: Fix race between ~BufferedBlockMgr() and BufferedBlockMgr::Creat
* d87a306 IMPALA-2252: Crash (likely race) tearing down BufferedBlockMgr on query failu
* a1bb4a6 IMPALA-1746: QueryExecState doesn't check for query cancellation or errors
* e9ff989 IMPALA-2533: Impala throws IllegalStateException when inserting data into a p
* c22683a IMPALA-1917: Do not register aux equivalence predicates with NULL on either side.
* c0a7393 IMPALA-2731: Refactor MemPool usage in HBase scan node.
-->
<concept id="IMPALA-1917">
<!-- Title in Juan's spreadsheet. Actual JIRA title more useful for readers IMO. <title>Do not register aux equivalence predicates with NULL on either side.</title> -->
<title>Query return empty result if it contains NullLiteral in inlineview</title>
<conbody>
<p>
If an inline view in a <codeph>FROM</codeph> clause contained a <codeph>NULL</codeph> literal,
the result set was empty.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1917" scope="external" format="html">IMPALA-1917</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2731">
<!-- Title in Juan's spreadsheet. Actual JIRA title more useful for readers IMO. <title>Refactor MemPool usage in HBase scan node.</title> -->
<title>HBase scan node uses 2-4x memory after upgrade to Impala 2.2.8</title>
<conbody>
<p>
Queries involving HBase tables used substantially more memory than in earlier Impala versions.
The problem occurred starting in Impala 2.2.8, as a result of the changes for IMPALA-2284.
The fix for this issue involves removing a separate memory work area for HBase queries
and reusing other memory that was already allocated.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2731" scope="external" format="html">IMPALA-2731</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1459">
<title>Fix migration/assignment of On-clause predicates inside inline views</title>
<conbody>
<!-- Explanation comes from IMPALA-2665. -->
<p>
Some combinations of <codeph>ON</codeph> clauses in join queries could result in comparisons
being applied at the wrong stage of query processing, leading to incorrect results.
Wrong predicate assignment could happen under the following conditions:
</p>
<ul>
<li>
The query includes an inline view that contains an outer join.
</li>
<li>
That inline view is joined with another table in the enclosing query block.
</li>
<li>
That join has an <codeph>ON</codeph> clause containing a predicate that
only references columns originating from the outer-joined tables inside the inline view.
</li>
</ul>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1459" scope="external" format="html">IMPALA-1459</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2446">
<title>Fix wrong predicate assignment in outer joins</title>
<conbody>
<p>
The join predicate for an <codeph>OUTER JOIN</codeph> clause could be applied at the wrong stage
of query processing, leading to incorrect results.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2446" scope="external" format="html">IMPALA-2446</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2648"><!-- Also IMPALA-2664 -->
<title>Avoid sending large partition stats objects over thrift</title>
<conbody>
<p> The <cmdname>catalogd</cmdname> daemon could encounter a serious error when loading the
incremental statistics metadata for tables with large numbers of partitions and columns.
The problem occurred when the internal representation of metadata for the table exceeded 2
GB, for example in a table with 20K partitions and 77 columns. The fix causes a
<codeph>COMPUTE INCREMENTAL STATS</codeph> operation to fail if it would produce
metadata that exceeded the maximum size. </p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2648" scope="external" format="html">IMPALA-2648</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2664" scope="external" format="html">IMPALA-2664</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1675">
<title>Avoid overflow when adding large intervals to TIMESTAMPs</title>
<conbody>
<p> Adding or subtracting a large <codeph>INTERVAL</codeph> value to a
<codeph>TIMESTAMP</codeph> value could produce an incorrect result, with the value
wrapping instead of returning an out-of-range error. </p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1675" scope="external" format="html">IMPALA-1675</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1949">
<title>Analysis exception when a binary operator contains an IN operator with values</title>
<conbody>
<p>
An <codeph>IN</codeph> operator with literal values could cause a statement to fail if used
as the argument to a binary operator, such as an equality test for a <codeph>BOOLEAN</codeph> value.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1949" scope="external" format="html">IMPALA-1949</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2273">
<!-- To do: this detail would be useful to capture on the Impala + Parquet page too. -->
<title>Make MAX_PAGE_HEADER_SIZE configurable</title>
<conbody>
<p> Impala could fail to access Parquet data files with page headers larger than 8 MB, which
could occur, for example, if the minimum or maximum values for a column were long strings.
The fix adds a configuration setting <codeph>--max_page_header_size</codeph>, which you
can use to increase the Impala size limit to a value higher than 8 MB. </p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2273" scope="external" format="html">IMPALA-2273</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2357">
<title>Fix spilling sorts with var-len slots that are NULL or empty.</title>
<conbody>
<p>
A query that activated the spill-to-disk mechanism could fail if it contained a sort expression
involving certain combinations of fixed-length or variable-length types.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2357" scope="external" format="html">IMPALA-2357</xref></p>
</conbody>
</concept>
<concept id="block_pin_oom">
<title>Work-around IMPALA-2344: Fail query with OOM in case block->Pin() fails</title>
<conbody>
<p>
Some queries that activated the spill-to-disk mechanism could produce a serious error
if there was insufficient memory to set up internal work areas. Now those queries
produce normal out-of-memory errors instead.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2344" scope="external" format="html">IMPALA-2344</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2252">
<title>Crash (likely race) tearing down BufferedBlockMgr on query failure</title>
<conbody>
<p>
A serious error could occur under rare circumstances, due to a race condition while freeing memory during heavily concurrent workloads.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2252" scope="external" format="html">IMPALA-2252</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1746">
<title>QueryExecState doesn't check for query cancellation or errors</title>
<conbody>
<p>
A call to <codeph>SetError()</codeph> in a user-defined function (UDF) would not cause the query to fail as expected.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1746" scope="external" format="html">IMPALA-1746</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2533">
<title>Impala throws IllegalStateException when inserting data into a partition while select
subquery group by partition columns</title>
<conbody>
<p>
An <codeph>INSERT ... SELECT</codeph> operation into a partitioned table could fail if the <codeph>SELECT</codeph> query
included a <codeph>GROUP BY</codeph> clause referring to the partition key columns.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2533" scope="external" format="html">IMPALA-2533</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.2.8" id="fixed_issues_228">
<title>Issues Fixed in Impala for CDH 5.4.8</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.8.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_548.html" format="html" scope="external">the CDH 5.4.x release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_548"/>.
</p>
</conbody>
<!--
3e0fee5 IMPALA-2369, IMPALA-2435: Impala crashes when the sorter hits an OOM error
ef498c2 IMPALA-1136, IMPALA-2161: Skip \u0000 characters when dealing Avro schemas
8f97d4b IMPALA-2168: Do not try to access streams of repartitioned spilled partition in right-joins
* 294ac14 Additional checks to catch IMPALA-2168 earlier
b9b10ec IMPALA-2514: DCHECK on destroying an ExprContext
96206af IMPALA-2440: Fix old HJ full outer join with no rows
5e1a79a IMPALA-2366: check fread return code correctly
02f3e36 IMPALA-2477: Parquet metadata randomly 'appears stale'
76c1548 IMPALA-2213: make Parquet scanner fail query if the file size metadata is stale
10ba8ed IMPALA-2249: Avoid allocating StringBuffer > 1GB in ScannerContext::Stream::GetBytesInternal()
a87eaa0 IMPALA-2270: avoid FnvHash64to32 with empty inputs
7f6a521 IMPALA-2284: Disallow long (1<<30) strings in group_concat()
* 9aea5d7 Mistake in schema_constraints by the IMPALA-2130 patch (7c7e69b)
f8d32a4 IMPALA-2130: Wrong verification of Parquet file version
9ec5944 IMPALA-2348: The catalog does not close the connection to HMS during table invalidation
a78c303 IMPALA-2364: Wrong DCHECK in PHJ::ProcessProbeBatch
e6dec36 IMPALA-2314: LargestSpilledPartition was not checking if partition is closed
* 56d8bbd Checks for IMPALA-2314 and more accurate calculation of PHJ::LargestSpillPartition
3ad4e71 IMPALA-2256: Handle joins with right side of high cardinality and zero materialized slots
4422462 IMPALA-2165: Avoid cardinality 0 in scan nodes of small tables and low selectivity
1f199ec IMPALA-2292: Change the type of timestamp_col to string in the table no_avro_schema.
f6b4480 IMPALA-1136: Support loading Avro tables without an explicit Avro schema
d93fb5a IMPALA-1899: Cleanup handling of Hive's field schema
* = no JIRA assigned with it so it definitely is an internal-only modification.
-->
<concept id="IMPALA-1136">
<title>Impala is unable to read hive tables created with the "STORED AS AVRO" clause</title>
<conbody>
<p>Impala could not read Avro tables created in Hive with the <codeph>STORED AS AVRO</codeph> clause.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1136" scope="external" format="html">IMPALA-1136</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2161" scope="external" format="html">IMPALA-2161</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2213">
<title>make Parquet scanner fail query if the file size metadata is stale</title>
<conbody>
<p>If a Parquet file in HDFS was overwritten by a smaller file, Impala could encounter a serious error.
Issuing a <codeph>INVALIDATE METADATA</codeph> statement before a subsequent query would avoid the error.
The fix allows Impala to handle such inconsistencies in Parquet file length cleanly regardless of whether the
table metadata is up-to-date.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2213" scope="external" format="html">IMPALA-2213</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2249">
<title>Avoid allocating StringBuffer &gt; 1GB in ScannerContext::Stream::GetBytesInternal()</title>
<conbody>
<p>Impala could encounter a serious error when reading compressed text files larger than 1 GB. The fix causes Impala
to issue an error message instead in this case.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2249" scope="external" format="html">IMPALA-2249</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2284">
<title>Disallow long (1&lt;&lt;30) strings in group_concat()</title>
<conbody>
<p>A query using the <codeph>group_concat()</codeph> function could encounter a serious error if the returned string value was larger than 1 GB.
Now the query fails with an error message in this case.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2284" scope="external" format="html">IMPALA-2284</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2270">
<title>avoid FnvHash64to32 with empty inputs</title>
<conbody>
<p>An edge case in the algorithm used to distribute data among nodes could result in uneven distribution of work for some queries,
with all data sent to the same node.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2270" scope="external" format="html">IMPALA-2270</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2348">
<title>The catalog does not close the connection to HMS during table invalidation</title>
<conbody>
<p>A communication error could occur between Impala and the Hive metastore database, causing Impala operations that update
table metadata to fail.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2348" scope="external" format="html">IMPALA-2348</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2364-548">
<title>Wrong DCHECK in PHJ::ProcessProbeBatch</title>
<conbody>
<p>Certain queries could encounter a serious error if the spill-to-disk mechanism was activated.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2364" scope="external" format="html">IMPALA-2364</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2165-548">
<title>Avoid cardinality 0 in scan nodes of small tables and low selectivity</title>
<conbody>
<p>Impala could generate a suboptimal query plan for some queries involving small tables.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2165" scope="external" format="html">IMPALA-2165</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.2.7" id="fixed_issues_227">
<title>Issues Fixed in Impala for CDH 5.4.7</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.7.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_547.html" format="html" scope="external">the CDH 5.4.x release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_547"/>.
</p>
</conbody>
<!--
8736dba IMPALA-1983: Warn if table stats are potentially corrupt.
a0911fc IMPALA-2266: Pass correct child node in 2nd phase merge aggregation.
3183878 IMPALA-2216: Set the output smap of an EmptySetNode produced from an empty inline view.
a73d008 IMPALA-2203: Set an InsertStmt's result exprs from the source statement's result exprs.
e641759 IMPALA-2088: Fix planning of empty union operands with analytics.
752b3bd IMPALA-1756: Constant filter expressions are not checked for errors and state cleanup is not done before throwing exception.
d703959 IMPALA-2239: update misc.test to match the new .test file format
8dddd60 IMPALA-2089: Retain eq predicates bound by grouping slots with complex grouping exprs.
ce9fa25 IMPALA-2201: Unconditionally update the partition stats and row count.
ff5bc02 IMPALA-2199: Row count not set for empty partition when spec is used with compute incremental stats
3eb2d80 IMPALA-1898: Explicit aliases + ordinals analysis bug
ba6a8bf IMPALA-1987: Fix TupleIsNullPredicate to return false if no tuples are nullable.
932d4f8 IMPALA-2178: fix Expr::ComputeResultsLayout() logic
4ef0bd8 IMPALA-1737: Substitute an InsertStmt's partition key exprs with the root node's smap.
Per Juan Yu, almost all of those are critical issues related to query correctness.
The non-critical ones are:
752b3bd IMPALA-1756: Constant filter expressions are not checked for errors and state cleanup is not done before throwing exception.
d703959 IMPALA-2239: update misc.test to match the new .test file format
ce9fa25 IMPALA-2201: Unconditionally update the partition stats and row count.
-->
<concept id="IMPALA-1983">
<title>Warn if table stats are potentially corrupt.</title>
<conbody>
<p>
Impala warns if it detects a discrepancy in table statistics: a table considered to have zero rows even though there are data files present.
In this case, Impala also skips query optimizations that are normally applied to very small tables.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1983" scope="external" format="html">IMPALA-1983:</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2266">
<title>Pass correct child node in 2nd phase merge aggregation.</title>
<conbody>
<p>A query could encounter a serious error if it included a particular combination of aggregate functions and inline views.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2266" scope="external" format="html">IMPALA-2266</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2216">
<title>Set the output smap of an EmptySetNode produced from an empty inline view.</title>
<conbody>
<p>A query could encounter a serious error if it included an inline view whose subquery had no <codeph>FROM</codeph> clause.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2216" scope="external" format="html">IMPALA-2216</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2203">
<title>Set an InsertStmt's result exprs from the source statement's result exprs.</title>
<conbody>
<p>
A <codeph>CREATE TABLE AS SELECT</codeph> or <codeph>INSERT ... SELECT</codeph> statement could produce
different results than a <codeph>SELECT</codeph> statement, for queries including a <codeph>FULL JOIN</codeph> clause
and including literal values in the select list.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2203" scope="external" format="html">IMPALA-2203</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2088">
<title>Fix planning of empty union operands with analytics.</title>
<conbody>
<p>
A query could return incorrect results if it contained a <codeph>UNION</codeph> clause,
calls to analytic functions, and a constant expression that evaluated to <codeph>FALSE</codeph>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2088" scope="external" format="html">IMPALA-2088</xref></p>
</conbody>
</concept>
<!--
<concept id="IMPALA-1756">
<title>Constant filter expressions are not checked for errors and state cleanup is not done before throwing exception.</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1756" scope="external" format="html">IMPALA-1756</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!--
<concept id="IMPALA-2239">
<title>update misc.test to match the new .test file format</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2239" scope="external" format="html">IMPALA-2239</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-2089">
<title>Retain eq predicates bound by grouping slots with complex grouping exprs.</title>
<conbody>
<p>
A query containing an <codeph>INNER JOIN</codeph> clause could return undesired rows.
Some predicate specified in the <codeph>ON</codeph> clause could be omitted from the filtering operation.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2089" scope="external" format="html">IMPALA-2089</xref></p>
</conbody>
</concept>
<!--
<concept id="IMPALA-2201">
<title>Unconditionally update the partition stats and row count.</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2201" scope="external" format="html">IMPALA-2201</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-2199">
<title>Row count not set for empty partition when spec is used with compute incremental stats</title>
<conbody>
<p>
A <codeph>COMPUTE INCREMENTAL STATS</codeph> statement could leave the row count for an emptyp partition as -1,
rather than initializing the row count to 0. The missing statistic value could result in reduced query performance.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2199" scope="external" format="html">IMPALA-2199</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1898">
<title>Explicit aliases + ordinals analysis bug</title>
<conbody>
<p>
A query could encounter a serious error if it included column aliases with the same names as table columns, and used
ordinal numbers in an <codeph>ORDER BY</codeph> or <codeph>GROUP BY</codeph> clause.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1898" scope="external" format="html">IMPALA-1898</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1987">
<title>Fix TupleIsNullPredicate to return false if no tuples are nullable.</title>
<conbody>
<p>
A query could return incorrect results if it included an outer join clause, inline views, and calls to functions such as <codeph>coalesce()</codeph>
that can generate <codeph>NULL</codeph> values.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1987" scope="external" format="html">IMPALA-1987</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2178">
<title>fix Expr::ComputeResultsLayout() logic</title>
<conbody>
<p>
A query could return incorrect results if the table contained multiple <codeph>CHAR</codeph> columns with length of 2 or less,
and the query included a <codeph>GROUP BY</codeph> clause that referred to multiple such columns.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2178" scope="external" format="html">IMPALA-2178</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1737">
<title>Substitute an InsertStmt's partition key exprs with the root node's smap.</title>
<conbody>
<p>
An <codeph>INSERT</codeph> statement could encounter a serious error if the <codeph>SELECT</codeph>
portion called an analytic function.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1737" scope="external" format="html">IMPALA-1737</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.2.5" id="fixed_issues_225">
<title>Issues Fixed in Impala for CDH 5.4.5</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.5.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_545.html" format="html" scope="external">the CDH 5.4.x release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_545"/>.
</p>
</conbody>
<!--
7c01049 IMPALA-1929: Avoiding a DCHECK of NULL hash table in spilled right joins
c98a556 IMPALA-2136: Bug in PrintTColumnValue caused wrong stats for TINYINT partition cols
* 4390a5c Making sure that there are no regressions due to IMPALA-2065
* 33ee7df IMPALA-2133: Properly unescape string value for HBase filters
* 562b61a IMPALA-1735: ExpandRmReservation only check parent pools with limit
* 5b09eaa IMPALA-2101: fix NULL ptr deref in BufferedBlockMgr::PinBlock
c2b8852 IMPALA-2018: Where clause does not propagate to joins inside nested views
16b79ce IMPALA-2064: Add effective_user() builtin
99a282b IMPALA-2125: Make UTC to local TimestampValue conversion faster.
9198985 IMPALA-2065: Workaround IMPALA-1619 in BufferedBlockMgr::ConsumeMemory()
9ce94eb IMPALA-2048: Set the correct input format when updating partition metadata
* = non-critical to document according to Juan Yu.
-->
<concept id="IMPALA-2048">
<title>Impala DML/DDL operations corrupt table metadata leading to Hive query failures</title>
<!-- Real JIRA title: Class not found: com.cloudera.impala.hive.serde.ParquetInputFormat error -->
<conbody>
<p>
When the Impala <codeph>COMPUTE STATS</codeph> statement was run on a partitioned Parquet table that was created in Hive, the table subsequently became inaccessible in Hive.
The table was still accessible to Impala. Regaining access in Hive required a workaround of creating a new table. The error displayed in Hive was:
</p>
<codeblock>Error: Error while compiling statement: FAILED: SemanticException Class not found: com.cloudera.impala.hive.serde.ParquetInputFormat (state=42000,code=40000)</codeblock>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2048" scope="external" format="html">IMPALA-2048</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1929">
<title>Avoiding a DCHECK of NULL hash table in spilled right joins</title>
<!-- Real JIRA title: Crash because PHJ::NextSpilledProbeRowBatch() tries to use a NULL hash_tbl -->
<conbody>
<p>
A query could encounter a serious error if it contained a <codeph>RIGHT OUTER</codeph>, <codeph>RIGHT ANTI</codeph>, or <codeph>FULL OUTER</codeph> join clause
and approached the memory limit on a host so that the <q>spill to disk</q> mechanism was activated.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1929" scope="external" format="html">IMPALA-1929</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2136">
<title>Bug in PrintTColumnValue caused wrong stats for TINYINT partition cols</title>
<!-- Real JIRA title: Partitions with TINYINT partition columns will always have 0 estimated rows after compute stats -->
<conbody>
<p>
Declaring a partition key column as a <codeph>TINYINT</codeph> caused problems with the <codeph>COMPUTE STATS</codeph> statement.
The associated partitions would always have zero estimated rows, leading to potential inefficient query plans.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2136" scope="external" format="html">IMPALA-2136</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2018">
<title>Where clause does not propagate to joins inside nested views</title>
<!-- Real JIRA title: Where clause does not propagate to joins inside nested views -->
<conbody>
<p>
A query that referred to a view whose query referred to another view containing a join, could return incorrect results.
<codeph>WHERE</codeph> clauses for the outermost query were not always applied, causing the result
set to include additional rows that should have been filtered out.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2018" scope="external" format="html">IMPALA-2018</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2064">
<title>Add effective_user() builtin</title>
<!-- Real JIRA title: Add effective_user() builtin -->
<conbody>
<p>
The <codeph>user()</codeph> function returned the name of the logged-in user, which might not be the
same as the user name being checked for authorization if, for example, delegation was enabled.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2064" scope="external" format="html">IMPALA-2064</xref></p>
<p><b>Resolution:</b> Rather than change the behavior of the <codeph>user()</codeph> function,
the fix introduces an additional function <codeph>effective_user()</codeph> that returns the user name that is checked during authorization.</p>
</conbody>
</concept>
<concept id="IMPALA-2125">
<title>Make UTC to local TimestampValue conversion faster.</title>
<!-- Real JIRA title: Improve perf when reading timestamps from parquet files written by hive -->
<conbody>
<p>
Query performance was improved substantially for Parquet files containing <codeph>TIMESTAMP</codeph>
data written by Hive, when the <codeph>-convert_legacy_hive_parquet_utc_timestamps=true</codeph> setting
is in effect.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2125" scope="external" format="html">IMPALA-2125</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2065">
<title>Workaround IMPALA-1619 in BufferedBlockMgr::ConsumeMemory()</title>
<!-- Real JIRA title: Crash: impala::BufferedBlockMgr::ConsumeMemory (PartitionedHashJoinNode) -->
<conbody>
<p>
A join query could encounter a serious error if the query
approached the memory limit on a host so that the <q>spill to disk</q> mechanism was activated,
and data volume in the join was large enough that an internal memory buffer exceeded 1 GB in size on a particular host.
(Exceeding this limit would only happen for huge join queries, because Impala could split this intermediate data
into 16 parts during the join query, and the buffer only contains compact bookkeeping data rather than the actual
join column data.)
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2065" scope="external" format="html">IMPALA-2065</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.2.3" id="fixed_issues_223">
<title>Issues Fixed in Impala for CDH 5.4.3</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.3.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_543.html" format="html" scope="external">the CDH 5.4.1 release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_543"/>.
</p>
</conbody>
<!-- List of fixed issues from Juan's spreadsheet: https://docs.google.com/spreadsheets/d/1Ia06xOLKQInKfsJzQfQ6Sf1fO3nYLOpZ-0OWfFNgNSc/edit#gid=1566007471
b782f02 IMPALA-1963: Impala Timestamp ISO-8601 Support.
a0ce91e IMPALA-2043: skip metadata/testddl.py#test_create_alter_bulk_parition on S3
191f150 CDH-27688: Isilon: fix custom_cluster/test_insert_inherit_permission.py
ae39894 IMPALA-1968: Part 1: Improve planner numNodes estimate for remote scans
0d12492 IMPALA-1730: reduce scanner thread spinning windows
80c2ee3 CDH-26785: Isilon: add remote HDFS disk queue
19533f6 IMPALA-2002: Provide way to cache ext data source classes
f847370 IMPALA-2008: Fix wrong warning when insert overwrite to empty table
81b6f27 IMPALA-1381: Expand set of supported timezones.
2dcf1ec IMPALA-1952: Expand parsing of decimals to include scientific notation
9053970 : Isilon: don't warn for multiple remote parquet blocks
a94d401 : Enable isilon end to end tests for Impala.
3062d5d : Add flag to skip HBase pytests
ce601ec : Enable using isilon as the underlying filesystem.
-->
<concept id="isilon_support">
<title>Enable using Isilon as the underlying filesystem.</title>
<conbody>
<p>
The certification of CDH and Impala with the Isilon filesystem involves a number of
fixes to performance and flexibility for dealing with I/O using remote reads.
See <xref href="impala_isilon.xml#impala_isilon"/> for details on using Impala and Isilon together.
</p>
<p audience="Cloudera"><b>Bug:</b> <xref href="https://jira.cloudera.com/browse/CDH-24040" scope="external" format="html">CDH-24040</xref></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1968" scope="external" format="html">IMPALA-1968</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-1730" scope="external" format="html">IMPALA-1730</xref></p>
</conbody>
</concept>
<!-- Covered under the initial Isilon umbrella issue.
<concept id="IMPALA-1968">
<title>Part 1: Improve planner numNodes estimate for remote scans</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1968" scope="external" format="html">IMPALA-1968</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Covered under the initial Isilon umbrella issue.
<concept id="IMPALA-1730">
<title>reduce scanner thread spinning windows</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1730" scope="external" format="html">IMPALA-1730</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1381">
<title>Expand set of supported timezones.</title>
<conbody>
<p>
The set of timezones recognized by Impala was expanded.
You can always find the latest list of supported timezones in the
Impala source code, in the file
<xref href="https://github.com/cloudera/Impala/blob/master/be/src/exprs/timezone_db.cc" scope="external" format="html">timezone_db.cc</xref>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1381" scope="external" format="html">IMPALA-1381</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1963">
<title>Impala Timestamp ISO-8601 Support.</title>
<conbody>
<p>
Impala can now process <codeph>TIMESTAMP</codeph> literals including a trailing <codeph>z</codeph>,
signifying <q>Zulu</q> time, a synonym for UTC.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1963" scope="external" format="html">IMPALA-1963</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2008">
<title>Fix wrong warning when insert overwrite to empty table</title>
<conbody>
<p>
An <codeph>INSERT OVERWRITE</codeph> operation would encounter an error
if the <codeph>SELECT</codeph> portion of the statement returned zero
rows, such as with a <codeph>LIMIT 0</codeph> clause.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2008" scope="external" format="html">IMPALA-2008</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1952">
<!-- Fixed in first in earlier release and 'frontported' to 5.4.3. -->
<!-- Identical issue farther down in this page, just with a slightly mangled ID. -->
<title>Expand parsing of decimals to include scientific notation</title>
<conbody>
<p>
<codeph>DECIMAL</codeph> literals can now include <codeph>e</codeph> scientific notation.
For example, now <codeph>CAST(1e3 AS DECIMAL(5,3))</codeph> is a valid expression.
Formerly it returned <codeph>NULL</codeph>.
Some scientific expressions might have worked before in <codeph>DECIMAL</codeph> context, but only when the scale was 0.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
</conbody>
</concept>
<!-- Not significant.
<concept id="IMPALA-2043">
<title>skip metadata/testddl.py#test_create_alter_bulk_parition on S3</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="CDH-27688">
<title>Isilon: fix custom_cluster/test_insert_inherit_permission.py</title>
<conbody>
<p></p>
<p audience="Cloudera"><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="CDH-26785">
<title>Isilon: add remote HDFS disk queue</title>
<conbody>
<p></p>
<p audience="Cloudera"><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="IMPALA-2002">
<title>Provide way to cache ext data source classes</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="isilon_warn_multiple_blocks">
<title>Isilon: don't warn for multiple remote parquet blocks</title>
<conbody>
<p></p>
<p audience="Cloudera"><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="isilon_tests">
<title>Enable isilon end to end tests for Impala.</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<!-- Not significant.
<concept id="skip_hbase_pytests">
<title>Add flag to skip HBase pytests</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
</concept>
<concept rev="2.2.1" id="fixed_issues_221">
<title>Issues Fixed in Impala for CDH 5.4.1</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala for CDH 5.4.1.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_22x"/>
<p>
For the full list of fixed issues, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_541.html" format="html" scope="external">the CDH 5.4.1 release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_54.xml#fixed_issues_541"/>.
</p>
</conbody>
</concept>
<concept rev="2.2.0" id="fixed_issues_220">
<title>Issues Fixed in the 2.2.0 Release / CDH 5.4.0</title>
<conbody>
<p>
This section lists the most frequently encountered customer issues fixed in Impala 2.2.0.
</p>
<p>
For the full list of fixed issues in Impala 2.2.0, including over 40 critical issues, see
<!--
In maintenance releases, the CDH fixed issues topics did include a nice compact list of Impala JIRAs.
But now in 5.4, it's back to lengthy descriptions for fixed issues and the Impala ones don't have a
complete compact list anywhere in the docs. So returning to the practice of linking to the list in
the JIRA system like was done before.
-->
<!-- <xref audience="standalone" href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_540.html" format="html" scope="external">the CDH 5.4.0 release notes</xref><xref audience="integrated" href="cdh_rn_fixed_in_540.xml"/>. -->
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_220"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
My proposed list to Silvius:
IMPALA-1854 Impala may leak or use too many file descriptors
IMPALA-1712 Spurious stale block locality messages
IMPALA-1711 DROP TABLE fails after COMPUTE STATS and ALTER TABLE RENAME to a different database.
IMPALA-1674 IMPALA-1556 causes memory leak with secure connections
IMPALA-1623 unix_timestamp() does not return correct time
IMPALA-1476 Impala incorrectly handles text data missing a newline on the last line
IMPALA-1194 Memory leak using zlib on CentOS6 (and possibly other platforms)
IMPALA-1805 Impala's ACLs check do not consider all group ACLs, only checked first one.
IMPALA-1794 IoMgr infinite loop opening/closing file when shorter than cached metadata size
IMPALA-1705 Cannot write Parquet files when values are larger than 64KB
IMPALA-1646 Impala Will Not Run on Certain Intel CPUs
Plus one or more transferred over from known issues:
IMPALA-1607
-->
<concept id="IMPALA-1607">
<title>Altering a column's type causes column stats to stop sticking for that column</title>
<conbody>
<p>
When the type of a column was changed in either Hive or Impala through <codeph>ALTER TABLE CHANGE COLUMN</codeph>, the metastore database did not correctly propagate
that change to the table that contains the column statistics. The statistics (particularly the <codeph>NDV</codeph>) for that column were permanently reset
and could not be changed by Impala's <codeph>COMPUTE STATS</codeph> command. The underlying cause is a Hive bug (HIVE-9866).
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1607" scope="external" format="html">IMPALA-1607</xref></p>
<p><b>Resolution:</b> Resolved by incorporating the fix for <xref href="https://issues.apache.org/jira/browse/HIVE-9866" scope="external" format="html">HIVE-9866</xref>.</p>
<p><b>Workaround:</b> On systems without the corresponding Hive fix, change the column back to its original type. The stats reappear and you can recompute or drop them.</p>
</conbody>
</concept>
<concept id="IMPALA-1854">
<title>Impala may leak or use too many file descriptors</title>
<conbody>
<p>If a file was truncated in HDFS without a corresponding <codeph>REFRESH</codeph> in Impala, Impala could allocate memory for file descriptors and not free that memory.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1854" scope="external" format="html">IMPALA-1854</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1712">
<title>Spurious stale block locality messages</title>
<conbody>
<p>
Impala could issue messages stating the block locality metadata was stale,
when the metadata was actually fine.
The internal <q>remote bytes read</q> counter was not being reset properly.
This issue did not cause an actual slowdown in query execution,
but the spurious error could result in unnecessary debugging work
and unnecessary use of the <codeph>INVALIDATE METADATA</codeph> statement.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1712" scope="external" format="html">IMPALA-1712</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1711">
<title>DROP TABLE fails after COMPUTE STATS and ALTER TABLE RENAME to a different database.</title>
<conbody>
<p>
When a table was moved from one database to another, the column statistics were not pointed to the new database.i
This could result in lower performance for queries due to unavailable statistics, and also an inability
to drop the table.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1711" scope="external" format="html">IMPALA-1711</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1674">
<title>IMPALA-1556 causes memory leak with secure connections</title>
<conbody>
<p>
<cmdname>impalad</cmdname> daemons could experience a memory leak on clusters using Kerberos
authentication, with memory usage growing as more data is transferred across the secure channel, either
to the client program or between Impala nodes. The same issue affected LDAP-secured clusters to a lesser
degree, because the LDAP security only covers data transferred back to client programs.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1674" scope="external" format="html">IMPALA-1674</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1623">
<title>unix_timestamp() does not return correct time</title>
<conbody>
<p>
The <codeph>unix_timestamp()</codeph> function could return an incorrect value (a constant value of 1).
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1623" scope="external" format="html">IMPALA-1623</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1476">
<title>Impala incorrectly handles text data missing a newline on the last line</title>
<conbody>
<p>
Some queries did not recognize the final line of a text data file if the line did not end with a newline character.
This could lead to inconsistent results, such as a different number of rows for <codeph>SELECT COUNT(*)</codeph>
as opposed to <codeph>SELECT *</codeph>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1476" scope="external" format="html">IMPALA-1476</xref></p>
</conbody>
</concept>
<!--
<concept id="IMPALA-1194">
<title>Memory leak using zlib on CentOS6 (and possibly other platforms)</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1194" scope="external" format="html">IMPALA-1194</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1805">
<title>Impala's ACLs check do not consider all group ACLs, only checked first one.</title>
<conbody>
<p>
If the HDFS user ID associated with the <cmdname>impalad</cmdname> process had read or write access in HDFS based on group membership,
Impala statements could still fail with HDFS permission errors if that group was not the first listed group for that user ID.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1805" scope="external" format="html">IMPALA-1805</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1794">
<title>Fix infinite loop opening or closing file with invalid metadata</title>
<conbody>
<p>
Truncating a file in HDFS, after Impala had cached the file metadata, could produce a hang when Impala
queried a table containing that file.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1794" scope="external" format="html">IMPALA-1794</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1705">
<title>Cannot write Parquet files when values are larger than 64KB</title>
<conbody>
<p>
Impala could sometimes fail to <codeph>INSERT</codeph> into a Parquet table if a column value such as a <codeph>STRING</codeph>
was larger than 64 KB.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1705" scope="external" format="html">IMPALA-1705</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1646">
<title>Impala Will Not Run on Certain Intel CPUs</title>
<conbody>
<p>
This fix relaxes the CPU requirement for Impala. Now only the SSSE3 instruction set is required. Formerly, SSE4.1
instructions were generated, making Impala refuse to start on some older CPUs.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1646" scope="external" format="html">IMPALA-1646</xref></p>
</conbody>
</concept>
</concept>
<!-- All 2.1.x / 5.3.x subsections go under here -->
<concept rev="2.3.10" id="fixed_issues_2310">
<title>Issues Fixed in Impala for CDH 5.3.10</title>
<conbody>
<p>
For the full list of fixed issues for all CDH components, see
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_53.html#fixed_issues_5310"
format="html" scope="external">the CDH 5.3.x release notes</xref><xref
audience="integrated" href="cdh_rn_fixed_in_53.xml#fixed_issues_5310"/>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_21x"/>
<ul audience="integrated">
<li conref="cdh_rn_fixed_in_53.xml#ui_fixed_in_5310/impala_start" conrefend="cdh_rn_fixed_in_53.xml#ui_fixed_in_5310/impala_end"/>
</ul>
</conbody>
</concept>
<concept rev="2.1.7" id="fixed_issues_217">
<title>Issues Fixed in the 2.1.7 Release / CDH 5.3.9</title>
<conbody>
<p>
This section lists the most significant Impala issues fixed in Impala 2.1.7 for CDH 5.3.9.
</p>
<p> For the full list of Impala fixed issues in this release, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_539.html"
format="html" scope="external">the CDH 5.3.9 release notes</xref><xref
href="cdh_rn_fixed_in_53.xml#fixed_issues_539" audience="integrated"
/>. </p>
<note conref="../shared/impala_common.xml#common/only_cdh5_21x"/>
</conbody>
<!--
5e7e50d IMPALA-1917: Do not register aux equivalence predicates with NULL on either side.
d8856bd IMPALA-2264: fix edge cases for decimal/integer cast
c17890e IMPALA-2514: DCHECK on destroying an ExprContext
2befc12 IMPALA-1746: QueryExecState doesn't check for query cancellation or errors
e591cf4 IMPALA-2141: UnionNode::GetNext() doesn't check for query errors
76b5313 IMPALA-1756: Constant filter expressions are not checked for errors and state clea
* 6a98b7c Don't redirect stdout/stderr from BE tests
-->
<concept id="IMPALA-1917-539">
<!-- Title in Juan's spreadsheet. Actual JIRA title more useful for readers IMO. <title>Do not register aux equivalence predicates with NULL on either side.</title> -->
<title>Query return empty result if it contains NullLiteral in inlineview</title>
<conbody>
<p>
If an inline view in a <codeph>FROM</codeph> clause contained a <codeph>NULL</codeph> literal,
the result set was empty.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1917" scope="external" format="html">IMPALA-1917</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2264">
<title>Fix edge cases for decimal/integer cast</title>
<conbody>
<p>
A value of type <codeph>DECIMAL(3,0)</codeph> could be incorrectly cast to <codeph>TINYINT</codeph>.
The resulting out-of-range value could be incorrect. After the fix, the smallest type that is allowed
for this cast is <codeph>INT</codeph>, and attempting to use <codeph>DECIMAL(3,0)</codeph> in a
<codeph>TINYINT</codeph> context produces a <q>loss of precision</q> error.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2264" scope="external" format="html">IMPALA-2264</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1756">
<title>Constant filter expressions are not checked for errors and state cleanup on exception / DCHECK on destroying an ExprContext</title>
<conbody>
<p>
An invalid constant expression in a <codeph>WHERE</codeph> clause (for example, an invalid
regular expression pattern) could fail to clean up internal state after raising a query error.
Therefore, certain combinations of invalid expressions in a query could cause a crash, or cause a query to continue
when it should halt with an error.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1756" scope="external" format="html">IMPALA-1756</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2514" scope="external" format="html">IMPALA-2514</xref></p>
</conbody>
</concept>
<!-- Combined with IMPALA-1756 above.
<concept id="IMPALA-2514">
<title>DCHECK on destroying an ExprContext</title>
<conbody>
<p>
Some internal state was not cleaned up after evaluating an invalid expression.
Therefore, certain combinations of invalid expressions in a query could cause a crash, or cause a query to continue
when it should halt with an error.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2514" scope="external" format="html">IMPALA-2514</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1746-539">
<title>QueryExecState does not check for query cancellation or
errors</title>
<conbody>
<p>
A call to <codeph>SetError()</codeph> in a user-defined function (UDF) would not cause the query to fail as expected.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1746" scope="external" format="html">IMPALA-1746</xref>,
<xref href="https://issues.cloudera.org/browse/IMPALA-2141" scope="external" format="html">IMPALA-2141</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.6" id="fixed_issues_216">
<title>Issues Fixed in the 2.1.6 Release / CDH 5.3.8</title>
<conbody>
<p>
This section lists the most significant Impala issues fixed in Impala 2.1.6 for CDH 5.3.8.
</p>
<p> For the full list of Impala fixed issues in this release, see <!-- Don't think we have quite the right combination of conditional options to make this reference be a web link in Impala PDF and a normal xref in Release Notes PDF. -->
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_538.html"
format="html" scope="external">the CDH 5.3.8 release notes</xref><xref
href="cdh_rn_fixed_in_53.xml#fixed_issues_538" audience="integrated"
/>. </p>
<note conref="../shared/impala_common.xml#common/only_cdh5_21x"/>
</conbody>
<!--
9f2a54f IMPALA-2364: Wrong DCHECK in PHJ::ProcessProbeBatch
7789e19 IMPALA-2165: Avoid cardinality 0 in scan nodes of small tables and low selectivity
902c88b IMPALA-2314: LargestSpilledPartition was not checking if partition is closed
1be8617 IMPALA-2178: fix Expr::ComputeResultsLayout() logic
* afc2cf6 IMPALA-2133: Properly unescape string value for HBase filters
25210af IMPALA-1929: Avoiding a DCHECK of NULL hash table in spilled right joins
* Designated as non-critical when previously published in a different 5.x.y maintenance release.
-->
<concept id="IMPALA-2364">
<title>Wrong DCHECK in PHJ::ProcessProbeBatch</title>
<conbody>
<p>Certain queries could encounter a serious error if the spill-to-disk mechanism was activated.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2364" scope="external" format="html">IMPALA-2364</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2314">
<title>LargestSpilledPartition was not checking if partition is closed</title>
<conbody>
<p>Certain queries could encounter a serious error if the spill-to-disk mechanism was activated.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2314" scope="external" format="html">IMPALA-2314</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2165">
<title>Avoid cardinality 0 in scan nodes of small tables and low selectivity</title>
<conbody>
<p>Impala could generate a suboptimal query plan for some queries involving small tables.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2165" scope="external" format="html">IMPALA-2165</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2178-238">
<title>fix Expr::ComputeResultsLayout() logic</title>
<conbody>
<p>Queries using the <codeph>GROUP BY</codeph> operator on multiple <codeph>CHAR</codeph> columns with length less than or equal to 2 characters
could return incorrect results for some columns.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2178" scope="external" format="html">IMPALA-2178</xref></p>
</conbody>
</concept>
<concept id="IMPALA-2133">
<title>Properly unescape string value for HBase filters</title>
<conbody>
<p>Queries against HBase tables could return incomplete results if the <codeph>WHERE</codeph> clause included string comparisons using literals
containing escaped quotation marks.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-2133" scope="external" format="html">IMPALA-2133</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1929-238">
<title>Avoiding a DCHECK of NULL hash table in spilled right joins</title>
<!-- Real JIRA title: Crash because PHJ::NextSpilledProbeRowBatch() tries to use a NULL hash_tbl -->
<conbody>
<p>
A query could encounter a serious error if it contained a <codeph>RIGHT OUTER</codeph>, <codeph>RIGHT ANTI</codeph>, or <codeph>FULL OUTER</codeph> join clause
and approached the memory limit on a host so that the <q>spill to disk</q> mechanism was activated.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1929" scope="external" format="html">IMPALA-1929</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.5" id="fixed_issues_215">
<title>Issues Fixed in the 2.1.5 Release / CDH 5.3.6</title>
<conbody>
<p>
This section lists the most significant Impala issues fixed in Impala 2.1.5 for CDH 5.3.6.
</p>
<p> For the full list of Impala fixed issues in this release, see <!-- Don't think we have quite the right combination of conditional options to make this reference be a web link in Impala PDF and a normal xref in Release Notes PDF. -->
<xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_536.html"
format="html" scope="external">the CDH 5.3.6 release notes</xref><xref
href="cdh_rn_fixed_in_53.xml#fixed_issues_536" audience="integrated"
/>. </p>
<note conref="../shared/impala_common.xml#common/only_cdh5_21x"/>
</conbody>
<!--
3a8716c Parquet: Fix value def level when max def level is 0
856ee01 IMPALA-1774 Allow querying Parquet tables with complex-typed columns as long as those columns are not selected.
82b8ee4 IMPALA-1919: Avoid calling ProcessBatch with out_batch->AtCapacity in right joins
11c852c IMPALA-2002: Provide way to cache ext data source classes
2c4f716 IMPALA-1726: Move JNI / Thrift utilities to separate header
-->
<concept id="IMPALA-1919">
<title>Avoid calling ProcessBatch with out_batch-&gt;AtCapacity in right joins</title>
<conbody>
<p>
Queries including <codeph>RIGHT OUTER JOIN</codeph>, <codeph>RIGHT ANTI JOIN</codeph>, or <codeph>FULL OUTER JOIN</codeph>
clauses and involving a high volume of data could encounter a serious error.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1919" scope="external" format="html">IMPALA-1919</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.4" id="fixed_issues_214">
<title>Issues Fixed in the 2.1.4 Release / CDH 5.3.4</title>
<conbody>
<p>
This section lists the most significant Impala issues fixed in Impala 2.1.4 for CDH 5.3.4.
<ph conref="../shared/impala_common.xml#common/impala_214_redux"/>
</p>
<p> For the full list of Impala fixed issues in Impala 2.1.4 for CDH
5.3.4, see <xref audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_534.html"
format="html" scope="external">the CDH 5.3.4 release notes</xref><xref
href="cdh_rn_fixed_in_53.xml#fixed_issues_534" audience="integrated"
/>. </p>
<note conref="../shared/impala_common.xml#common/only_cdh5_21x"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
d8bce5d IMPALA-1519: Fix wrapping of exprs via a TupleIsNullPredicate with analytics.
42e4b8d IMPALA-1952: Expand parsing of decimals to include scientific notation
d4cdb61 IMPALA-1860: INSERT/CTAS evaluates and applies constant predicates.
b97920d IMPALA-1900: Assign predicates below analytic functions with a compatible partition by clause
cdf547a IMPALA-1376: Split up Planner into multiple classes.
47d87ea IMPALA-1888: FIRST_VALUE may produce incorrect results with preceding windows
16725e5 IMPALA-1559: FIRST_VALUE rewrite fn type might not match slot type
c132c75 IMPALA-1808: AnalyticEvalNode cannot handle partition/order by exprs with NaN
082b806 IMPALA-1562: AnalyticEvalNode not properly handling nullable tuples
-->
<concept id="IMPALA-1519">
<title>Crash: impala::TupleIsNullPredicate::Prepare</title>
<conbody>
<p>
When expressions that tested for <codeph>NULL</codeph> were used in combination with analytic functions, an error could occur.
(The original crash issue was fixed by an earlier patch.)
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1519" scope="external" format="html">IMPALA-1519</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1952-214">
<title>Expand parsing of decimals to include scientific notation</title>
<conbody>
<p>
<codeph>DECIMAL</codeph> literals could include <codeph>e</codeph> scientific notation.
For example, now <codeph>CAST(1e3 AS DECIMAL(5,3))</codeph> is a valid expression.
Formerly it returned <codeph>NULL</codeph>.
Some scientific expressions might have worked before in <codeph>DECIMAL</codeph> context, but only when the scale was 0.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1952" scope="external" format="html">IMPALA-1952</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1860">
<title>INSERT/CTAS evaluates and applies constant predicates.</title>
<conbody>
<p>
An <codeph>INSERT OVERWRITE</codeph> statement would write new data, even if
a constant clause such as <codeph>WHERE 1 = 0</codeph> should have
prevented it from writing any rows.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1860" scope="external" format="html">IMPALA-1860</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1900">
<title>Assign predicates below analytic functions with a compatible partition by clause</title>
<conbody>
<p>
If the <codeph>PARTITION BY</codeph> clause in an analytic function refers to partition key columns in a partitioned table,
now Impala can perform partition pruning during the analytic query.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1900" scope="external" format="html">IMPALA-1900</xref></p>
</conbody>
</concept>
<!-- This is just code refactoring that should have no user-visible aspects.
(Although it did introduce some bugs that were the subject of other fixes.)
<concept id="IMPALA-1376">
<title>Split up Planner into multiple classes.</title>
<conbody>
<p>
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1376" scope="external" format="html">IMPALA-1376</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1888">
<title>FIRST_VALUE may produce incorrect results with preceding windows</title>
<conbody>
<p>
A query using the <codeph>FIRST_VALUE</codeph> analytic function
and a window defined with the <codeph>PRECEDING</codeph> keyword
could produce wrong results.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1888" scope="external" format="html">IMPALA-1888</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1559">
<title>FIRST_VALUE rewrite fn type might not match slot type</title>
<conbody>
<p>
A query referencing a <codeph>DECIMAL</codeph> column with the <codeph>FIRST_VALUE</codeph> analytic function
could encounter an error.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1559" scope="external" format="html">IMPALA-1559</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1808">
<title>AnalyticEvalNode cannot handle partition/order by exprs with NaN</title>
<conbody>
<p>
A query using an analytic function
could encounter an error if the
evaluation of an analytic <codeph>ORDER BY</codeph> or <codeph>PARTITION</codeph> expression
resulted in a NaN value, for example if the <codeph>ORDER BY</codeph> or <codeph>PARTITION</codeph>
contained a division operation where both operands were zero.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1808" scope="external" format="html">IMPALA-1808</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1562">
<title>AnalyticEvalNode not properly handling nullable tuples</title>
<conbody>
<p>
An analytic function containing only an <codeph>OVER</codeph> clause could
encounter an error if another part of the query (specifically an outer join)
produced all-<codeph>NULL</codeph> tuples.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1562" scope="external" format="html">IMPALA-1562</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.3" id="fixed_issues_213">
<title>Issues Fixed in the 2.1.3 Release / CDH 5.3.3</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.1.3.
</p>
<p> For the full list of fixed issues in Impala 2.1.3, see <xref
audience="standalone"
href="http://www.cloudera.com/documentation/enterprise/latest/topics/cdh_rn_fixed_in_533.html"
format="html" scope="external">the CDH 5.3.3 release notes</xref><xref
href="cdh_rn_fixed_in_53.xml#fixed_issues_533" audience="integrated"
/>.
<!-- <xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.1.3%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>. -->
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_213"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
! = identified by Juan as a critical fix, so listing here.
? = not identified by Juan as a critical fix, so leaving in the big list under CDH fixed issues
but removing from the long descriptions here.
! IMPALA-1658 - Add compatibility flag for Hive-Parquet-Timestamps
! IMPALA-1738 - Use snprintf() instead of lexical_cast() in float-to-string casts
! IMPALA-1865 - Fix partition spilling cleanup when new stream OOMs
! IMPALA-1805 - Impala's ACLs check do not consider all group ACLs, only checked first one.
! IMPALA-1794 - Fix infinite loop opening or closing file with invalid metadata
! IMPALA-1801 - external-data-source-executor leaking global jni refs
! IMPALA-1712 - Unexpected remote bytes read counter was not being reset properly
? IMPALA-1820 - Start with small pages for hash tables during repartitioning
? IMPALA-1897 - Fixes for old hash join and agg
? IMPALA-1894 - Fix old aggregation node hash table cleanup
? IMPALA-1863 - Avoid deadlock across fragment instances
? IMPALA-1915 - Fix query hang in BufferedBlockMgr:FindBlock()
? IMPALA-1890 - Fixing a race between ~BufferedBlockMgr() and the WriteComplete() call
? IMPALA-1835 - Keep the fragment alive for TransmitData()
? IMPALA-1636 - Generalize index-based partition pruning to allow constant expressions
-->
<concept id="IMPALA-1658">
<title>Add compatibility flag for Hive-Parquet-Timestamps</title>
<conbody>
<p>
When Hive writes <codeph>TIMESTAMP</codeph> values, it represents them
in the local time zone of the server. Impala expects <codeph>TIMESTAMP</codeph>
values to always be in the UTC time zone, possibly leading to inconsistent
results depending on which component created the data files.
This patch introduces a new startup flag,
<codeph>-convert_legacy_hive_parquet_utc_timestamps</codeph>
for the <cmdname>impalad</cmdname> daemon.
Specify <codeph>-convert_legacy_hive_parquet_utc_timestamps=true</codeph>
to make Impala recognize Parquet data files written by Hive
and automatically adjust <codeph>TIMESTAMP</codeph>
values read from those files into the UTC time zone for
compatibility with other Impala <codeph>TIMESTAMP</codeph> processing.
Although this setting is currently turned off by default,
consider enabling it if practical in your environment,
for maximum interoperability with Hive-created Parquet files.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1658" scope="external" format="html">IMPALA-1658</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1738">
<title>Use snprintf() instead of lexical_cast() in float-to-string casts</title>
<conbody>
<p>
Converting a floating-point value to a <codeph>STRING</codeph> could be slower than necessary.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1738" scope="external" format="html">IMPALA-1738</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1865">
<title>Fix partition spilling cleanup when new stream OOMs</title>
<conbody>
<p>
Certain calls to aggregate functions with <codeph>STRING</codeph> arguments could encounter a serious error
when the system ran low on memory and attempted to activate the spill-to-disk mechanism.
The error message referenced the function <codeph>impala::AggregateFunctions::StringValGetValue</codeph>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1865" scope="external" format="html">IMPALA-1865</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1805-213">
<title>Impala's ACLs check do not consider all group ACLs, only checked first one.</title>
<conbody>
<p>
If the HDFS user ID associated with the <cmdname>impalad</cmdname> process had read or write access in HDFS based on group membership,
Impala statements could still fail with HDFS permission errors if that group was not the first listed group for that user ID.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1805" scope="external" format="html">IMPALA-1805</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1794-213">
<title>Fix infinite loop opening or closing file with invalid metadata</title>
<conbody>
<p>
Truncating a file in HDFS, after Impala had cached the file metadata, could produce a hang when Impala
queried a table containing that file.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1794" scope="external" format="html">IMPALA-1794</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1801">
<title>external-data-source-executor leaking global jni refs</title>
<conbody>
<p>
Successive calls to the data source API could result in excessive memory consumption,
with memory allocated but never freed.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1801" scope="external" format="html">IMPALA-1801</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1712-213">
<title>Spurious stale block locality messages</title>
<conbody>
<p>
Impala could issue messages stating the block locality metadata was stale,
when the metadata was actually fine.
The internal <q>remote bytes read</q> counter was not being reset properly.
This issue did not cause an actual slowdown in query execution,
but the spurious error could result in unnecessary debugging work
and unnecessary use of the <codeph>INVALIDATE METADATA</codeph> statement.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1712" scope="external" format="html">IMPALA-1712</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.2" id="fixed_issues_212">
<title>Issues Fixed in the 2.1.2 Release / CDH 5.3.2</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.1.2.
</p>
<p>
For the full list of fixed issues in Impala 2.1.2, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.1.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_212"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
IMPALA-1622: fix overflow in StringParser::StringToFloatInternal()
IMPALA-1623: unix_timestamp() does not return correct time
IMPALA-1535: Partition pruning with NULL - also in 5.2.4 / 2.0.3 release notes
IMPALA-1120: Fetch column statistics using Hive 0.13 bulk API - also in 5.2.4 / 2.0.3 release notes
-->
<concept id="IMPALA-1622">
<title>Impala incorrectly handles double numbers with more than 19 significant decimal digits</title>
<conbody>
<p>
When a floating-point value was read from a text file and interpreted as a <codeph>FLOAT</codeph>
or <codeph>DOUBLE</codeph> value, it could be incorrectly interpreted if it included more than
19 significant digits.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1622" scope="external" format="html">IMPALA-1622</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1623-213">
<title>unix_timestamp() does not return correct time</title>
<conbody>
<p>
The <codeph>unix_timestamp()</codeph> function could return an incorrect value (a constant value of 1).
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1623" scope="external" format="html">IMPALA-1623</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1535">
<title>Row Count Mismatch: Partition pruning with NULL</title>
<conbody>
<p>
A query against a partitioned table could return incorrect results if the <codeph>WHERE</codeph> clause
compared the partition key to <codeph>NULL</codeph> using operators such as <codeph>=</codeph> or <codeph>!=</codeph>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1535" scope="external" format="html">IMPALA-1535</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1120">
<title>Fetch column stats in bulk using new (Hive .13) HMS APIs</title>
<conbody>
<p>The performance of the <codeph>COMPUTE STATS</codeph> statement and queries was improved, particularly for wide tables.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1120" scope="external" format="html">IMPALA-1120</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.1.1" id="fixed_issues_211">
<title>Issues Fixed in the 2.1.1 Release / CDH 5.3.1</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.1.1.
</p>
<p>
For the full list of fixed issues in Impala 2.1.1, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.1.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept rev="2.1.1" id="IMPALA-1674-211">
<title>IMPALA-1556 causes memory leak with secure connections</title>
<conbody>
<p>
<cmdname>impalad</cmdname> daemons could experience a memory leak on clusters using Kerberos
authentication, with memory usage growing as more data is transferred across the secure channel, either
to the client program or between Impala nodes. The same issue affected LDAP-secured clusters to a lesser
degree, because the LDAP security only covers data transferred back to client programs.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1674" scope="external" format="html"/> IMPALA-1674
</p>
<!-- <p><b>Resolution:</b> Contact <keyword keyref="support_org"/> for a patch.</p> -->
</conbody>
</concept>
<concept rev="2.1.1" id="IMPALA-1668">
<title>TSaslServerTransport::Factory::getTransport() leaks transport map entries</title>
<conbody>
<p>
<cmdname>impalad</cmdname> daemons in clusters secured by Kerberos or LDAP could experience a slight
memory leak on each connection. The accumulation of unreleased memory could cause problems on
long-running clusters.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1668" scope="external" format="html">IMPALA-1668</xref>
</p>
<!-- <p><b>Resolution:</b> Contact <keyword keyref="support_org"/> for a patch.</p> -->
</conbody>
</concept>
</concept>
<concept rev="2.1.0" id="fixed_issues_210">
<title>Issues Fixed in the 2.1.0 Release / CDH 5.3.0</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.1.0.
</p>
<p>
For the full list of fixed issues in Impala 2.1.0, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-1455">
<title>Kerberos fetches 3x slower</title>
<conbody>
<p>
Transferring large result sets back to the client application on Kerberos
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1455" scope="external" format="html">IMPALA-1455</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1556">
<title>Compressed file needs to be hold on entirely in Memory</title>
<conbody>
<p>
Queries on gzipped text files required holding the entire data file and its uncompressed representation
in memory at the same time. <codeph>SELECT</codeph> and <codeph>COMPUTE STATS</codeph> statements could
fail or perform inefficiently as a result. The fix enables streaming reads for gzipped text, so that the
data is uncompressed as it is read.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1556" scope="external" format="html">IMPALA-1556</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1611">
<title>Cannot read hbase metadata with NullPointerException: null</title>
<conbody>
<p>
Impala might not be able to access HBase tables, depending on the associated levels of Impala and HBase
on the system.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1611" scope="external" format="html">IMPALA-1611</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-serious-201">
<title>Serious errors / crashes</title>
<conbody>
<p>
Improved code coverage in Impala testing uncovered a number of potentially serious errors that could
occur with specific query syntax. These errors are resolved in Impala 2.1.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1553" scope="external" format="html">IMPALA-1553</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1528" scope="external" format="html">IMPALA-1528</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1526" scope="external" format="html">IMPALA-1526</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1524" scope="external" format="html">IMPALA-1524</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1508" scope="external" format="html">IMPALA-1508</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1493" scope="external" format="html">IMPALA-1493</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1501" scope="external" format="html">IMPALA-1501</xref>
,
<xref href="https://issues.cloudera.org/browse/IMPALA-1483" scope="external" format="html">IMPALA-1483</xref>
<!--
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
, <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref>
-->
</p>
</conbody>
</concept>
</concept>
<!-- All 2.0.x subsections go under here -->
<concept rev="2.0.5" id="fixed_issues_205">
<title>Issues Fixed in the 2.0.5 Release / CDH 5.2.6</title>
<conbody>
<p>
For the full list of fixed issues in Impala 2.0.5, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0.5%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_205"/>
</conbody>
</concept>
<concept rev="2.0.4" id="fixed_issues_204">
<title>Issues Fixed in the 2.0.4 Release / CDH 5.2.5</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.0.4.
</p>
<p>
For the full list of fixed issues in Impala 2.0.4, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0.4%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_204"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
2c87782 IMPALA-1658: Add compatibility flag for Hive-Parquet-Timestamps
9c5d858 IMPALA-1794: Fix infinite loop opening/closing file w/ invalid metadata
Not going to list this one, 'external data source' feature is not documented or supported:
db3362f IMPALA-1801: external-data-source-executor leaking global jni refs
-->
<concept id="IMPALA-1658-526">
<title>Add compatibility flag for Hive-Parquet-Timestamps</title>
<conbody>
<p>
When Hive writes <codeph>TIMESTAMP</codeph> values, it represents them
in the local time zone of the server. Impala expects <codeph>TIMESTAMP</codeph>
values to always be in the UTC time zone, possibly leading to inconsistent
results depending on which component created the data files.
This patch introduces a new startup flag,
<codeph>-convert_legacy_hive_parquet_utc_timestamps</codeph>
for the <cmdname>impalad</cmdname> daemon.
Specify <codeph>-convert_legacy_hive_parquet_utc_timestamps=true</codeph>
to make Impala recognize Parquet data files written by Hive
and automatically adjust <codeph>TIMESTAMP</codeph>
values read from those files into the UTC time zone for
compatibility with other Impala <codeph>TIMESTAMP</codeph> processing.
Although this setting is currently turned off by default,
consider enabling it if practical in your environment,
for maximum interoperability with Hive-created Parquet files.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1658" scope="external" format="html">IMPALA-1658</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1794-526">
<title>IoMgr infinite loop opening/closing file when shorter than cached metadata size</title>
<conbody>
<p>
If a table data file was replaced by a shorter file outside of Impala,
such as with <codeph>INSERT OVERWRITE</codeph> in Hive producing an empty
output file, subsequent Impala queries could hang.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1794" scope="external" format="html">IMPALA-1794</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.0.3" id="fixed_issues_203">
<title>Issues Fixed in the 2.0.3 Release / CDH 5.2.4</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.0.3.
</p>
<p>
For the full list of fixed issues in Impala 2.0.3, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0.3%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_203"/>
<p outputclass="toc inpage"/>
</conbody>
<!--
IMPALA-1471: Bug in spilling of PHJ that was affecting left anti and outer joins.
IMPALA-1535: Partition pruning with NULL
IMPALA-1120: Fetch column statistics using Hive 0.13 bulk API this one fix the customer issue in this JIRA https://jira.cloudera.com/browse/CDH-24919 Stats on Avro column names using uppercase letters don't get persisted
-->
<concept id="IMPALA-1471">
<title>Anti join could produce incorrect results when spilling</title>
<conbody>
<p>
An anti-join query (or a <codeph>NOT EXISTS</codeph> operation that was rewritten internally into an anti-join) could produce incorrect results
if Impala reached its memory limit, causing the query to write temporary results to disk.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1471" scope="external" format="html">IMPALA-1471</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1535-52x">
<title>Row Count Mismatch: Partition pruning with NULL</title>
<conbody>
<p>
A query against a partitioned table could return incorrect results if the <codeph>WHERE</codeph> clause
compared the partition key to <codeph>NULL</codeph> using operators such as <codeph>=</codeph> or <codeph>!=</codeph>.
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1535" scope="external" format="html">IMPALA-1535</xref></p>
</conbody>
</concept>
<concept id="IMPALA-1120-52x">
<title>Fetch column stats in bulk using new (Hive .13) HMS APIs</title>
<conbody>
<p>The performance of the <codeph>COMPUTE STATS</codeph> statement and queries was improved, particularly for wide tables.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1120" scope="external" format="html">IMPALA-1120</xref></p>
</conbody>
</concept>
</concept>
<concept rev="2.0.2" id="fixed_issues_202">
<title>Issues Fixed in the 2.0.2 Release / CDH 5.2.3</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.0.2.
</p>
<p>
For the full list of fixed issues in Impala 2.0.2, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_202"/>
<p outputclass="toc inpage"/>
</conbody>
<!-- These fixed JIRAs are copied from from impala_fixed_issues.xml. Going forward, the content in this file is the baseline for future changes. -->
<concept id="IMPALA-1453">
<title>GROUP BY on STRING column produces inconsistent results</title>
<conbody>
<p>
Some operations in queries submitted through Hue or other HiveServer2 clients could produce inconsistent
results.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1453" scope="external" format="html">IMPALA-1453</xref>
</p>
</conbody>
</concept>
<concept id="IoMgr_522">
<title>Fix leaked file descriptor and excessive file descriptor use</title>
<conbody>
<p>
Impala could encounter an error from running out of file descriptors. The fix reduces the amount of time
file descriptors are kept open, and avoids leaking file descriptors when read operations encounter
errors.
</p>
</conbody>
</concept>
<concept id="IMPALA-1623-52x">
<title>unix_timestamp() does not return correct time</title>
<conbody>
<p>
The <codeph>unix_timestamp()</codeph> function could return a constant value <codeph>1</codeph> instead
of a representation of the time.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1623" scope="external" format="html">IMPALA-1623</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1586">
<title>Impala should randomly select cached replica</title>
<conbody>
<p>
To avoid putting too heavy a load on any one node, Impala now randomizes which scan node processes each
HDFS data block rather than choosing the first cached block replica.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1586" scope="external" format="html">IMPALA-1586</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1606">
<title>Impala does not always give short name to Llama.</title>
<conbody>
<p>
In clusters secured by Kerberos or LDAP, a discrepancy in internal transmission of user names could cause
a communication error with Llama.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1606" scope="external" format="html">IMPALA-1606</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1475">
<title>accept unmangled native UDF symbols</title>
<conbody>
<p>
The <codeph>CREATE FUNCTION</codeph> statement could report that it could not find a function entry point
within the <codeph>.so</codeph> file for a UDF written in C++, even if the corresponding function was
present.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1475" scope="external" format="html">IMPALA-1475</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="2.0.1" id="fixed_issues_201">
<title>Issues Fixed in the 2.0.1 Release / CDH 5.2.1</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.0.1.
</p>
<p>
For the full list of fixed issues in Impala 2.0.1, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-1416">
<title>Queries fail with metastore exception after upgrade and compute stats</title>
<conbody>
<p>
After running the <codeph>COMPUTE STATS</codeph> statement on an Impala table, subsequent queries on that
table could fail with the exception message <codeph>Failed to load metadata for table:
default.stats_test</codeph>.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1416" scope="external" format="html"/> IMPALA-1416
</p>
<p>
<b>Workaround:</b> Upgrading to CDH 5.2.1, or another level of CDH that includes the fix for HIVE-8627,
prevents the problem from affecting future <codeph>COMPUTE STATS</codeph> statements. On affected levels
of CDH, or for Impala tables that have become inaccessible, the workaround is to disable the
<codeph>hive.metastore.try.direct.sql</codeph> setting in the Hive metastore
<filepath>hive-site.xml</filepath> file and issue the <codeph>INVALIDATE METADATA</codeph> statement for
the affected table. You do not need to rerun the <codeph>COMPUTE STATS</codeph> statement for the table.
</p>
</conbody>
</concept>
</concept>
<concept rev="2.0.0" id="fixed_issues_200">
<title>Issues Fixed in the 2.0.0 Release / CDH 5.2.0</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 2.0.0.
</p>
<p>
For the full list of fixed issues in Impala 2.0.0, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+2.0%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
<!-- Preliminary list from Alex:
[IMPALA-89] - OVERWRITE behaviour is different from Hive for statically partitioned inserts
[IMPALA-642] - Conjunctive predicates on HBase table not working in some scenarios.
[IMPALA-648] - Timestamp Parser Does Not like "T"
[IMPALA-677] - The columns of the rhs or a LEFT SEMI JOIN should not be visible outside the ON clause.
[IMPALA-755] - Escaped quote within quotes hangs the shell
[IMPALA-757] - Like predicate with "%%" pattern returns empty result set.
[IMPALA-995] - Join Hint is dropped when used inside a view
[IMPALA-1072] - Parquet writer is miscounting the byte estimate of the current file.
[IMPALA-1087] - Loading UDF/UDA should return with the correct message.
[IMPALA-1100] - AnalysisException: duplicated inline view column alias, when definition has "with" and inline view
[IMPALA-1101] - WHERE condition ignored in simple query with RIGHT JOIN
[IMPALA-1102] - Query with self joined table may produce incorrect results
[IMPALA-1105] - Usage of TRUNC with string timestamp reliably crashes node
[IMPALA-1109] - Timestamp Cast Returns invalid TIMESTAMP
[IMPALA-1110] - group_concat agg function does not work with optional separator
[IMPALA-1118] - Incorrect plan after reordering predicates (inner join following outer join)
[IMPALA-1119] - Analysis Exception: CTAS, query runs fine on its own
[IMPALA-1121] - IllegalStateException upon JOIN of DECIMAL columns with different precision
[IMPALA-1123] - Combining fragments with compatible data partitions can lead to incorrect results due to type incompatibilities (missing casts).
[IMPALA-1126] - Unable to disable strict unicode mode in the shell
[IMPALA-1128] - IllegalArgumentException with UNION and BIGINT and DECIMAL(19, 0)
[IMPALA-1129] - Queries with NOT IN predicates do not handle nulls correctly and return wrong results when codegen is enabled.
[IMPALA-1133] - Ensure all webserver output is escaped
[IMPALA-1138] - Compute Stats does not report proper error
[IMPALA-1164] - NDV Aggregate Function returns a STRING not a BIGINT
[IMPALA-1165] - Predicate dropped: Inline view + DISTINCT aggregate in outer query
[IMPALA-1187] - Optionally auto transform multiple count distincts as NDV
[IMPALA-1224] - 'illegal state exception' on using scientific notation in where condition in the query for bigint data type
[IMPALA-1235] - impala-shell prompt gets out of synch with connected host
[IMPALA-1236] - Queries with union in inline view have empty resource requests
[IMPALA-1252] - Parquet writer double counts when computing the file size estimate leading to smaller than expected files
[IMPALA-1272] - setting compression codec that parquet can't handle crashes impalad
[IMPALA-1279] - Impala does not employ ACLs when checking path permissions for LOAD and INSERT
[IMPALA-1334] - Impala does not map principals to lowercase, affecting Sentry authorisation
[IMPALA-1336] - Set the shell's unix environment's locale settings to allow for UTF-8 compatibility.
[IMPALA-1353] - Reuse of a column in JOIN predicate may lead to incorrect results
[IMPALA-491] - Improve error message when queries fail due to BE nodes dying
[IMPALA-526] - Impala should report expression evaluation error instead of always returning null
[IMPALA-601] - Shell should optionally read its configuration from a file
[IMPALA-603] - Impala should be more robust against metastore connection failures and issue user-friendly error message if Hive metastore connection fails
[IMPALA-1058] - Don't throw away SUMMARY until next successful statement
[IMPALA-1104] - Allow creating Avro tables without column definitions. Allow COMPUTE STATS to always work on Impala-created Avro tables.
[IMPALA-700] - Show functions should also list built-in functions
[IMPALA-731] - DECODE function
[IMPALA-1080] - Implement "SET <query_option>" as SQL statement
[IMPALA-52] - Add support for compressed text
-->
</conbody>
<concept id="IMPALA-995">
<title>Join Hint is dropped when used inside a view</title>
<conbody>
<p>
Hints specified within a view query did not take effect when the view was queried, leading to slow
performance. As part of this fix, Impala now supports hints embedded within comments.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-995" scope="external" format="html">IMPALA-995"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1101">
<title>WHERE condition ignored in simple query with RIGHT JOIN</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1101" scope="external" format="html">IMPALA-1101"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1102">
<title>Query with self joined table may produce incorrect results</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1102" scope="external" format="html">IMPALA-1102"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1118">
<title>Incorrect plan after reordering predicates (inner join following outer join)</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1118" scope="external" format="html">IMPALA-1118"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1123">
<title>Combining fragments with compatible data partitions can lead to incorrect results due to type incompatibilities (missing casts).</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1123" scope="external" format="html">IMPALA-1123"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1165">
<title>Predicate dropped: Inline view + DISTINCT aggregate in outer query</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1165" scope="external" format="html">IMPALA-1165"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1353">
<title>Reuse of a column in JOIN predicate may lead to incorrect results</title>
<conbody>
<p>
Potential wrong results for some types of queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1353" scope="external" format="html">IMPALA-1353"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1105">
<title>Usage of TRUNC with string timestamp reliably crashes node</title>
<conbody>
<p>
Serious error for certain combinations of function calls and data types.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1105" scope="external" format="html">IMPALA-1105"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1109">
<title>Timestamp Cast Returns invalid TIMESTAMP</title>
<conbody>
<p>
Serious error for certain combinations of function calls and data types.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1109" scope="external" format="html">IMPALA-1109"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1121">
<title>IllegalStateException upon JOIN of DECIMAL columns with different precision</title>
<conbody>
<p>
<codeph>DECIMAL</codeph> columns with different precision could not be compared in join predicates.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1121" scope="external" format="html">IMPALA-1121"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1104">
<title>Allow creating Avro tables without column definitions. Allow COMPUTE STATS to always work on Impala-created Avro tables.</title>
<conbody>
<p>
Hive-created Avro tables with columns specified by a JSON file or literal could produce errors when
queried in Impala, and could not be used with the <codeph>COMPUTE STATS</codeph> statement. Now you can
create such tables in Impala to avoid such errors.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1104" scope="external" format="html">IMPALA-1104"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1133">
<title>Ensure all webserver output is escaped</title>
<conbody>
<p>
The Impala debug web UI did not properly encode all output.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1133" scope="external" format="html">IMPALA-1133"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1236">
<title>Queries with union in inline view have empty resource requests</title>
<conbody>
<p>
Certain queries could run without obeying the limits imposed by resource management.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1236" scope="external" format="html">IMPALA-1236"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1279">
<title>Impala does not employ ACLs when checking path permissions for LOAD and INSERT</title>
<conbody>
<p>
Certain <codeph>INSERT</codeph> and <codeph>LOAD DATA</codeph> statements could fail unnecessarily, if
the target directories in HDFS had restrictive HDFS permissions, but those permissions were overridden by
HDFS extended ACLs.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1279" scope="external" format="html">IMPALA-1279"</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1334">
<title>Impala does not map principals to lowercase, affecting Sentry authorisation</title>
<conbody>
<p>
In a Kerberos environment, the principal name was not mapped to lowercase, causing issues when a user
logged in with an uppercase principal name and Sentry authorization was enabled.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1334" scope="external" format="html">IMPALA-1334"</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.4.4" id="fixed_issues_144">
<title>Issues Fixed in the 1.4.4 Release / CDH 5.1.5</title>
<conbody>
<p>
For the list of fixed issues, see
<xref href="http://www.cloudera.com/documentation/enterprise/release-notes/topics/cdh_rn_fixed_in_51.html" scope="external" format="html">Issues Fixed in CDH 5.1.5</xref>
in the <cite>CDH 5 Release Notes</cite>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_144"/>
</conbody>
</concept>
<concept rev="1.4.3" id="fixed_issues_143">
<title>Issues Fixed in the 1.4.3 Release / CDH 5.1.4</title>
<conbody>
<p>
Impala 1.4.3 includes fixes to address what is known as the POODLE vulnerability in SSLv3. SSLv3 access is
disabled in the Impala debug web UI.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_143"/>
</conbody>
</concept>
<concept rev="1.4.2" id="fixed_issues_142">
<title>Issues Fixed in the 1.4.2 Release / CDH 5.1.3</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.4.2.
</p>
<p>
For the full list of fixed issues in Impala 1.4.2, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.4.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_143"/>
<p outputclass="toc inpage"/>
</conbody>
</concept>
<concept rev="1.4.1" id="fixed_issues_141">
<title>Issues Fixed in the 1.4.1 Release / CDH 5.1.2</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.4.1.
</p>
<p>
For the full list of fixed issues in Impala 1.4.1, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.4.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_141"/>
<p outputclass="toc inpage"/>
</conbody>
<concept id="CDH-20513">
<title>impalad terminating with Boost exception</title>
<conbody>
<p>
Occasionally, a non-trivial query run through Llama could encounter a serious error. The detailed error
in the log was:
</p>
<codeblock>boost::exception_detail::clone_impl
&lt;boost::exception_detail::error_info_injector&lt;boost::lock_error&gt; &gt;
</codeblock>
<!-- <p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/CDH-20513" scope="external" format="html">CDH-20513</xref></p> -->
<p>
<b>Severity:</b> High
</p>
</conbody>
</concept>
<concept id="CDH-20478">
<title>Impalad uses wrong string format when writing logs</title>
<conbody>
<p>
Impala log files could contain internal error messages due to a problem formatting certain strings. The
messages consisted of a Java call stack starting with:
</p>
<codeblock>jni-util.cc:177] java.util.MissingFormatArgumentException: Format specifier 's'
</codeblock>
<!-- <p><b>Bug:</b> <xref href="https://jira.cloudera.com/browse/CDH-20478" scope="external" format="html">CDH-20478</xref></p> -->
</conbody>
</concept>
<concept id="IMPALA-1107">
<title>Update HS2 client API.</title>
<conbody>
<p>
A downlevel version of the HiveServer2 API could cause difficulty retrieving the precision and scale of a
<codeph>DECIMAL</codeph> value.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1107" scope="external" format="html">IMPALA-1107</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1093">
<title>Impalad catalog updates can fail with error: "IllegalArgumentException: fromKey out of range" at com.cloudera.impala.catalog.CatalogDeltaLog</title>
<conbody>
<p>
The error in the title could occur following a DDL statement. This issue was discovered during internal
testing and has not been reported in customer environments.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1093" scope="external" format="html">IMPALA-1093</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1131">
<title>"Total" time counter does not capture all the network transmit time</title>
<conbody>
<p>
The time for some network operations was not counted in the report of total time for a query, making it
difficult to diagnose network-related performance issues.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1131" scope="external" format="html">IMPALA-1131</xref>
</p>
</conbody>
</concept>
<!--
<concept id="IMPALA-1130">
<title>Multi-line query with non UTF-8 compatible input throws UnicodeEncodeError</title>
<conbody>
<p>Although <cmdname>impala-shell</cmdname> added Unicode support in Impala 1.4.0, queries entered across multiple lines could still encounter errors related to Unicode processing.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-1130" scope="external" format="html">IMPALA-1130</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1149">
<title>Impala will crash when reading certain Avro files containing bytes data</title>
<conbody>
<p>
Certain Avro fields for byte data could cause Impala to be unable to read an Avro data file, even if the
field was not part of the Impala table definition. With this fix, Impala can now read these Avro data
files, although Impala queries cannot refer to the <q>bytes</q> fields.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1149" scope="external" format="html">IMPALA-1149</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1142">
<title>Support specifying a custom AuthorizationProvider in Impala</title>
<conbody>
<p>
The <codeph>--authorization_policy_provider_class</codeph> option for <cmdname>impalad</cmdname> was
added back. This option specifies a custom <codeph>AuthorizationProvider</codeph> class rather than the
default <codeph>HadoopGroupAuthorizationProvider</codeph>. It had been used for internal testing, then
removed in Impala 1.4.0, but it was considered useful by some customers.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1142" scope="external" format="html">IMPALA-1142</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.4.0" id="fixed_issues_140">
<title>Issues Fixed in the 1.4.0 Release / CDH 5.1.0</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.4.0.
</p>
<p>
For the full list of fixed issues in Impala 1.4.0, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.4%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<!--
[IMPALA-489] - impala-shell only works with ascii characters
[IMPALA-962] - The extended view definition SQL text in Views created by Impala should always have fully-qualified table names.
[IMPALA-741] - Impala forgets about partitions with non-existant locations
[IMPALA-834] - CREATE TABLE LIKE fails if source is a view
[IMPALA-864] - Inserting into table with many cols takes a long time.
[IMPALA-887] - Improve partition pruning time
[IMPALA-1003] - Improve compute stats performance
[IMPALA-185] - When I run CREATE TABLE new_table LIKE avro_table, the schema does not get mapped properly from an avro schema to a hive schema
[IMPALA-1046] - Race condition in IoMgr. Blocked ranges enqueued after cancel.
[IMPALA-1083] - Deadlock in scan node
-->
<!-- Use as a template for new issues.
<concept id="">
<title></title>
<conbody>
<p>
</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/" scope="external" format="html"></xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-1019">
<!-- This issue gets propagated to some older release notes too because it was backported as the only bug fix for Impala 1.3.2. -->
<title>Failed DCHECK in disk-io-mgr-reader-context.cc:174</title>
<conbody>
<p>
The serious error in the title could occur, with the supplemental message:
</p>
<codeblock>num_used_buffers_ &lt; 0: #used=-1 during cancellation HDFS cached data</codeblock>
<p>
The issue was due to the use of HDFS caching with data files accessed by Impala. Support for HDFS caching
in Impala was introduced in Impala 1.4.0 for CDH 5.1.0. The fix for this issue was backported to Impala
1.3.x, and is the only change in Impala 1.3.2 for CDH 5.0.4.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1019" scope="external" format="html">IMPALA-1019</xref>
</p>
<p>
<b>Workaround:</b> On CDH 5.0.x, upgrade to CDH 5.0.4 with Impala 1.3.2, where this issue is fixed. In
Impala 1.3.0 or 1.3.1 on CDH 5.0.x, do not use HDFS caching for Impala data files in Impala internal or
external tables. If some of these data files are cached (for example because they are used by other
components that take advantage of HDFS caching), set the query option
<codeph>DISABLE_CACHED_READS=true</codeph>. To set that option for all Impala queries across all
sessions, start <cmdname>impalad</cmdname> with the <codeph>-default_query_options</codeph> option and
include this setting in the option argument, or on a cluster managed by Cloudera Manager, fill in this
option setting on the <uicontrol>Impala Daemon</uicontrol> options page.
</p>
<p>
<b>Resolution:</b> This issue is fixed in Impala 1.3.2 for CDH 5.0.4. The addition of HDFS caching
support in Impala 1.4 means that this issue does not apply to any new level of Impala on CDH 5.
</p>
</conbody>
</concept>
<concept id="IMPALA-489">
<title>impala-shell only works with ASCII characters</title>
<conbody>
<p>
The <cmdname>impala-shell</cmdname> interpreter could encounter errors processing SQL statements
containing non-ASCII characters.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-489" scope="external" format="html">IMPALA-489</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-962">
<title>The extended view definition SQL text in Views created by Impala should always have fully-qualified table names</title>
<conbody>
<p>
When a view was accessed while inside a different database, references to tables were not resolved unless
the names were fully qualified when the view was created.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-962" scope="external" format="html">IMPALA-962</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-741">
<title>Impala forgets about partitions with non-existant locations</title>
<conbody>
<p>
If an <codeph>ALTER TABLE</codeph> specified a non-existent HDFS location for a partition, afterwards
Impala would not be able to access the partition at all.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-741" scope="external" format="html">IMPALA-741</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-834">
<title>CREATE TABLE LIKE fails if source is a view</title>
<conbody>
<p>
The <codeph>CREATE TABLE LIKE</codeph> clause was enhanced to be able to create a table with the same
column definitions as a view. The resulting table is a text table unless the <codeph>STORED AS</codeph>
clause is specified, because a view does not have an associated file format to inherit.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-834" scope="external" format="html">IMPALA-834</xref>
</p>
</conbody>
</concept>
<!-- Not really fixed yet. Dupe of IMPALA-428 which is unresolved.
<concept id="IMPALA-864">
<title>Inserting into table with many cols takes a long time.</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-864" scope="external" format="html">IMPALA-864</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-887">
<title>Improve partition pruning time</title>
<conbody>
<p>
Operations on tables with many partitions could be slow due to the time to evaluate which partitions were
affected. The partition pruning code was speeded up substantially.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-887" scope="external" format="html">IMPALA-887</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1003">
<title>Improve compute stats performance</title>
<conbody>
<p>
The performance of the <codeph>COMPUTE STATS</codeph> statement was improved substantially. The
efficiency of its internal operations was improved, and some statistics are no longer gathered because
they are not currently used for planning Impala queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1003" scope="external" format="html">IMPALA-1003</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-185">
<title>When I run CREATE TABLE new_table LIKE avro_table, the schema does not get mapped properly from an avro schema to a hive schema</title>
<conbody>
<p>
After a <codeph>CREATE TABLE LIKE</codeph> statement using an Avro table as the source, the new table
could have incorrect metadata and be inaccessible, depending on how the original Avro table was created.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-185" scope="external" format="html">IMPALA-185</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1046">
<title>Race condition in IoMgr. Blocked ranges enqueued after cancel.</title>
<conbody>
<p>
Impala could encounter a serious error after a query was cancelled.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1046" scope="external" format="html">IMPALA-1046</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-1083">
<title>Deadlock in scan node</title>
<conbody>
<p>
A deadlock condition could make all <cmdname>impalad</cmdname> daemons hang, making the cluster
unresponsive for Impala queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1083" scope="external" format="html">IMPALA-1083</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.3.3" id="fixed_issues_133">
<title>Issues Fixed in the 1.3.3 Release / CDH 5.0.5</title>
<conbody>
<p>
Impala 1.3.3 includes fixes to address what is known as the POODLE vulnerability in SSLv3. SSLv3 access is
disabled in the Impala debug web UI.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_133"/>
<p outputclass="toc inpage"/>
</conbody>
</concept>
<concept rev="1.3.2" id="fixed_issues_132">
<title>Issues Fixed in the 1.3.2 Release / CDH 5.0.4</title>
<conbody>
<p>
This backported bug fix is the only change between Impala 1.3.1 and Impala 1.3.2.
</p>
<note conref="../shared/impala_common.xml#common/only_cdh5_133"/>
</conbody>
<concept id="IMPALA-1019-132">
<!-- This issue gets propagated to some older release notes too because it was backported as the only bug fix for Impala 1.3.2. -->
<title>Failed DCHECK in disk-io-mgr-reader-context.cc:174</title>
<conbody>
<p>
The serious error in the title could occur, with the supplemental message:
</p>
<codeblock>num_used_buffers_ &lt; 0: #used=-1 during cancellation HDFS cached data</codeblock>
<p>
The issue was due to the use of HDFS caching with data files accessed by Impala. Support for HDFS caching
in Impala was introduced in Impala 1.4.0 for CDH 5.1.0. The fix for this issue was backported to Impala
1.3.x, and is the only change in Impala 1.3.2 for CDH 5.0.4.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-1019" scope="external" format="html">IMPALA-1019</xref>
</p>
<p>
<b>Workaround:</b> On CDH 5.0.x, upgrade to CDH 5.0.4 with Impala 1.3.2, where this issue is fixed. In
Impala 1.3.0 or 1.3.1 on CDH 5.0.x, do not use HDFS caching for Impala data files in Impala internal or
external tables. If some of these data files are cached (for example because they are used by other
components that take advantage of HDFS caching), set the query option
<codeph>DISABLE_CACHED_READS=true</codeph>. To set that option for all Impala queries across all
sessions, start <cmdname>impalad</cmdname> with the <codeph>-default_query_options</codeph> option and
include this setting in the option argument, or on a cluster managed by Cloudera Manager, fill in this
option setting on the <uicontrol>Impala Daemon</uicontrol> options page.
</p>
<p>
<b>Resolution:</b> This issue is fixed in Impala 1.3.2 for CDH 5.0.4. The addition of HDFS caching
support in Impala 1.4 means that this issue does not apply to any new level of Impala on CDH 5.
</p>
</conbody>
</concept>
</concept>
<concept rev="1.3.1" id="fixed_issues_131">
<title>Issues Fixed in the 1.3.1 Release / CDH 5.0.3</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.3.1.
</p>
<p>
For the full list of fixed issues in Impala 1.3.1, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.3.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>. Because 1.3.1 is the first 1.3.x release for CDH 4, if you are on
CDH 4, also consult <xref href="impala_fixed_issues.xml#fixed_issues_130"/>.
<!-- <xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.3%22+and+resolution%3D%22Fixed%22" format="html" scope="external">list of issues fixed in 1.3.0</xref>. -->
</p>
<p outputclass="toc inpage"/>
</conbody>
<!--
[IMPALA-13] - Text data with carriage returns generates wrong results for count(*)
[IMPALA-488] - IO Mgr should take instance memory limit into account when creating io buffers
[IMPALA-827] - Impala should provide an option for new sub directories to automatically inherit the permissions of the parent directory
[IMPALA-888] - Illegal state exception (or crash) in query with UNION in inline view
[IMPALA-945] - INSERT column reordering doesn't work with SELECT clause
-->
<concept id="IMPALA-904">
<title>Impalad crashes when left joining inline view that has aggregate using distinct</title>
<conbody>
<p>
Impala could encounter a severe error in a query combining a left outer join with an inline view
containing a <codeph>COUNT(DISTINCT)</codeph> operation.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-904" scope="external" format="html">IMPALA-904</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-901">
<title>Incorrect result with group by query with null value in group by data</title>
<conbody>
<p>
If the result of a <codeph>GROUP BY</codeph> operation is <codeph>NULL</codeph>, the resulting row might
be omitted from the result set. This issue depends on the data values and data types in the table.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-901" scope="external" format="html">IMPALA-901</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-786">
<title>Drop Function does not clear local library cache</title>
<conbody>
<p>
When a UDF is dropped through the <codeph>DROP FUNCTION</codeph> statement, and then the UDF is
re-created with a new <codeph>.so</codeph> library or JAR file, the original version of the UDF is still
used when the UDF is called from queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-786" scope="external" format="html">IMPALA-786</xref>
</p>
<p>
<b>Workaround:</b> Restart the <cmdname>impalad</cmdname> daemon on all nodes.
</p>
</conbody>
</concept>
<concept id="IMPALA-762">
<title>Compute stats doesn't propagate underlying error correctly</title>
<conbody>
<p>
If a <codeph>COMPUTE STATS</codeph> statement encountered an error, the error message is <q>Query
aborted</q> with no further detail. Common reasons why a <codeph>COMPUTE STATS</codeph> statement might
fail include network errors causing the coordinator node to lose contact with other
<cmdname>impalad</cmdname> instances, and column names that match Impala
<xref href="impala_reserved_words.xml#reserved_words">reserved words</xref>. (Currently, if a column name
is an Impala reserved word, <codeph>COMPUTE STATS</codeph> always returns an error.)
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-762" scope="external" format="html">IMPALA-762</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-624">
<title>Inserts should respect changes in partition location</title>
<conbody>
<p>
After an <codeph>ALTER TABLE</codeph> statement that changes the <codeph>LOCATION</codeph> property of a
partition, a subsequent <codeph>INSERT</codeph> statement would always use a path derived from the base
data directory for the table.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-624" scope="external" format="html">IMPALA-624</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-13">
<title>Text data with carriage returns generates wrong results for count(*)</title>
<conbody>
<p>
A <codeph>COUNT(*)</codeph> operation could return the wrong result for text tables using nul characters
(ASCII value 0) as delimiters.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-13" scope="external" format="html">IMPALA-13</xref>
</p>
<p>
<b>Workaround:</b> Impala adds support for ASCII 0 characters as delimiters through the clause
<codeph>FIELDS TERMINATED BY '\0'</codeph>.
</p>
</conbody>
</concept>
<concept id="IMPALA-488">
<title>IO Mgr should take instance memory limit into account when creating io buffers</title>
<conbody>
<p>
Impala could allocate more memory than necessary during certain operations.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-488" scope="external" format="html">IMPALA-488</xref>
</p>
<p>
<b>Workaround:</b> Before issuing a <codeph>COMPUTE STATS</codeph> statement for a Parquet table, reduce
the number of threads used in that operation by issuing <codeph>SET NUM_SCANNER_THREADS=2</codeph> in
<cmdname>impala-shell</cmdname>. Then issue <codeph>UNSET NUM_SCANNER_THREADS</codeph> before continuing
with queries.
</p>
</conbody>
</concept>
<concept id="IMPALA-827">
<title>Impala should provide an option for new sub directories to automatically inherit the permissions of the parent directory</title>
<conbody>
<p>
When new subdirectories are created underneath a partitioned table by an <codeph>INSERT</codeph>
statement, previously the new subdirectories always used the default HDFS permissions for the
<codeph>impala</codeph> user, which might not be suitable for directories intended to be read and written
by other components also.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-827" scope="external" format="html">IMPALA-827</xref>
</p>
<p>
<b>Resolution:</b> In Impala 1.3.1 and higher, you can specify the
<codeph>--insert_inherit_permissions</codeph> configuration when starting the <cmdname>impalad</cmdname>
daemon.
</p>
</conbody>
</concept>
<concept id="IMPALA-888">
<title>Illegal state exception (or crash) in query with UNION in inline view</title>
<conbody>
<p>
Impala could encounter a severe error in a query where the <codeph>FROM</codeph> list contains an inline
view that includes a <codeph>UNION</codeph>. The exact type of the error varies.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-888" scope="external" format="html">IMPALA-888</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-945">
<title>INSERT column reordering doesn't work with SELECT clause</title>
<conbody>
<p>
The ability to specify a subset of columns in an <codeph>INSERT</codeph> statement, with order different
than in the target table, was not working as intended.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-945" scope="external" format="html">IMPALA-945</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.3.0" id="fixed_issues_130">
<title>Issues Fixed in the 1.3.0 Release / CDH 5.0.0</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.3.0, primarily issues that could cause
wrong results, or cause problems running the <codeph>COMPUTE STATS</codeph> statement, which is very
important for performance and scalability.
</p>
<p>
For the full list of fixed issues, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.3%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<!--
My first cut at the important issues:
Wrong results:
860
850
845
831
817
765
723
729
Bad stats:
777
903
880
867
-->
<concept id="IMPALA-860">
<title>Inner join after right join may produce wrong results</title>
<conbody>
<p>
The automatic join reordering optimization could incorrectly reorder queries with an outer join or semi
join followed by an inner join, producing incorrect results.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-860" scope="external" format="html">IMPALA-860</xref>
</p>
<p>
<b>Workaround:</b> Including the <codeph>STRAIGHT_JOIN</codeph> keyword in the query prevented the issue
from occurring.
</p>
</conbody>
</concept>
<concept id="IMPALA-850">
<title>Incorrect results with codegen on multi-column group by with NULLs.</title>
<conbody>
<p>
A query with a <codeph>GROUP BY</codeph> clause referencing multiple columns could introduce incorrect
<codeph>NULL</codeph> values in some columns of the result set. The incorrect <codeph>NULL</codeph>
values could appear in rows where a different <codeph>GROUP BY</codeph> column actually did return
<codeph>NULL</codeph>.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-850" scope="external" format="html">IMPALA-850</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-845">
<title>Using distinct inside aggregate function may cause incorrect result when using having clause</title>
<conbody>
<p>
A query could return incorrect results if it combined an aggregate function call, a
<codeph>DISTINCT</codeph> operator, and a <codeph>HAVING</codeph> clause, without a <codeph>GROUP
BY</codeph> clause.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-845" scope="external" format="html">IMPALA-845</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-831">
<title>Aggregation on union inside (inline) view not distributed properly.</title>
<conbody>
<p>
An aggregation query or a query with <codeph>ORDER BY</codeph> and <codeph>LIMIT</codeph> could be
executed on a single node in some cases, rather than distributed across the cluster. This issue affected
queries whose <codeph>FROM</codeph> clause referenced an inline view containing a <codeph>UNION</codeph>.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-831" scope="external" format="html">IMPALA-831</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-817">
<title>Wrong expression may be used in aggregate query if there are multiple similar expressions</title>
<conbody>
<p>
If a <codeph>GROUP BY</codeph> query referenced the same columns multiple times using different
operators, result rows could contain multiple copies of the same expression.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-817" scope="external" format="html">IMPALA-817</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-765">
<title>Incorrect results when changing the order of aggregates in the select list with codegen enabled</title>
<conbody>
<p>
Referencing the same columns in both a <codeph>COUNT()</codeph> and a <codeph>SUM()</codeph> call in the
same query, or some other combinations of aggregate function calls, could incorrectly return a result of
0 from one of the aggregate functions. This issue affected references to <codeph>TINYINT</codeph> and
<codeph>SMALLINT</codeph> columns, but not <codeph>INT</codeph> or <codeph>BIGINT</codeph> columns.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-765" scope="external" format="html">IMPALA-765</xref>
</p>
<p>
<b>Workaround:</b> Setting the query option <codeph>DISABLE_CODEGEN=TRUE</codeph> prevented the incorrect
results. Switching the order of the function calls could also prevent the issue from occurring.
</p>
</conbody>
</concept>
<concept id="IMPALA-723">
<title>Union queries give Wrong result in a UNION followed by SIGSEGV in another union</title>
<conbody>
<p>
A <codeph>UNION</codeph> query could produce a wrong result, followed by a serious error for a subsequent
<codeph>UNION</codeph> query.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-723" scope="external" format="html">IMPALA-723</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-729">
<title>String data in MR-produced parquet files may be read incorrectly</title>
<conbody>
<p>
Impala could return incorrect string results when reading uncompressed Parquet data files containing
multiple row groups. This issue only affected Parquet data files produced by MapReduce jobs.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-729" scope="external" format="html">IMPALA-729</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-777">
<title>Compute stats need to use quotes with identifiers that are Impala keywords</title>
<conbody>
<p>
Using a column or table name that conflicted with Impala keywords could prevent running the
<codeph>COMPUTE STATS</codeph> statement for the table.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-777" scope="external" format="html">IMPALA-777</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-903">
<title>COMPUTE STATS child queries do not inherit parent query options.</title>
<conbody>
<p>
The <codeph>COMPUTE STATS</codeph> statement did not use the setting of the <codeph>MEM_LIMIT</codeph>
query option in <cmdname>impala-shell</cmdname>, potentially causing problems gathering statistics for
wide Parquet tables.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-903" scope="external" format="html">IMPALA-903</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-880">
<title>COMPUTE STATS should update partitions in batches</title>
<conbody>
<p>
The <codeph>COMPUTE STATS</codeph> statement could be slow or encounter a timeout while analyzing a table
with many partitions.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-880" scope="external" format="html">IMPALA-880</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-867">
<title>Fail early (in analysis) when COMPUTE STATS is run against Avro table with no columns</title>
<conbody>
<p>
If the columns for an Avro table were all defined in the <codeph>TBLPROPERTIES</codeph> or
<codeph>SERDEPROPERTIES</codeph> clauses, the <codeph>COMPUTE STATS</codeph> statement would fail after
completely analyzing the table, potentially causing a long delay. Although the <codeph>COMPUTE
STATS</codeph> statement still does not work for such tables, now the problem is detected and reported
immediately.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-867" scope="external" format="html">IMPALA-867</xref>
</p>
<p>
<b>Workaround:</b> Re-create the Avro table with columns defined in SQL style, using the output of
<codeph>SHOW CREATE TABLE</codeph>. (See the JIRA page for detailed steps.)
</p>
</conbody>
</concept>
</concept>
<concept rev="1.2.4" id="fixed_issues_124">
<title>Issues Fixed in the 1.2.4 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.2.4. For the full list of fixed issues,
see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.2.4%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-818">
<title>The Catalog Server exits with an OOM error after a certain number of CREATE statements</title>
<conbody>
<p>
A large number of concurrent <codeph>CREATE TABLE</codeph> statements can cause the
<cmdname>catalogd</cmdname> process to consume excessive memory, and potentially be killed due to an
out-of-memory condition.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-818" scope="external" format="html">IMPALA-818</xref>
</p>
<p>
<b>Workaround:</b> Restart the <cmdname>catalogd</cmdname> service and re-try the DDL operations that
failed.
</p>
</conbody>
</concept>
<concept id="IMPALA-821">
<title>Catalog Server consumes excessive cpu cycle</title>
<conbody>
<p>
A large number of tables and partitions could result in unnecessary CPU overhead during Impala idle time
and background operations.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-821" scope="external" format="html">IMPALA-821</xref>
</p>
<p>
<b>Resolution:</b> Catalog server processing was optimized in several ways.
</p>
</conbody>
</concept>
<concept id="IMPALA-828">
<title>Query against Avro table crashes Impala with codegen enabled</title>
<conbody>
<p>
A query against a <codeph>TIMESTAMP</codeph> column in an Avro table could encounter a serious issue.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-828" scope="external" format="html">IMPALA-828</xref>
</p>
<p>
<b>Workaround:</b> Set the query option <codeph>DISABLE_CODEGEN=TRUE</codeph>
</p>
</conbody>
</concept>
<concept id="IMPALA-809">
<title>Statestore seems to send concurrent heartbeats to the same subscriber leading to repeated "Subscriber '<varname>hostname</varname>' is registering with statestore, ignoring update" messages</title>
<conbody>
<p>
Impala nodes could produce repeated error messages after recovering from a communication error with the
statestore service.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-809" scope="external" format="html">IMPALA-809</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-805">
<title>Join predicate incorrectly ignored</title>
<conbody>
<p>
A join query could produce wrong results if multiple equality comparisons between the same tables
referred to the same column.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-805" scope="external" format="html">IMPALA-805</xref>
</p>
</conbody>
</concept>
<concept id="IMP-1258">
<title>Query result differing between Impala and Hive</title>
<conbody>
<p>
Certain outer join queries could return wrong results. If one of the tables involved in the join was an
inline view, some tests from the <codeph>WHERE</codeph> clauses could be applied to the wrong phase of
the query.
</p>
<!-- <p><b>Bug:</b> <xref href="https://jira.cloudera.com/browse/IMP-1258" scope="external" format="html">IMP-1258</xref></p> -->
</conbody>
</concept>
<concept id="IMPALA-715">
<title>ArrayIndexOutOfBoundsException / Invalid query handle when reading large HBase cell</title>
<conbody>
<p>
An HBase cell could contain a value larger than 32 KB, leading to a serious error when Impala queries
that table. The error could occur even if the applicable row is not part of the result set.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-715" scope="external" format="html">IMPALA-715</xref>
</p>
<p>
<b>Workaround:</b> Use smaller values in the HBase table, or exclude the column containing the large
value from the result set.
</p>
</conbody>
</concept>
<concept id="IMPALA-735">
<title>select with distinct and full outer join, impalad coredump</title>
<conbody>
<p>
A query involving a <codeph>DISTINCT</codeph> operator combined with a <codeph>FULL OUTER JOIN</codeph>
could encounter a serious error.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-735" scope="external" format="html">IMPALA-735</xref>
</p>
<p>
<b>Workaround:</b> Set the query option <codeph>DISABLE_CODEGEN=TRUE</codeph>
</p>
</conbody>
</concept>
<!-- Actually a new feature, so removing from list of fixed issues.
<concept id="IMPALA-736">
<title>INVALIDATE METADATA <varname>table_name</varname> should add the table to the catalog if it doesn't already exist</title>
<conbody>
<p/>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-736" scope="external" format="html">IMPALA-736</xref></p>
<p><b>Severity:</b> High</p>
</conbody>
</concept>
-->
<concept id="IMPALA-749">
<title>Impala cannot load tables with more than Short.MAX_VALUE number of partitions</title>
<conbody>
<p>
If a table had more than 32,767 partitions, Impala would not recognize the partitions above the 32K limit
and query results could be incomplete.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-749" scope="external" format="html">IMPALA-749</xref>
</p>
</conbody>
</concept>
<concept id="IMP-1188">
<title>Various issues with HBase row key specification</title>
<conbody>
<p>
Queries against HBase tables could fail with an error if the row key was compared to a function return
value rather than a string constant. Also, queries against HBase tables could fail if the
<codeph>WHERE</codeph> clause contained combinations of comparisons that could not possibly match any row
key.
</p>
<p audience="Cloudera">
<b>Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-1188" scope="external" format="html">IMP-1188</xref>
</p>
<p>
<b>Resolution:</b> Queries now return appropriate results when function calls are used in the row key
comparison. For queries involving non-existent row keys, such as <codeph>WHERE <varname>row_key</varname>
IS NULL</codeph> or where the lower bound is greater than the upper bound, the query succeeds and returns
an empty result set.
</p>
</conbody>
</concept>
</concept>
<concept rev="1.2.3" id="fixed_issues_123">
<title>Issues Fixed in the 1.2.3 Release</title>
<conbody>
<!--
<p>
This section lists the most significant issues fixed in Impala 1.2.3. For the full list of fixed issues, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.2.3%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
-->
<p>
This release is a fix release that supercedes Impala 1.2.2, with the same features and fixes as 1.2.2 plus
one additional fix for compatibility with Parquet files generated outside of Impala by components such as
Hive, Pig, or MapReduce.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-720">
<title>Impala cannot read Parquet files with multiple row groups</title>
<conbody>
<p>
The <codeph>parquet-mr</codeph> library included with CDH4.5 writes files that are not readable by
Impala, due to the presence of multiple row groups. Queries involving these data files might result in a
crash or a failure with an error such as <q>Column chunk should not contain two dictionary pages</q>.
</p>
<p>
This issue does not occur for Parquet files produced by Impala <codeph>INSERT</codeph> statements,
because Impala only produces files with a single row group.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-720" scope="external" format="html">IMPALA-720</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.2.2" id="fixed_issues_122">
<title>Issues Fixed in the 1.2.2 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.2.2. For the full list of fixed issues,
see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.2.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="join_order">
<title>Order of table references in FROM clause is critical for optimal performance</title>
<conbody>
<p>
Impala does not currently optimize the join order of queries; instead, it joins tables in the order in
which they are listed in the FROM clause. Queries that contain one or more large tables on the right hand
side of joins (either an explicit join expressed as a JOIN statement or a join implicit in the list of
table references in the FROM clause) may run slowly or crash Impala due to out-of-memory errors. For
example:
</p>
<codeblock>SELECT ... FROM small_table JOIN large_table</codeblock>
<p>
<b>Anticipated Resolution</b>: Fixed in Impala 1.2.2.
</p>
<p>
<b>Workaround</b>: In Impala 1.2.2 and higher, use the <codeph>COMPUTE STATS</codeph> statement to gather
statistics for each table involved in the join query, after data is loaded. Prior to Impala 1.2.2, modify
the query, if possible, to join the largest table first. For example:
</p>
<codeblock>SELECT ... FROM small_table JOIN large_table</codeblock>
<p>
should be modified to:
</p>
<codeblock>SELECT ... FROM large_table JOIN small_table</codeblock>
</conbody>
</concept>
<concept id="IMPALA-694">
<title>Parquet in CDH4.5 writes data files that are sometimes unreadable by Impala</title>
<conbody>
<p>
Some Parquet files could be generated by other components that Impala could not read.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-694" scope="external" format="html">IMPALA-694</xref>
</p>
<p>
<b>Resolution:</b> The underlying issue is being addressed by a fix in the CDH Parquet libraries. Impala
1.2.2 works around the problem and reads the existing data files.
</p>
</conbody>
</concept>
<concept id="IMPALA-699">
<title>Deadlock in statestore when unregistering a subscriber and building a topic update</title>
<conbody>
<p>
The statestore service cound experience an internal error leading to a hang.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-699" scope="external" format="html">IMPALA-699</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-687">
<title>IllegalStateException when doing a union involving a group by</title>
<conbody>
<p>
A <codeph>UNION</codeph> query where one side involved a <codeph>GROUP BY</codeph> operation could cause
a serious error.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-687" scope="external" format="html">IMPALA-687</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-689">
<title>Impala Parquet Writer hit DCHECK in RleEncoder</title>
<conbody>
<p>
A serious error could occur when doing an <codeph>INSERT</codeph> into a Parquet table.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-689" scope="external" format="html">IMPALA-689</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-695">
<title>Hive UDF jars cannot be loaded by the FE</title>
<conbody>
<p>
If the JAR file for a Java-based Hive UDF was not in the <codeph>CLASSPATH</codeph>, the UDF could not be
called during a query.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-695" scope="external" format="html">IMPALA-695</xref>
</p>
</conbody>
</concept>
</concept>
<concept rev="1.2.1" id="fixed_issues_121">
<title>Issues Fixed in the 1.2.1 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.2.1. For the full list of fixed issues,
see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.2.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-525">
<title>Scanners use too much memory when reading past scan range</title>
<conbody>
<p>
While querying a table with long column values, Impala could over-allocate memory leading to an
out-of-memory error. This problem was observed most frequently with tables using uncompressed RCFile or
text data files.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-525" scope="external" format="html">IMPALA-525</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 1.2.1
</p>
</conbody>
</concept>
<concept id="IMPALA-657">
<title>Join node consumes memory way beyond mem-limit</title>
<conbody>
<p>
A join query could allocate a temporary work area that was larger than needed, leading to an
out-of-memory error. The fix makes Impala return unused memory to the system when the memory limit is
reached, avoiding unnecessary memory errors.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-657" scope="external" format="html">IMPALA-657</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 1.2.1
</p>
</conbody>
</concept>
<concept id="IMPALA-652">
<title>Excessive memory consumption when query tables with 1k columns (Parquet file)</title>
<conbody>
<p>
Impala could encounter an out-of-memory condition setting up work areas for Parquet tables with many
columns. The fix reduces the size of the allocated memory when not actually needed to hold table data.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-652" scope="external" format="html">IMPALA-652</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 1.2.1
</p>
</conbody>
</concept>
<!--
<concept id="IMPALA-520">
<title>Fail early (in analysis) when COMPUTE STATS is run against Avro table with no columns</title>
<conbody>
<p></p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-520" scope="external" format="html">IMPALA-520</xref></p>
<p><b>Severity:</b> </p>
<p><b>Resolution:</b> </p>
</conbody>
</concept>
-->
</concept>
<concept rev="1.2" id="fixed_issues_120">
<title>Issues Fixed in the 1.2.0 Beta Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.2 (beta). For the full list of fixed
issues, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.2%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
</conbody>
</concept>
<concept id="fixed_issues_111">
<title>Issues Fixed in the 1.1.1 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.1.1. For the full list of fixed issues,
see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.1.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-477">
<title>Unexpected LLVM Crash When Querying Doubles on CentOS 5.x</title>
<conbody>
<p>
Certain queries involving <codeph>DOUBLE</codeph> columns could fail with a serious error. The fix
improves the generation of native machine instructions for certain chipsets.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-477" scope="external" format="html">IMPALA-477</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-482">
<title>"block size is too big" error with Snappy-compressed RCFile containing null</title>
<conbody>
<p>
Queries could fail with a <q>block size is too big</q> error, due to <codeph>NULL</codeph> values in
RCFile tables using Snappy compression.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-482" scope="external" format="html">IMPALA-482</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-510">
<title>Cannot query RC file for table that has more columns than the data file</title>
<conbody>
<p>
Queries could fail if an Impala RCFile table was defined with more columns than in the corresponding
RCFile data files.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-510" scope="external" format="html">IMPALA-510</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-495">
<title>Views Sometimes Not Utilizing Partition Pruning</title>
<conbody>
<p>
Certain combinations of clauses in a view definition for a partitioned table could result in inefficient
performance and incorrect results.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-495" scope="external" format="html">IMPALA-495</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-485">
<title>Update the serde name we write into the metastore for Parquet tables</title>
<conbody>
<p>
The SerDes class string written into Parquet data files created by Impala was updated for compatibility
with Parquet support in Hive. See <xref href="impala_incompatible_changes.xml#incompatible_changes_111"/>
for the steps to update older Parquet data files for Hive compatibility.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-485" scope="external" format="html">IMPALA-485</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-534">
<title>Selective queries over large tables produce unnecessary memory consumption</title>
<conbody>
<p>
A query returning a small result sets from a large table could tie up memory unnecessarily for the
duration of the query.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-534" scope="external" format="html">IMPALA-534</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-538">
<title>Impala stopped to query AVRO tables</title>
<conbody>
<p>
Queries against Avro tables could fail depending on whether the Avro schema URL was specified in the
<codeph>TBLPROPERTIES</codeph> or <codeph>SERDEPROPERTIES</codeph> field. The fix causes Impala to check
both fields for the schema URL.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-538" scope="external" format="html">IMPALA-538</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-520">
<title>Impala continues to allocate more memory even though it has exceed its mem-limit</title>
<conbody>
<p>
Queries could allocate substantially more memory than specified in the <cmdname>impalad</cmdname>
<codeph>-mem_limit</codeph> startup option. The fix causes more frequent checking of the limit during
query execution.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-520" scope="external" format="html">IMPALA-520</xref>
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_110">
<title>Issues Fixed in the 1.1.0 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.1. For the full list of fixed issues, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-335">
<title>10-20% perf regression for most queries across all table formats</title>
<conbody>
<p>
This issue is due to a performance tradeoff between systems running many queries concurrently, and
systems running a single query. Systems running only a single query could experience lower performance
than in early beta releases. Systems running many queries simultaneously should experience higher
performance than in the beta releases.
</p>
<!-- Bug was actually deleted from JIRA system.
<p>
<b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-335" scope="external" format="html">IMPALA-335</xref>
</p>
-->
</conbody>
</concept>
<concept id="IMPALA-85">
<title>planner fails with "Join requires at least one equality predicate between the two tables" when "from" table order does not match "where" join order</title>
<conbody>
<p>
A query could fail if it involved 3 or more tables and the last join table was specified as a subquery.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-85" scope="external" format="html">IMPALA-85</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-257">
<title>Parquet writer uses excessive memory with partitions</title>
<conbody>
<p>
<codeph>INSERT</codeph> statements against partitioned tables using the Parquet format could use
excessive amounts of memory as the number of partitions grew large.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-257" scope="external" format="html">IMPALA-257</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-192">
<title>Comments in impala-shell in interactive mode are not handled properly causing syntax errors or wrong results</title>
<conbody>
<p>
The <cmdname>impala-shell</cmdname> interpreter did not accept comment entered at the command line,
making it problematic to copy and paste from scripts or other code examples.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-192" scope="external" format="html">IMPALA-192</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-364">
<title>Cancelled queries sometimes aren't removed from the inflight query list</title>
<conbody>
<p>
The Impala web UI would sometimes display a query as if it were still running, after the query was
cancelled.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-364" scope="external" format="html">IMPALA-364</xref>
</p>
</conbody>
</concept>
<concept id="IMPALA-396">
<title>Impala's 1.0.1 Shell Broke Python 2.4 Compatibility (AttributeError: 'module' object has no attribute 'field_size_limit)</title>
<conbody>
<p>
The <codeph>impala-shell</codeph> command in Impala 1.0.1 does not work with Python 2.4, which is the
default on Red Hat 5.
</p>
<p>
For the <codeph>impala-shell</codeph> command in Impala 1.0, the <codeph>-o</codeph> option (pipe output
to a file) does not work with Python 2.4.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-396" scope="external" format="html">IMPALA-396</xref>
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_101">
<title>Issues Fixed in the 1.0.1 Release</title>
<conbody>
<p>
This section lists the most significant issues fixed in Impala 1.0.1. For the full list of fixed issues,
see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project%3Dimpala+and+fixVersion%3D%22Impala+1.0.1%22+and+resolution%3D%22Fixed%22" format="html" scope="external">
this report in the JIRA system</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<!--
<concept id="IMPALA-325">
<title>Impala may crash when running large number of concurrent queries against a single coordinator</title>
<conbody>
<p>This issue occurs when a large number of queries are submitted to the same instance of <codeph>impalad</codeph> and are running simultaneously.</p>
<p><b>Bug:</b> <xref href="https://issues.cloudera.org/browse/IMPALA-325" scope="external" format="html">IMPALA-325</xref></p>
<p><b>Severity:</b> High</p>
<p><b>Resolution:</b> To be fixed in a future release</p>
<p><b>Workaround:</b> Use round-robin scheduling to submit queries to different Impala nodes.</p>
</conbody>
</concept>
-->
<concept id="IMPALA-333">
<title>Impala parquet scanner cannot read all data files generated by
other frameworks</title>
<conbody>
<p>
Impala might issue an erroneous error message when processing a Parquet data file produced by a
non-Impala Hadoop component.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-333" scope="external" format="html">IMPALA-333</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-293">
<title>Impala is unable to query RCFile tables which describe fewer columns than the file's header.</title>
<conbody>
<p>
If an RCFile table definition had fewer columns than the fields actually in the data files, queries would
fail.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-293" scope="external" format="html">IMPALA-293</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-351">
<title>Impala does not correctly substitute _HOST with hostname in --principal</title>
<conbody>
<p>
The <codeph>_HOST</codeph> placeholder in the <codeph>--principal</codeph> startup option was not
substituted with the correct hostname, potentially leading to a startup error in setups using Kerberos
authentication.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-351" scope="external" format="html">IMPALA-351</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-356">
<title>HBase query missed the last region</title>
<conbody>
<p>
A query for an HBase table could omit data from the last region.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-356" scope="external" format="html">IMPALA-356</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-300">
<title>Hbase region changes are not handled correctly</title>
<conbody>
<p>
After a region in an HBase table was split or moved, an Impala query might return incomplete or
out-of-date results.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-300" scope="external" format="html">IMPALA-300</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-349">
<title>Query state for successful create table is EXCEPTION</title>
<conbody>
<p>
After a successful <codeph>CREATE TABLE</codeph> statement, the corresponding query state would be
incorrectly reported as <codeph>EXCEPTION</codeph>.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-349" scope="external" format="html">IMPALA-349</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-358">
<title>Double check release of JNI-allocated byte-strings</title>
<conbody>
<p>
Operations involving calls to the Java JNI subsystem (for example, queries on HBase tables) could
allocate memory but not release it.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-358" scope="external" format="html">IMPALA-358</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="unix_timestamp">
<title>Impala returns 0 for bad time values in UNIX_TIMESTAMP, Hive returns NULL</title>
<conbody>
<p>
Impala returns 0 for bad time values in UNIX_TIMESTAMP, Hive returns NULL.
</p>
<p>
<b>Impala:</b>
</p>
<codeblock>impala&gt; select UNIX_TIMESTAMP('10:02:01') ;
impala&gt; 0</codeblock>
<p>
<b>Hive:</b>
</p>
<codeblock>hive&gt; select UNIX_TIMESTAMP('10:02:01') FROM tmp;
hive&gt; NULL</codeblock>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-16" scope="external" format="html">IMPALA-16</xref>
</p>
<p>
<b>Anticipated Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMP-231">
<title>INSERT INTO TABLE SELECT &lt;constant&gt; does not work.</title>
<conbody>
<p>
Insert INTO TABLE SELECT &lt;constant&gt; will not insert any data and may return an error.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-231" scope="external" format="html"/> ; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Anticipated Resolution</b>: Fixed
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_10">
<title>Issues Fixed in the 1.0 GA Release</title>
<conbody>
<p>
Here are the major user-visible issues fixed in Impala 1.0. For a full list of fixed issues, see
<xref href="https://issues.cloudera.org/secure/IssueNavigator.jspa?reset=true&amp;jqlQuery=project+%3D+impala+AND+resolution+%3D+Fixed+AND+fixVersion+%3D+%22Impala+1.0%22+ORDER+BY+key+ASC%2C+assignee+ASC%2C+priority+DESC" scope="external" format="html">this
report in the public issue tracker</xref>.
</p>
<p outputclass="toc inpage"/>
</conbody>
<concept id="IMPALA-183">
<title>Undeterministically receive "ERROR: unknown row bach destination..." and "ERROR: Invalid query handle" from impala shell when running union query</title>
<conbody>
<p>
A query containing both <codeph>UNION</codeph> and <codeph>LIMIT</codeph> clauses could intermittently
cause the <codeph>impalad</codeph> process to halt with a segmentation fault.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-183" scope="external" format="html">IMPALA-183</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-190">
<title>Insert with NULL partition keys results in SIGSEGV.</title>
<conbody>
<p>
An <codeph>INSERT</codeph> statement specifying a <codeph>NULL</codeph> value for one of the partitioning
columns could cause the <codeph>impalad</codeph> process to halt with a segmentation fault.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-190" scope="external" format="html">IMPALA-190</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-217">
<title>INSERT queries don't show completed profiles on the debug webpage</title>
<conbody>
<p>
In the Impala web user interface, the profile page for an <codeph>INSERT</codeph> statement showed
obsolete information for the statement once it was complete.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-217" scope="external" format="html">IMPALA-217</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-231">
<title>Impala HBase scan is very slow</title>
<conbody>
<p>
Queries involving an HBase table could be slower than expected, due to excessive memory usage on the
Impala nodes.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-231" scope="external" format="html">IMPALA-231</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-234">
<title>Add some library version validation logic to impalad when loading impala-lzo shared library</title>
<conbody>
<p>
No validation was done to check that the <codeph>impala-lzo</codeph> shared library was compatible with
the version of Impala, possibly leading to a crash when using LZO-compressed text files.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-234" scope="external" format="html">IMPALA-234</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
<p>
<b>Workaround:</b> Always upgrade the <codeph>impala-lzo</codeph> library at the same time as you upgrade
Impala itself.
</p>
</conbody>
</concept>
<concept id="IMPALA-238">
<title>Problems inserting into tables with TIMESTAMP partition columns leading table metadata loading failures and failed dchecks</title>
<conbody>
<p>
<codeph>INSERT</codeph> statements for tables partitioned on columns involving datetime types could
appear to succeed, but cause errors for subsequent queries on those tables. The problem was especially
serious if an improperly formatted timestamp value was specified for the partition key.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-238" scope="external" format="html">IMPALA-238</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-243">
<title>Ctrl-C sometimes interrupts shell in system call, rather than cancelling query</title>
<conbody>
<p>
Pressing Ctrl-C in the <codeph>impala-shell</codeph> interpreter could sometimes display an error and
return control to the shell, making it impossible to cancel the query.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-243" scope="external" format="html">IMPALA-243</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-252">
<title>Empty string partition value causes metastore update failure</title>
<conbody>
<p>
Specifying an empty string or <codeph>NULL</codeph> for a partition key in an <codeph>INSERT</codeph>
statement would fail.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-252" scope="external" format="html">IMPALA-252</xref>
</p>
<p>
<b>Resolution:</b> Fixed. The behavior for empty partition keys was made more compatible with the
corresponding Hive behavior.
</p>
</conbody>
</concept>
<concept id="IMPALA-266">
<title>Round() does not output the right precision</title>
<conbody>
<p>
The <codeph>round()</codeph> function did not always return the correct number of significant digits.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-266" scope="external" format="html">IMPALA-266</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-267">
<title>Cannot cast string literal to string</title>
<conbody>
<p>
Casting from a string literal back to the same type would cause an <q>invalid type cast</q> error rather
than leaving the original value unchanged.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-267" scope="external" format="html">IMPALA-267</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-288">
<title>Excessive mem usage for certain queries which are very selective</title>
<conbody>
<p>
Some queries that returned very few rows experienced unnecessary memory usage.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-288" scope="external" format="html">IMPALA-288</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-289">
<title>HdfsScanNode crashes in UpdateCounters</title>
<conbody>
<p>
A serious error could occur for relatively small and inexpensive queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-289" scope="external" format="html">IMPALA-289</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-292">
<title>Parquet performance issues on large dataset</title>
<conbody>
<p>
Certain aggregation queries against Parquet tables were inefficient due to lower than required thread
utilization.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-292" scope="external" format="html">IMPALA-292</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-295">
<title>impala not populating hive metadata correctly for create table</title>
<conbody>
<p>
The Impala <codeph>CREATE TABLE</codeph> command did not fill in the <codeph>owner</codeph> and
<codeph>tbl_type</codeph> columns in the Hive metastore database.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-295" scope="external" format="html">IMPALA-295</xref>
</p>
<p>
<b>Resolution:</b> Fixed. The metadata was made more Hive-compatible.
</p>
</conbody>
</concept>
<concept id="IMPALA-312">
<title>impala daemons die if statestore goes down</title>
<conbody>
<p>
The <codeph>impalad</codeph> instances in a cluster could halt when the <codeph>statestored</codeph>
process became unavailable.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-312" scope="external" format="html">IMPALA-312</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-67">
<title>Constant SELECT clauses do not work in subqueries</title>
<conbody>
<p>
A subquery would fail if the <codeph>SELECT</codeph> statement inside it returned a constant value rather
than querying a table.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-67" scope="external" format="html">IMPALA-67</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-90">
<title>Right outer Join includes NULLs as well and hence wrong result count</title>
<conbody>
<p>
The result set from a right outer join query could include erroneous rows containing
<codeph>NULL</codeph> values.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-90" scope="external" format="html">IMPALA-90</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
<concept id="IMPALA-204">
<title>Parquet scanner hangs for some queries</title>
<conbody>
<p>
The Parquet scanner non-deterministically hangs when executing some queries.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-204" scope="external" format="html">IMPALA-204</xref>
</p>
<p>
<b>Resolution:</b> Fixed
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_07">
<title>Issues Fixed in Version 0.7 of the Beta Release</title>
<conbody/>
<concept id="IMPALA-167">
<title>Impala does not gracefully handle unsupported Hive table types (INDEX and VIEW tables)</title>
<conbody>
<p>
When attempting to load metadata from an unsupported Hive table type (INDEX and VIEW tables), Impala
fails with an unclear error message.
</p>
<p>
Bug:
<xref href="https://issues.cloudera.org/browse/IMPALA-167" scope="external" format="html">IMPALA-167</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
</conbody>
</concept>
<concept id="IMP-474">
<title>DDL statements (CREATE/ALTER/DROP TABLE) are not supported in the Impala Beta Release</title>
<conbody>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-474" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
</conbody>
</concept>
<concept id="avro_support">
<title>Avro is not supported in the Impala Beta Release</title>
<conbody>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
<p>
<b>Workaround:</b> None
</p>
</conbody>
</concept>
<concept id="mem_limit_query">
<title>Impala does not currently allow limiting the memory consumption of a single query</title>
<conbody>
<p>
It is currently not possible to limit the memory consumption of a single query. All tables on the right
hand side of JOIN statements need to be able to fit in memory. If they do not, Impala may crash due to
out of memory errors.
</p>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
</conbody>
</concept>
<concept id="IMPALA-20">
<title>Aggregate of a subquery result set returns wrong results if the subquery contains a 'limit' and data is distributed across multiple nodes</title>
<conbody>
<p>
Aggregate of a subquery result set returns wrong results if the subquery contains a 'limit' clause and
data is distributed across multiple nodes. From the query plan, it looks like we are just summing the
results from each worker node.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-20" scope="external" format="html">IMPALA-20</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
</conbody>
</concept>
<concept id="IMPALA-144">
<title>Partition pruning for arbitrary predicates that are fully bound by a particular partition column</title>
<conbody>
<p> We currently cannot utilize a predicate like "country_code in ('DE',
'FR', 'US')" to do partitioning pruning, because that requires an
equality predicate or a binary comparison. </p>
<p>
We should create a superclass of planner.ValueRange, ValueSet, that can be constructed with an arbitrary
predicate, and whose isInRange(analyzer, valueExpr) constructs a literal predicate by substitution of the
valueExpr into the predicate.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-144" scope="external" format="html">IMPALA-144</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 0.7
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_06">
<title>Issues Fixed in Version 0.6 of the Beta Release</title>
<conbody/>
<concept id="IMP-56">
<title>Impala reads the NameNode address and port as command line parameters</title>
<conbody>
<p>
Impala reads the NameNode address and port as command line parameters rather than reading them from
<codeph>core-site.xml</codeph>. Updating the NameNode address in the <codeph>core-site.xml</codeph> file
does not propagate to Impala.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-56" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Severity:</b> Low
</p>
<p>
<b>Resolution:</b> Fixed in 0.6 - Impala reads the namenode location and port from the Hadoop
configuration files, though setting <codeph>-nn</codeph> and <codeph>-nn_port</codeph> overrides this.
Users are advised not to set <codeph>-nn</codeph> or <codeph>-nn_port</codeph>.
</p>
</conbody>
</concept>
<concept id="IMPALA-64">
<title>Queries may fail on secure environment due to impalad Kerberos ticket expiration</title>
<conbody>
<p>
Queries may fail on secure environment due to <codeph>impalad</codeph> Kerberos tickets expiring. This
can happen if the Impala <codeph>-kerberos_reinit_interval</codeph> flag is set to a value ten minutes or
less. This may lead to an <codeph>impalad</codeph> requesting a ticket with a lifetime that is less than
the time to the next ticket renewal.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-64" scope="external" format="html">IMPALA-64</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.6
</p>
</conbody>
</concept>
<concept id="IMPALA-48">
<title>Concurrent queries may fail when Impala uses Thrift to communicate with the Hive Metastore</title>
<conbody>
<p>
Concurrent queries may fail when Impala is using Thrift to communicate with part of the Hive Metastore
such as the Hive Metastore Service. In such a case, the error <codeph>get_fields failed: out of sequence
response"</codeph> may occur because Impala shared a single Hive Metastore Client connection across
threads. With Impala 0.6, a separate connection is used for each metadata request.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-48" scope="external" format="html">IMPALA-48</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.6
</p>
</conbody>
</concept>
<concept id="IMPALA-58">
<title>impalad fails to start if unable to connect to the Hive Metastore</title>
<conbody>
<p>
Impala fails to start if it is unable to establish a connection with the Hive Metastore. This behavior
was fixed, allowing Impala to start, even when no Metastore is available.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-58" scope="external" format="html">IMPALA-58</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.6
</p>
</conbody>
</concept>
<concept id="IMPALA-44">
<title>Impala treats database names as case-sensitive in some contexts</title>
<conbody>
<p>
In some queries (including "USE database" statements), database names are treated as case-sensitive. This
may lead queries to fail with an IllegalStateException.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-44" scope="external" format="html">IMPALA-44</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.6
</p>
</conbody>
</concept>
<concept id="IMPALA-18">
<title>Impala does not ignore hidden HDFS files</title>
<conbody>
<p>
Impala does not ignore hidden HDFS files, meaning those files prefixed with a period '.' or underscore
'_'. This diverges from Hive/MapReduce, which skips these files.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-18" scope="external" format="html">IMPALA-18</xref>
</p>
<p id="p_7">
<b>Resolution</b>: Fixed in 0.6
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_05">
<title>Issues Fixed in Version 0.5 of the Beta Release</title>
<conbody/>
<concept id="many_partitions">
<title>Impala may have reduced performance on tables that contain a large number of partitions</title>
<conbody>
<p>
Impala may have reduced performance on tables that contain a large number of partitions. This is due to
extra overhead reading/parsing the partition metadata.
</p>
<p>
<b>Resolution</b>: Fixed in 0.5
</p>
</conbody>
</concept>
<concept id="IMPALA-38">
<title>Backend client connections not getting cached causes an observable latency in secure clusters</title>
<conbody>
<p>
Backend impalads do not cache connections to the coordinator. On a secure cluster, this introduces a
latency proportional to the number of backend clients involved in query execution, as the cost of
establishing a secure connection is much higher than in the non-secure case.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-38" scope="external" format="html">IMPALA-38</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.5
</p>
</conbody>
</concept>
<concept id="IMPALA-30">
<title>Concurrent queries may fail with error: "Table object has not been been initialised : `PARTITIONS`"</title>
<conbody>
<p>
Concurrent queries may fail with error: <codeph>"Table object has not been been initialised :
`PARTITIONS`"</codeph>. This was due to a lack of locking in the Impala table/database metadata cache.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-30" scope="external" format="html">IMPALA-30</xref>
</p>
<p>
<b>Resolution</b>: Fixed in 0.5
</p>
</conbody>
</concept>
<concept id="IMPALA-15">
<title>UNIX_TIMESTAMP format behaviour deviates from Hive when format matches a prefix of the time value</title>
<conbody>
<p>
The Impala UNIX_TIMESTAMP(val, format) operation compares the length of format and val and returns NULL
if they do not match. Hive instead effectively truncates val to the length of the format parameter.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-15" scope="external" format="html">IMPALA-15</xref>
</p>
<p id="p_10">
<b>Resolution:</b> Fixed in 0.5
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_04">
<title>Issues Fixed in Version 0.4 of the Beta Release</title>
<conbody/>
<concept id="HIVE-3596">
<title>Impala fails to refresh the Hive metastore if a Hive temporary configuration file is removed</title>
<conbody>
<p>
Impala is impacted by Hive bug
<xref href="https://issues.apache.org/jira/browse/HIVE-3596" scope="external" format="html">HIVE-3596</xref>
which may cause metastore refreshes to fail if a Hive temporary configuration file is deleted (normally
located at <codeph>/tmp/hive-&lt;user&gt;-&lt;tmp_number&gt;.xml</codeph>). Additionally, the
impala-shell will incorrectly report that the failed metadata refresh completed successfully.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-611" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Anticipated Resolution:</b> To be fixed in a future release
</p>
<p>
<b>Workaround:</b> Restart the <codeph>impalad</codeph> service. Use the <codeph>impalad</codeph> log to
check for metadata refresh errors.
</p>
</conbody>
</concept>
<concept id="lpad_rpad">
<title>lpad/rpad builtin functions is not correct.</title>
<conbody>
<p>
The lpad/rpad builtin functions generate the wrong results.
</p>
<p>
<b>Resolution:</b> Fixed in 0.4
</p>
</conbody>
</concept>
<concept id="IMPALA-14">
<title>Files with .gz extension reported as 'not supported'</title>
<conbody>
<p>
Compressed files with extensions incorrectly generate an exception.
</p>
<p>
<b>Bug:</b>
<xref href="https://issues.cloudera.org/browse/IMPALA-14" scope="external" format="html">IMPALA-14</xref>
</p>
<p>
<b>Resolution:</b> Fixed in 0.4
</p>
</conbody>
</concept>
<concept id="large_limit_hang">
<title>Queries with large limits would hang.</title>
<conbody>
<p>
Some queries with large limits were hanging.
</p>
<p>
<b>Resolution:</b> Fixed in 0.4
</p>
</conbody>
</concept>
<concept id="IMP-601">
<title>Order by on a string column produces incorrect results if there are empty strings</title>
<conbody>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-601" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.4
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_03">
<title>Issues Fixed in Version 0.3 of the Beta Release</title>
<conbody/>
<concept id="IMP-298">
<title>All table loading errors show as unknown table</title>
<conbody>
<p>
If Impala is unable to load the metadata for a table for any reason, a subsequent query referring to that
table will return an <codeph>unknown table</codeph> error message, even if the table is known.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-298" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.3
</p>
</conbody>
</concept>
<concept id="IMP-298-2">
<title>A table that cannot be loaded will disappear from SHOW TABLES</title>
<conbody>
<p>
After failing to load metadata for a table, Impala removes that table from the list of known tables
returned in <codeph>SHOW TABLES</codeph>. Subsequent attempts to query the table returns 'unknown table',
even if the metadata for that table is fixed.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-298" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.3
</p>
</conbody>
</concept>
<concept id="IMP-581">
<title>Impala cannot read from HBase tables that are not created as external tables in the hive metastore.</title>
<conbody>
<p>
Attempting to select from these tables fails.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-581" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.3
</p>
</conbody>
</concept>
<concept id="IMP-137">
<title>Certain queries that contain OUTER JOINs may return incorrect results</title>
<conbody>
<p>
Queries that contain OUTER JOINs may not return the correct results if there are predicates referencing
any of the joined tables in the WHERE clause.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-137" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.3.
</p>
</conbody>
</concept>
</concept>
<concept id="fixed_issues_02">
<title>Issues Fixed in Version 0.2 of the Beta Release</title>
<conbody/>
<concept id="IMP-491">
<title>Subqueries which contain aggregates cannot be joined with other tables or Impala may crash</title>
<conbody>
<p>
Subqueries that contain an aggregate cannot be joined with another table or Impala may crash. For
example:
</p>
<codeblock>SELECT * FROM (SELECT sum(col1) FROM some_table GROUP BY col1) t1 JOIN other_table ON (...);</codeblock>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-491" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.2
</p>
</conbody>
</concept>
<concept id="IMP-497">
<title>An insert with a limit that runs as more than one query fragment inserts more rows than the limit.</title>
<conbody>
<p>
For example:
</p>
<codeblock>INSERT OVERWRITE TABLE test SELECT * FROM test2 LIMIT 1;</codeblock>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-497" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.2
</p>
</conbody>
</concept>
<concept id="IMP-535">
<title>Query with limit clause might fail.</title>
<conbody>
<p>
For example:
</p>
<codeblock>SELECT * FROM test2 LIMIT 1;</codeblock>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-535" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.2
</p>
</conbody>
</concept>
<concept id="IMP-270">
<title>Files in unsupported compression formats are read as plain text.</title>
<conbody>
<p>
Attempting to read such files does not generate a diagnostic.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-270" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.2
</p>
</conbody>
</concept>
<concept id="IMP-522">
<title>Impala server raises a null pointer exception when running an HBase query.</title>
<conbody>
<p>
When querying an HBase table whose row-key is string type, the Impala server may raise a null pointer
exception.
</p>
<p audience="Cloudera">
<b>Cloudera Bug:</b>
<xref href="https://jira.cloudera.com/browse/IMP-522" scope="external" format="html"/>; KI added 0.1
<i>Cloudera internal only</i>
</p>
<p>
<b>Resolution:</b> Fixed in 0.2
</p>
</conbody>
</concept>
</concept>
</concept>