mirror of
https://github.com/apache/impala.git
synced 2026-01-07 00:02:28 -05:00
Through emprical analysis, it was determined that setting the maximum number of connections to S3 as 1500 was optimal for functionality and performance. The hadoop set default of 15 connections could lead us to have deadlocks as our parquet scanner requires that we have multiple concurrent open connections proportional to the number of columns that we are scanning. Setting it to this high a value does not seem to have any negative implications. This has also been found to fix the Error(255): Unknown errors. Change-Id: Ide6f1326d5155b2e5f4da3a3f23df3f3d40c5a8d Reviewed-on: http://gerrit.cloudera.org:8080/3114 Reviewed-by: Sailesh Mukil <sailesh@cloudera.com> Tested-by: Internal Jenkins