Commit Graph

74 Commits

Author SHA1 Message Date
ishaan
0d0614765d Only use nproc to determine functional test concurrency when it's available in the os.
Some operating systems don't ship which nproc, which causes impala-config.sh to fail. This
change alleviates the problem by checking if nproc exists, and setting a reasonable
default if it fails.

Change-Id: Ic6e4d0fbce57eedc82163cfa17f71bdccbc38b51
Reviewed-on: http://gerrit.ent.cloudera.com:8080/3208
Reviewed-by: Ishaan Joshi <ishaan@cloudera.com>
Tested-by: Ishaan Joshi <ishaan@cloudera.com>
2014-06-20 12:52:08 -07:00
ishaan
f92c9a9335 Run local tests at lower concurrency.
Currently, we launch #nproc processes to run tests locally. This patch changes the default
to #proc/2, to not overload the system.

Change-Id: I8bca23eb7462a0c497df93f82a60d85835bedbe9
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2972
Reviewed-by: Ishaan Joshi <ishaan@cloudera.com>
Tested-by: Ishaan Joshi <ishaan@cloudera.com>
2014-06-19 12:48:29 -07:00
Henry Robinson
3e7e7ed0dc Fix impala-config.sh when JAVA_HOME not set
Change-Id: Iaefda2039de1a5aafc782bca582d3007abcf6eff
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2803
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: jenkins
(cherry picked from commit 48db5de6825cba8b6a1c1c658ff79a9641341dca)
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2814
Reviewed-by: Henry Robinson <henry@cloudera.com>
2014-06-03 19:48:57 -07:00
Lenni Kuff
79d43e1e41 Handle cases where environment variables are not defined in impala-config.sh
Change-Id: Iee2800cb02299a9ed26da6fd079e3a72fe2a2482
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2537
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: Lenni Kuff <lskuff@cloudera.com>
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2539
2014-05-13 08:22:42 -07:00
Lenni Kuff
f1d9c0f58b [CDH5] Update Impala's Sentry dependency to Sentry v1.3 (from v1.2)
This updates Impala to use Sentry v1.3 instead of Sentry v1.2. No major functionality
changed between Sentry versions, but some Sentry classes were moved and APIs changed.

Change-Id: I3765748d2cdbe00f59eefa3c971558efede38ebd
Reviewed-on: http://gerrit.ent.cloudera.com:8080/2319
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: jenkins
2014-05-13 02:57:07 -07:00
Lenni Kuff
13c794db91 [CDH5] Update dependency versions to CDH5.1.0
This just updates the versions, it doesn't touch anything in /thirdparty.
Change parquet version to append SNAPSHOT
Added hadoop-hbase-compat jar in AUX_CLASSPATH and mapreduce/*.jar to HDFS

Change-Id: I4471ef4476997371cf49a9d54cfa63f2fda126e4
2014-05-07 15:10:40 -07:00
Matthew Jacobs
a283d72cdd [cdh5] Add latest cdh5 hadoop, hbase, and hive snapshots to thirdparty
Change-Id: I60c93b259a26e86aca60f2b3b5b6226eabc0b5eb
2014-03-05 01:06:09 -08:00
Alex Behm
3d764619f7 Run Hive data loading through beeline instead of the Hive shell.
Fixes our log configuration to put the Hive logs in cluster_logs/hive.

Change-Id: I5d98581e35325f2173e4b3170e36bec42d33f8f3
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1497
Reviewed-by: Alex Behm <alex.behm@cloudera.com>
Tested-by: jenkins
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1615
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
2014-02-20 15:43:31 -08:00
Henry Robinson
5535a8a128 [CDH5] Set CDH major version to 5
Change-Id: Ibc36ed435dd36d3489d27a977bf1726bbf2927a1
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1306
Reviewed-by: Henry Robinson <henry@cloudera.com>
Tested-by: Henry Robinson <henry@cloudera.com>
2014-01-17 14:34:01 -08:00
Henry Robinson
241270044b Add CDH_MAJOR_VERSION environment variable
CDH_MAJOR_VERSION controls where HDFS data is written. In the future, we
can use its value to parameterise Jenkins jobs so that the right code is
run / data is generated.

Change-Id: Id2957df6d708bc6c50faf7a8a609aff5f9571662
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1293
Reviewed-by: Nong Li <nong@cloudera.com>
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: jenkins
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1305
Reviewed-by: Henry Robinson <henry@cloudera.com>
Tested-by: Henry Robinson <henry@cloudera.com>
2014-01-17 14:33:18 -08:00
Nong Li
53d7bbb97a [CDH5] Impala changes for updated thirdparty components.
Changes include:
  - version changes in impala-config
  - version changes in various loading scripts
  - hbase jars are no longer in hive/lib
  - mini-llama script changes
  - updates due to sentry api changes
  - JDBC tests disabled
  - unsupported types tests disabled.

Change-Id: If8cf1b7ad8e22aa4d23094b9a4b1047f7e9d93ee
2014-01-15 15:12:13 -08:00
Alex Behm
dc7b398bd3 Impala reserves resources from YARN via LLama.
Impala reserves resources from YARN via Llama and handles resources
preemptions by cancelling affected queries. Adds the Impala Resource
Broker for interacting with Llama. Refactors scheduler and coordinator
to move fragment-to-host assignment logic into scheduler. Local test
setup uses MiniLLama.

Change-Id: Ic7b0fe43de52d30f4207b4e65cce7e6a294e54e1
2014-01-15 15:12:04 -08:00
Alex Behm
60003ad211 [CDH5] Changes to make Impala work on CDH5. Mostly fixing up dependency versions. Minor code changes to address HBase API changes.
Change-Id: Icbbeb13eefa29e38286328d45600117a383cd106
2014-01-15 15:11:23 -08:00
Nong Li
752b8e3ee4 [CDH5] Added CDH5 beta2 versions of Hadoop, Hive, HBase and Llama to thirdparty.
Change-Id: Id033c0246c0ffdffd0c7703eaff9600086912380
2014-01-15 15:11:13 -08:00
Henry Robinson
6d9a7e290d Build Openldap as a thirdparty package
Change-Id: Ifbb0f468a23186f4160fceb462953bc321469c27
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1049
Reviewed-by: Henry Robinson <henry@cloudera.com>
Tested-by: Henry Robinson <henry@cloudera.com>
2014-01-08 10:54:20 -08:00
Henry Robinson
cb965d259a Build changes to use cyrus-sasl-2.1.23
Change-Id: Ie87e35945b6a415b0383cb75ffcae2fe35755623
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1047
Reviewed-by: Henry Robinson <henry@cloudera.com>
Tested-by: Henry Robinson <henry@cloudera.com>
2014-01-08 10:54:19 -08:00
Lenni Kuff
01660374c6 Additional fe and testdata pom.xml cleanup
This change cleans up our FE pom.xml file by removing unneeded
dependencies and system dependencies (system dependencies are now pulled in
from the Maven release repository).

The upside is that our pom is cleaner and it will also help reduce the likelihood of
broken dependencies since Maven will pull in the right versions.  The downside
is that we now pull in quite a few more JARs.

Note: I was unable to find release artifacts for Sentry and Parquet so I leaving
those as "system" for now.

Change-Id: I0b917b09a02243d78d89747591ab6bccacf7cf38

Saving changes

Change-Id: I3697a7b44884c40e077b3e354fef76625e1b881d
Reviewed-on: http://gerrit.ent.cloudera.com:8080/1011
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: jenkins
2014-01-08 10:54:17 -08:00
ishaan
81b80c702c Upgrade thirdparty to use CDH4.5 bits.
The following changes have been made:
  -- Update hbase
  -- Update hive
  -- Update hadoop
  -- Update the parquet version to 1.2.5

Change-Id: Id6ceaef0e9eebab27ffd408160116fa84ed300fb
2014-01-08 10:54:09 -08:00
Lenni Kuff
77e6430811 Add alias in impala-config.sh to gerrit-merge-verify script
This alias makes it easy to verify and merge Gerrit changes

Change-Id: Idb0b5c3e6c825721e375bdf0c86b5975df1ed4b9
Reviewed-on: http://gerrit.ent.cloudera.com:8080/836
Reviewed-by: Ishaan Joshi <ishaan@cloudera.com>
Tested-by: Lenni Kuff <lskuff@cloudera.com>
2014-01-08 10:53:50 -08:00
Henry Robinson
f02293bf5f Upgrade cyrus-sasl to 2.1.25
Change-Id: I1864c6fa0811f615777e9a7ed0aeef5494104449
Reviewed-on: http://gerrit.ent.cloudera.com:8080/733
Tested-by: jenkins
Reviewed-by: Henry Robinson <henry@cloudera.com>
2014-01-08 10:53:40 -08:00
Nong Li
4800995d44 Add execution for Hive UDFs.
Change-Id: I6a5ad96fed77e2b8a2701f21a917a8eb7a11d500
Reviewed-on: http://gerrit.ent.cloudera.com:8080/458
Reviewed-by: Nong Li <nong@cloudera.com>
Tested-by: Nong Li <nong@cloudera.com>
2014-01-08 10:53:25 -08:00
Lenni Kuff
79cdeac3d6 Consolidate test cluster under IMPALA_HOME/cluster_logs + store logs during data loading
Change-Id: I8f6239e4ccb0515c85bf80193a475788fb18dedb
Reviewed-on: http://gerrit.ent.cloudera.com:8080/518
Tested-by: jenkins
Reviewed-by: Nong Li <nong@cloudera.com>
2014-01-08 10:52:56 -08:00
Henry Robinson
dbed012396 Move from 'Mongoose' to 'Squeasel' webserver
We now maintain our own internal version of the Mongoose webserver,
renamed to 'Squeasel' for differentiation. This patch imports the new
code, and swaps all mentions of mongoose or mg_ for squeasel / sq_.

In the future, we might consider making Squeasel a git subproject so
that we can pull in changes more easily.

Change-Id: I83b595dc336a32f2c8aba59eee420b71274b681b
Reviewed-on: http://gerrit.ent.cloudera.com:8080/485
Tested-by: jenkins
Reviewed-by: Henry Robinson <henry@cloudera.com>
2014-01-08 10:52:55 -08:00
Aaron Davidson
d0665481d1 Vary number of build threads based on number of cores
Simply makes buildall.sh and the make_*.sh commands use 2 * ncores
build threads. ncores includes logical CPUs.

Change-Id: Ib3fbf1f1c8362c5bd3afab61f4d3030a50c51c10
Reviewed-on: http://gerrit.ent.cloudera.com:8080/288
Reviewed-by: Lenni Kuff <lskuff@cloudera.com>
Tested-by: jenkins
2014-01-08 10:52:22 -08:00
Lenni Kuff
4cd8830814 Simplify building CLASSPATH for run-jdbc-client.sh
Change-Id: I4dd77e0a83b87474260fdc19a93a0ccf4b766ad7
Reviewed-on: http://gerrit.ent.cloudera.com:8080/100
Reviewed-by: Henry Robinson <henry@cloudera.com>
Tested-by: Henry Robinson <henry@cloudera.com>
2014-01-08 10:51:58 -08:00
Nong Li
9af39ad4c4 Add check for llvm version in CMake. 2014-01-08 10:51:55 -08:00
Lenni Kuff
8c264f0395 Added Sentry v1.1.0 to thirdparty 2014-01-08 10:51:48 -08:00
Lenni Kuff
9fb3f037e3 Set PPROF_PATH in impala-config so pprof is run when HEAPCHECK fails 2014-01-08 10:51:32 -08:00
Skye Wanderman-Milne
e7ea89524b Upgrade Avro library to 1.7.4 2014-01-08 10:51:06 -08:00
Lenni Kuff
7ac88e1fa9 IMPALA-400: Add support for SQL statement authorization
This changes adds support for SQL statement authorization in Impala. The authorization
works by updating the Catalog API to require a User + Privilege when getting Table/Db
objects (and in the future can be extended to cover columns as well).
If the user doesn't have permission to access the object, an AuthorizationException is
thrown. The authorization checks are done during analysis as new Catalog objects are
encountered.

These changes build on top of the Hive Access code which handles the actually
processing of authorization requests.  The authorization is currently based
on a "policy file" which will be stored in HDFS. This policy file is read once
on startup and then reloaded every 5 minutes. It can also be reloaded on a
specific impalad by executing a "refresh" command.

Authorization is enabled by setting:
--server_name='server1'
and then pointing the impalad to the policy file using the flag:
--authorization_policy_file=/path/to/policy/file

any authorization configuration problems will result in impalad failing to
start.
2014-01-08 10:50:56 -08:00
Lenni Kuff
00b5f65e08 Enable Java signal-chaining in local test environment
From the Java documentation:

"Signal-chaining enables the Java Platform to better interoperate with
native code that installs its own signal handlers... The signal-chaining facility offers:

A. Support for pre-installed signal handlers when the HotSpot VM is created.
B. Support for signal handler installation after the HotSpot VM is created, inside JNI
code or from another native thread."
2014-01-08 10:50:53 -08:00
Lenni Kuff
97a1d4d8b8 Update Hive to CDH4.3.0 2014-01-08 10:50:44 -08:00
Lenni Kuff
5ccd53d062 Add CDH 4.3.0 HBase to thirdparty 2014-01-08 10:50:38 -08:00
ishaan
227102f4aa Add cdh4.3 bits to Impala. 2014-01-08 10:50:35 -08:00
Lenni Kuff
2f7198292a Add support for auxiliary workloads, tests, and datasets
This change adds support for auxiliary worksloads, tests, and datasets. This is useful
to augment the regular test runs with some additional tests that do not belong in the
main Impala repo.
2014-01-08 10:50:32 -08:00
Alan Choi
b282175461 IMPA-213 Disable DN server check; disable all checks if impala cannot detech cdh version 2014-01-08 10:49:44 -08:00
Alan Choi
c419ae1891 Add 4.1 direct read configuration check
Impala detects the HDFS version by reading the Namenode web UI and run
the corresponding check.

On 4.1, Impala tries to check the datanode (server side) config by reading
the datanode web UI.
2014-01-08 10:49:31 -08:00
Sean Mackrory
e260ac202b IMP-817: Rebuilding Hadoop libraries with HDFS fix 2014-01-08 10:49:11 -08:00
Elliott Clark
fdda4d70eb fix dirname call in impala-config for zsh 2014-01-08 10:48:31 -08:00
Sean Mackrory
fa4393bc1f Including LD_LIBRARY_PATH in summary of configuration variables 2014-01-08 10:48:26 -08:00
Sean Mackrory
05dd34a819 IMP-594: Impala binaries are embedding build-time locations of libraries 2014-01-08 10:48:25 -08:00
Lenni Kuff
5be657f4ce Fixed pythonpath after shell egg changes 2014-01-08 10:48:06 -08:00
Skye Wanderman-Milne
8b87099998 IMPALA-2: Support for Avro data files
Adds HdfsAvroScanner, as well as modifies the sequence scanners to be more general.
2014-01-08 10:48:05 -08:00
ishaan
846301c394 Change make_shell_tarball to be able to build external python modules in the repository. 2014-01-08 10:48:00 -08:00
Nong Li
7001fb103e Move Impala to CDH4.2 RC2 2014-01-08 10:47:50 -08:00
Alan Choi
a6c58f195b IMP-546 hbase-table-scanner needs global ref for scan_ and resultscanner_
Enable JNI checking
Add single node test for HBase queries
2014-01-08 10:47:37 -08:00
Lenni Kuff
1a2695781d Add support for targeting JDBC via run-workload and add Impala Jdbc Client tool 2014-01-08 10:47:29 -08:00
Henry Robinson
7ba437a52e Code changes to build against thrift 0.9.0 in thirdparty/ 2014-01-08 10:47:22 -08:00
Henry Robinson
35e7e2a7a9 Move thirdparty library versions to environment variables 2014-01-08 10:46:38 -08:00
Michael Ubell
8a5297a526 Add HdfsLzoTextScanner 2014-01-08 10:46:35 -08:00