impala

mirror of https://github.com/apache/impala.git synced 2026-01-01 00:00:20 -05:00

Author	SHA1	Message	Date
Nong Li	f9dd32724c	Cleanup build scripts. Consolidated our build scripts and added the -notests option which skips build the BE tests. Change-Id: Ida6aa064b7fe47e535c142b9af92b7c158e83c32 Reviewed-on: http://gerrit.ent.cloudera.com:8080/2043 Reviewed-by: Nong Li <nong@cloudera.com> Tested-by: jenkins Reviewed-on: http://gerrit.ent.cloudera.com:8080/2201	2014-04-13 17:11:39 -07:00
Lenni Kuff	86f69fb96f	IMP-1306: Fix build scripts to properly generate Impala version info for packaging builds The problem was that were were deleting the version.info file because the default of gen_build_version.py recently changed from --noclean to --clean. Also fixed a bug in the shell version generation and made debugging a bit easier by dumping the contents of version.info whenever it is generated. Change-Id: I764d01c9e46eed1bd39de79bf076c15afa599486 Reviewed-on: http://gerrit.ent.cloudera.com:8080/1901 Reviewed-by: Alex Behm <alex.behm@cloudera.com> Tested-by: Lenni Kuff <lskuff@cloudera.com> (cherry picked from commit fa673b4d3342fc825ee7fa942bd254234d222906) Reviewed-on: http://gerrit.ent.cloudera.com:8080/1910 Reviewed-by: Lenni Kuff <lskuff@cloudera.com>	2014-03-14 08:45:16 -07:00
Aaron Davidson	d0665481d1	Vary number of build threads based on number of cores Simply makes buildall.sh and the make_.sh commands use 2 ncores build threads. ncores includes logical CPUs. Change-Id: Ib3fbf1f1c8362c5bd3afab61f4d3030a50c51c10 Reviewed-on: http://gerrit.ent.cloudera.com:8080/288 Reviewed-by: Lenni Kuff <lskuff@cloudera.com> Tested-by: jenkins	2014-01-08 10:52:22 -08:00
Nong Li	4d89735ec5	Fix up bin/make_* scripts to use rm -f.	2014-01-08 10:51:14 -08:00
Nong Li	c4c2a7168d	Fix IR gen build dependency.	2014-01-08 10:48:52 -08:00
Nong Li	11191d6c82	Default bin/make_release to not use PGO.	2014-01-08 10:48:19 -08:00
Nong Li	65f4fd98e4	Move to LLVM 3.2	2014-01-08 10:47:23 -08:00
Henry Robinson	2f339f2ed8	Add ASL license to all public files	2014-01-08 10:46:32 -08:00
ishaan	ccb020c4a0	Adding copyrights to remaining files.	2014-01-08 10:46:30 -08:00
Nong Li	bc08241ffb	IR cross compile fixes for inlined string-value functions.	2014-01-08 10:46:19 -08:00
ishaan	e84cc0a9eb	Enable code coverage on release builds.	2014-01-08 10:44:41 -08:00
Michael Ubell	02d63d8dc3	Trevni file support	2014-01-08 10:44:19 -08:00
Lenni Kuff	0da77037e3	Updated Impala performance schema and test vector generation This change updates the Impala performance schema and test vector generation techniques. It also migrates the existing benchmark scripts that were Ruby over to use Python. The changes has a few parts: 1) Conversion of test vector generation and benchmark statement generation from Ruby to Python. A result of this was also to update the benchmark test vector and dimension files to be written in CSV format (python doesn't have built-in YAML support) 2) Standardize on the naming for benchmark tables to (somewhat match Query tests). In general the form is: * If file_format=text and compression=none, do not use a table suffix * Abbreviate sequence file as (seq) rc file as (rc) etc * If using BLOCK compression don't append anything to table name, if using 'record' append 'record' 3) Created a new way to adding new schemas. this is the benchmark_schema_template.sql file. The generate_benchmark_statements.py script reads this in and breaks up the sections. The section format is: ==== Data Set Name --- BASE table name --- CREATE STATEMENT Template --- INSERT ... SELECT * format --- LOAD Base statement --- LOAD STATEMENT Format Where BASE Table is a table the other file formats/compression types can be generated from. This would generally be a local file. The thinking is that if the files already exist in HDFS then we can just load the file directly rather than issue an INSERT ... SELECT * statement. The generate_benchmark_statements.py script has been updated to use this new template as well as query HDFS for each table to determine how it should be created. It then outputs an ideal file call load-benchmark-*-generated.sql. Since this file is geneated dynamically we can remove the old benchmark statement files. 4) This has been hooked into load-benchmark-data.sh and run_query has been updated to use the new format as well	2012-07-12 23:12:20 -07:00
Alan Choi	ad073ef1b2	IMP- 78 We want to expose issues in an distributed env locally. We already have 3 data nodes running locally in the MiniDFS. However, the planner does not distinguish data nodes on the same host, even though they're running on a different port. So, we're effectively only running a single node all the time. First, we make the change in FE to identify data location as "host/port" instead of just "host". Then, in TQueryExecRequest, we list the host/port that serves the data, instead of just using "host". The result is that PlannerTest and QueryTest exposes distributed planning issue. Plans are still correct when the number of node is 1 or 2. So, to make all the tests passes, I've forced Planner/Query test to execute with at most 2 nodes. To see the faulty plan, we simply have to change the number of node back to 0 (all nodes). o We've discussed randomizing the SimpleScheduler but I choose not to do it because we don't need randomization to expose the distributed planning issue. I also discovered that exchange node (BE) does not respect the "limit". I fixed it. One of the limit test (QueryTest) is completely unstable. It doesn't really test much. I removed it.	2012-06-22 14:08:44 -07:00
Lenni Kuff	0e844e7187	Updated make_release script to add flag for controlling whether or not to do PGO build	2012-06-12 17:52:27 -07:00
Michael Ubell	3608b3fb06	RC File rewrite	2012-05-22 20:37:47 -07:00
Lenni Kuff	35951643f5	Fixed benchmark generation scripts and make_release scripts to properly generate and execute the benchmark queries. Updated to demove Lzo compression and add coverage of 'DefaultCodec' Fixed up make_release to more cleanly list queries.	2012-05-17 17:40:00 -07:00
Michael Ubell	62d29ff1c6	Sequence File Scanner	2012-05-01 17:48:24 -07:00
Nong Li	17f7b16da8	Allow runquery to run multiple queries from the command line.	2012-03-01 10:47:34 -08:00
Nong Li	88237350f0	Change the build to allow debug and release builds to coexist.	2012-02-17 18:14:04 -08:00
Nong Li	bf74bc25e3	Some cleanup: - Fixed issue with SSE file parse. - Moved build scripts to impala/bin. Rebuilding from just BE does not work. - Cleanedup a few compiler warnings. - Add option to disable automatic counters for profilers.	2011-12-31 06:17:28 -08:00

21 Commits