impala

mirror of https://github.com/apache/impala.git synced 2026-01-01 09:00:42 -05:00

Author	SHA1	Message	Date
Michael Ubell	0c4f025a5e	Fix loading of nulltable data, remove loading functional-planner data	2014-01-08 10:45:58 -08:00
Michael Ubell	bf57ae27a5	IMP-291 Read sequence file to next sync mark when; ragged columns	2014-01-08 10:45:57 -08:00
Michael Ubell	5f951ffc4a	Handle missing columns at the end of a row	2014-01-08 10:45:11 -08:00
Henry Robinson	e7348a209b	IMP-232: Parallel INSERT OVERWRITE	2014-01-08 10:45:04 -08:00
Nong Li	7c411da86c	Fixed schema template.	2014-01-08 10:44:41 -08:00
Nong Li	5b2621a401	Fix null table creation to work around Hive issue.	2014-01-08 10:44:41 -08:00
Nong Li	4c9c82910a	Text parser fix for columns off end.	2014-01-08 10:44:40 -08:00
Nong Li	4d0319d32b	Fix null string parsing.	2014-01-08 10:44:40 -08:00
Lenni Kuff	6e07e0b8d8	Added support for generating ANALYZE TABLE ... COMPUTE STATISTICS statements during data loading. Add support for generating ANALYZE TABLE ... COMPUTE STATISTICS statements to the data loading workflow. This allows for capturing simple table stats such as number of rows, number of partitions, and table size in bytes. These are stored in a new MySQL database with the same name as the metastore except with a '_Stats' suffix. If using Derby, the results are stored in a new Derby database.	2014-01-08 10:44:34 -08:00
Alan Choi	cbadb4eac4	When a scan range begins at the starting point of a tuple, we missed that tuple. This patch fixes this problem. review: 162	2014-01-08 10:44:24 -08:00
Michael Ubell	02d63d8dc3	Trevni file support	2014-01-08 10:44:19 -08:00
Lenni Kuff	84d91fca4f	Fix sequence file data loading for the alltypesmixedformat table. Moved this out of the data loading framework because it is kind of a special case. I will consider how we can update the framework to address mixed format tables.	2014-01-08 10:44:18 -08:00
Lenni Kuff	bf27a31f98	Move functional data loading to new framework + initial changes for workload directory structure. This change moves (almost) all the functional data loading to the new data loading framework. This removes the need for the create.sql, load.sql, and load-raw-data.sql files. Instead we just have the single schema template file: testdata/datasets/functional/functional_schema_template.sql This template can be used to generate the schema for all file formats and compression variations. It also should help make loading data easier. Now you can run: bin/load-impala-data.sh "query-test" "exhaustive" and get all data needed for running the query tests. This change also includes the initial changes for the new dataset/workload directory structure. The new structure looks like: testdata/workload <- Will contain query files and test vectors/dimensions; testdata/datasets <- Will contain the data files and schema templates. Note: This is the first part of the change to this directory structure - it's not yet complete.	2014-01-08 10:44:18 -08:00

63 Commits