mirror of
https://github.com/apache/impala.git
synced 2025-12-30 03:01:44 -05:00
This change moves (almost) all of the functional data loading to the new data loading framework. This removes the need for the create.sql, load.sql, and load-raw-data.sql files. Instead there is a single schema template file:
testdata/datasets/functional/functional_schema_template.sql
This template can be used to generate the schema for all file formats and compression variations. It should also make loading data easier. You can now run:
bin/load-impala-data.sh "query-test" "exhaustive"
to get all the data needed for running the query tests.
This change also includes the initial changes for the new dataset/workload directory structure. The new structure looks like:
testdata/workload <- Will contain query files and test vectors/dimensions
testdata/datasets <- Will contain the data files and schema templates
Note: This is the first part of the change to this directory structure; it is not yet complete.
10 lines
384 B
Plaintext
This directory contains Impala test data sets. The directory layout is structured as follows:

datasets/
   <data set>/<data set>_schema_template.sql
   <data set>/<data files SF1>/data files
   <data set>/<data files SF2>/data files

Where SF is the scale factor controlling data size. This allows for scaling the same schema to
different sizes based on the target test environment.
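
As a rough illustration of the layout described above, the following Python sketch walks a datasets/ directory and reports, for each data set, whether its schema template exists and which data-file directories sit beside it. It is not part of the Impala tooling. The helper names (describe_datasets, build_demo_tree) and the directory names data_sf1 and data_sf2 are made up for the example; the only convention taken from this README is the <data set>/<data set>_schema_template.sql naming.

```python
import tempfile
from pathlib import Path


def describe_datasets(datasets_root):
    """Map each data set name to its schema template (or None) and its subdirectories.

    Hypothetical helper: every subdirectory of a data set is treated as a
    data-file directory, since the README does not fix a naming scheme for them.
    """
    layout = {}
    for dataset_dir in sorted(Path(datasets_root).iterdir()):
        if not dataset_dir.is_dir():
            continue
        # Convention from the README: <data set>/<data set>_schema_template.sql
        template = dataset_dir / f"{dataset_dir.name}_schema_template.sql"
        layout[dataset_dir.name] = {
            "schema_template": template if template.is_file() else None,
            "data_dirs": sorted(p.name for p in dataset_dir.iterdir() if p.is_dir()),
        }
    return layout


def build_demo_tree(root):
    """Create a small throwaway tree that mimics the layout (names are assumptions)."""
    datasets = Path(root) / "datasets"
    functional = datasets / "functional"
    for sf_dir in ("data_sf1", "data_sf2"):  # assumed scale-factor directory names
        (functional / sf_dir).mkdir(parents=True)
    (functional / "functional_schema_template.sql").write_text("-- placeholder\n")
    return datasets


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for name, info in describe_datasets(build_demo_tree(tmp)).items():
            print(name, info["schema_template"], info["data_dirs"])
```

Pointing describe_datasets at a real checkout's testdata/datasets directory should work the same way, provided the directory follows the layout above.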