This directory contains Impala test data sets. The directory layout is structured as follows:

  datasets/
    /_schema_template.sql
    //data files
    //data files

Where SF is the scale factor controlling data size. This allows for scaling the same schema to different sizes based on the target test environment.
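As a rough illustration of how a scale factor selects among copies of the same schema, the sketch below builds paths for one data set. It is only a sketch under stated assumptions: the layout shown here has lost its path placeholders, so the code assumes each data set has its own directory, that the schema template file is prefixed with the data set name, and that each scale factor is a subdirectory of the data set. The helper names, the data set name `example_ds`, and the scale factor `sf1` are hypothetical and do not come from this directory.

```python
from pathlib import PurePosixPath

# Assumed root directory name, taken from the layout described above.
DATASETS_ROOT = PurePosixPath("datasets")


def schema_template_path(dataset: str) -> PurePosixPath:
    """Return the assumed location of a data set's schema template.

    Assumption: the template lives in the data set's own directory and
    is named after the data set.
    """
    return DATASETS_ROOT / dataset / f"{dataset}_schema_template.sql"


def data_dir(dataset: str, scale_factor: str) -> PurePosixPath:
    """Return the assumed directory holding data files for one scale factor.

    Assumption: each scale factor is a subdirectory of the data set.
    """
    return DATASETS_ROOT / dataset / scale_factor


if __name__ == "__main__":
    # "example_ds" and "sf1" are made-up names used only for illustration.
    print(schema_template_path("example_ds"))
    print(data_dir("example_ds", "sf1"))
```

Under these assumptions, one schema template is shared by every scale factor of a data set, and only the data directory changes with the target test environment.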