mirror of
https://github.com/apache/impala.git
synced 2025-12-29 18:01:07 -05:00
This is a simpler alternative to bootstrap_development.sh - it acquires enough dependencies to build, but does not attempt to load the test data or even build the tests. This is sometimes a lightweight testing method used by Apache PPMC members who are voting on a release of an incubating project. Change-Id: If34e398052a61dfda9825b1cf3a918eb61736048 Reviewed-on: http://gerrit.cloudera.org:8080/5154 Reviewed-by: Jim Apple <jbapple-impala@apache.org> Tested-by: Impala Public Jenkins
1.5 KiB
1.5 KiB
Welcome to Impala
Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters.
Impala is a modern, massively-distributed, massively-parallel, C++ query engine that lets you analyze, transform and combine data from a variety of data sources:
- Best of breed performance and scalability.
- Support for data stored in HDFS, Apache HBase and Amazon S3.
- Wide analytic SQL support, including window functions and subqueries.
- On-the-fly code generation using LLVM to generate CPU-efficient code tailored specifically to each individual query.
- Support for the most commonly-used Hadoop file formats, including the Apache Parquet (incubating) project.
- Apache-licensed, 100% open source.
More about Impala
To learn more about Impala as a business user, or to try Impala live or in a VM, please visit the Impala homepage.
If you are interested in contributing to Impala as a developer, or learning more about Impala's internals and architecture, visit the Impala wiki.
Supported Platforms
Impala only supports Linux at the moment.
Build Instructions
See bin/bootstrap_build.sh.
Export Control Notice
This distribution uses cryptographic software and may be subject to export controls. Please refer to EXPORT_CONTROL.md for more information.