Apache Impala Guide
Introducing Apache Impala
Concepts and Architecture
Components
Developing Applications
Role in the Hadoop Ecosystem
Deployment Planning
Requirements
Designing Schemas
Installing Impala
Managing Impala
Post-Installation Configuration for Impala
Configuring Impala to Work with ODBC
Configuring Impala to Work with JDBC
Upgrading Impala
Starting Impala
Modifying Impala Startup Options
Tutorials
Administration
Admission Control and Query Queuing
Resource Management for Impala
Setting Timeouts
Load-Balancing Proxy for HA
Managing Disk Space
Impala Security
Security Guidelines for Impala
Securing Impala Data and Log Files
Installation Considerations for Impala Security
Securing the Hive Metastore Database
Securing the Impala Web User Interface
Configuring TLS/SSL for Impala
Enabling Sentry Authorization for Impala
Impala Authentication
Enabling Kerberos Authentication for Impala
Enabling LDAP Authentication for Impala
Using Multiple Authentication Methods with Impala
Configuring Impala Delegation for Hue and BI Tools
Auditing
Viewing Lineage Info
SQL Reference
Comments
Data Types
ARRAY Complex Type (Impala 2.3 or higher only)
BIGINT
BOOLEAN
CHAR
DECIMAL
DOUBLE
FLOAT
INT
MAP Complex Type (Impala 2.3 or higher only)
REAL
SMALLINT
STRING
STRUCT Complex Type (Impala 2.3 or higher only)
TIMESTAMP
TINYINT
VARCHAR
Complex Types (Impala 2.3 or higher only)
Literals
SQL Operators
Schema Objects and Object Names
Aliases
Databases
Functions
Identifiers
Tables
Views
SQL Statements
DDL Statements
DML Statements
ALTER TABLE
ALTER VIEW
COMPUTE STATS
CREATE DATABASE
CREATE FUNCTION
CREATE ROLE
CREATE TABLE
CREATE VIEW
DELETE
DESCRIBE
DROP DATABASE
DROP FUNCTION
DROP ROLE
DROP STATS
DROP TABLE
DROP VIEW
EXPLAIN
GRANT
INSERT
INVALIDATE METADATA
LOAD DATA
REFRESH
REVOKE
SELECT
Joins
ORDER BY Clause
GROUP BY Clause
HAVING Clause
LIMIT Clause
OFFSET Clause
UNION Clause
Subqueries
TABLESAMPLE Clause
WITH Clause
DISTINCT Operator
SET
Query Options for the SET Statement
ABORT_ON_ERROR
ALLOW_UNSUPPORTED_FORMATS
APPX_COUNT_DISTINCT
BATCH_SIZE
BUFFER_POOL_LIMIT
COMPRESSION_CODEC
COMPUTE_STATS_MIN_SAMPLE_SIZE
DEBUG_ACTION
DECIMAL_V2
DEFAULT_JOIN_DISTRIBUTION_MODE
DEFAULT_SPILLABLE_BUFFER_SIZE
DISABLE_CODEGEN
DISABLE_ROW_RUNTIME_FILTERING
DISABLE_STREAMING_PREAGGREGATIONS
DISABLE_UNSAFE_SPILLS
EXEC_SINGLE_NODE_ROWS_THRESHOLD
EXEC_TIME_LIMIT_S
EXPLAIN_LEVEL
HBASE_CACHE_BLOCKS
HBASE_CACHING
LIVE_PROGRESS
LIVE_SUMMARY
MAX_ERRORS
MAX_ROW_SIZE
MAX_NUM_RUNTIME_FILTERS
MAX_SCAN_RANGE_LENGTH
MEM_LIMIT
MIN_SPILLABLE_BUFFER_SIZE
MT_DOP
NUM_NODES
NUM_SCANNER_THREADS
OPTIMIZE_PARTITION_KEY_SCANS
PARQUET_COMPRESSION_CODEC
PARQUET_ANNOTATE_STRINGS_UTF8
PARQUET_ARRAY_RESOLUTION
PARQUET_FALLBACK_SCHEMA_RESOLUTION
PARQUET_FILE_SIZE
PREFETCH_MODE
QUERY_TIMEOUT_S
REQUEST_POOL
REPLICA_PREFERENCE
RUNTIME_BLOOM_FILTER_SIZE
RUNTIME_FILTER_MAX_SIZE
RUNTIME_FILTER_MIN_SIZE
RUNTIME_FILTER_MODE
RUNTIME_FILTER_WAIT_TIME_MS
S3_SKIP_INSERT_STAGING
SCHEDULE_RANDOM_REPLICA
SCRATCH_LIMIT
SHUFFLE_DISTINCT_EXPRS
SUPPORT_START_OVER
SYNC_DDL
SHOW
TRUNCATE TABLE
UPDATE
UPSERT
USE
Optimizer Hints
Built-In Functions
Mathematical Functions
Bit Functions
Type Conversion Functions
Date and Time Functions
Conditional Functions
String Functions
Miscellaneous Functions
Aggregate Functions
APPX_MEDIAN
AVG
COUNT
GROUP_CONCAT
MAX
MIN
NDV
STDDEV, STDDEV_SAMP, STDDEV_POP
SUM
VARIANCE, VARIANCE_SAMP, VARIANCE_POP, VAR_SAMP, VAR_POP
Analytic Functions
Impala User-Defined Functions (UDFs)
SQL Differences Between Impala and Hive
Porting SQL
The Impala Shell
Configuration Options
Connecting to impalad
Running Commands and SQL Statements
Command Reference
Performance Tuning
Performance Best Practices
Join Performance
Table and Column Statistics
Benchmarking
Controlling Resource Usage
Runtime Filtering
HDFS Caching
Testing Impala Performance
EXPLAIN Plans and Query Profiles
HDFS Block Skew
Scalability Considerations
Partitioning
File Formats
Text Data Files
Parquet Data Files
Avro Data Files
RCFile Data Files
SequenceFile Data Files
Using Impala to Query Kudu Tables
HBase Tables
S3 Tables
ADLS Tables
Isilon Storage
Logging
Troubleshooting Impala
Web User Interface
Breakpad Minidumps
Ports Used by Impala
Impala Reserved Words
Impala Frequently Asked Questions
Impala Release Notes
New Features in Apache Impala
Incompatible Changes and Limitations in Apache Impala
Known Issues and Workarounds in Impala
Fixed Issues in Apache Impala