PARQUET_ANNOTATE_STRINGS_UTF8 Query Option (Impala 2.6 or higher only)
Causes Impala INSERT and CREATE TABLE AS SELECT statements to write Parquet files that use the UTF-8 annotation for STRING columns.
By default, Impala represents a STRING column in Parquet as an unannotated binary field.
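As a minimal sketch of how the option might be used in an impala-shell session, the statements below enable it and then write a Parquet table with CREATE TABLE AS SELECT. The table and column names (events_text, events_parquet, id, name) are hypothetical and only serve as illustration.

```sql
-- Enable the option for the current session.
SET PARQUET_ANNOTATE_STRINGS_UTF8=true;

-- Hypothetical tables: events_text is an existing source table with a
-- STRING column called name; events_parquet is the new Parquet table.
CREATE TABLE events_parquet
  STORED AS PARQUET
AS SELECT id, name FROM events_text;
```

With the option enabled, the name column in the resulting Parquet files should carry the UTF-8 annotation instead of being written as an unannotated binary field.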
Impala always uses the UTF-8 annotation when writing CHAR and VARCHAR columns to Parquet files. An alternative to using the query option is to cast STRING values to VARCHAR.
This option is to help make Impala-written data more interoperable with other data processing engines. Impala itself currently does not support all operations on UTF-8 data. Although data processed by Impala is typically represented in ASCII, it is valid to designate the data as UTF-8 when storing on disk, because ASCII is a subset of UTF-8.
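The casting alternative can be sketched as follows, leaving the query option at its default. The table and column names and the VARCHAR length of 255 are assumptions chosen for illustration; pick a length that fits your data, because values longer than the declared length are truncated by the cast.

```sql
-- Hypothetical tables and columns; the query option is not set here.
-- The name column is written as VARCHAR, which Impala annotates as UTF-8.
CREATE TABLE events_parquet_varchar
  STORED AS PARQUET
AS SELECT id, CAST(name AS VARCHAR(255)) AS name
FROM events_text;
```

This approach changes the column type in the new table from STRING to VARCHAR, so it suits cases where only a few columns need the annotation and a length limit is acceptable.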
Type: Boolean; recognized values are 1 and 0, or true and false; any other value interpreted as false
Default: false (shown as 0 in output of SET statement)
Added in: Impala 2.6.0