Sometimes, balancing raw query performance against scalability requires limiting the amount of resources, such as memory or CPU, used by a single query or group of queries. Impala can use several mechanisms that help to smooth out the load during heavy concurrent usage, resulting in faster overall query times and sharing of resources across Impala queries, MapReduce jobs, and other kinds of workloads across a cluster:
You can restrict the amount of memory Impala reserves during query execution by specifying the
-mem_limit
option for the impalad
daemon. See
Modifying Impala Startup Options for details. This limit applies only to the
memory that is directly consumed by queries; Impala reserves additional memory at startup, for example to
hold cached metadata.
For production deployments, implement resource isolation using your cluster management tool.