Kubernetes Cluster Autoscaler

Automatically adjusts the size of a Kubernetes cluster based on resource demands.

Overview

The Cluster Autoscaler works by continuously monitoring the cluster for two main conditions:

Unschedulable pods due to insufficient resources
Underutilized nodes for an extended period

When it detects unschedulable pods, it scales up the cluster by adding nodes to accommodate the workload. Conversely, when it identifies underutilized nodes, it scales down the cluster by removing them after migrating their pods to other nodes. This process helps optimize resource usage, improve performance, and reduce costs by ensuring the cluster maintains just enough capacity to handle current workloads without manual intervention

Option 1: Collect metrics via Prometheus scrape annotation

When deploying the Cluster Autoscaler, you can enable Prometheus metrics by adding the following annotations to the Cluster Autoscaler Helm chart.

For more details see cluster-autoscaler Helm chart.

Option 2: Collect metrics via Prometheus scrape configuration

If you are using Prometheus to scrape metrics from your Kubernetes cluster, you can configure Prometheus to scrape the cluster autoscaler metrics by adding the following configuration to your Prometheus configuration file.

You need to verify that the pod labels in the scrape configuration match the name of the cluster-autoscaler pod.

K8s Cluster Autoscaler Overview

Dashboard with cluster-autoscaler metrics

[auto-scaler]

[k8s]

[prometheus]

Technology

Kubernetes Cluster Autoscaler

Overview

Overview

Setup

Option 1: Collect metrics via Prometheus scrape annotation

Option 2: Collect metrics via Prometheus scrape configuration

Dashboards

K8s Cluster Autoscaler Overview

Kubernetes

Technology

Kubernetes Cluster Autoscaler

Overview

Overview

Setup

Option 1: Collect metrics via Prometheus scrape annotation

Option 2: Collect metrics via Prometheus scrape configuration

Dashboards

K8s Cluster Autoscaler Overview

Related Integrations

Kubernetes