Member-only story

Autoscaling Nodes in Kubernetes

12 min readDec 31, 2022

Continuing with our journey of Horizontal Scaling in Kubernetes, here is another blog which will focus on auto scaling of Nodes in Kubernetes cluster. This semantic is also referred to as Cluster AutoScaler (CA).

Please refer to this blog to get more context on how we managed Horizontal Pod Scaling in Kubernetes —

Autoscaling Pods in Kubernetes

If you are hosting your workload in a cloud environment, and your traffic pattern is fluctuating in nature (think…

waswani.medium.com

Though the concept is applicable across all major Cloud Providers, I will be using AWS Cloud provider with AWS EKS managed service as Kubernetes Platform for running the sample code.

To set the context once again — Assume as a Platform Team, you are running a Kubernetes Cluster in a cloud environment and the Application Teams hosting their services have asked you to ensure that their workload pods should automatically scale out as the traffic spikes. You make them aware of the concept of Horizontal Pod Scaling construct and the team implements it. Everyone is happy, but very soon you get into a situation where workload pods while scaling out are getting into Pending state as Kubernetes Cluster does not have enough resources available to schedule the Pods.

Autoscaling Nodes in Kubernetes

Autoscaling Pods in Kubernetes

If you are hosting your workload in a cloud environment, and your traffic pattern is fluctuating in nature (think…

Written by Naresh Waswani

No responses yet