Autoscaler: Support `custom Daemonset` Pods

Created on 15 Oct 2019 · 5Comments · Source: kubernetes/autoscaler

The Daemonset pod detection is currently done in the code by looking at the OwnerRef.Kind that reference ether a Daemonset, ReplicaSet, StatefulSet.

Internally at Datadog, we implemented thanks to CRD a custom Daemonset that aims to fix several issues that we have with the current Daemonset implementation.
Like the official Daemonset controller, our CustomDaemonset controller is creating and assigning on each Node a Pod, but in our case the OwnerRef.Kind is not Daemonset but CustomDaemonset, which has the consequence that the Autoscaler considers those pods as normal pods, and tries to control them for the cluster scaling and also the bin packing.

To allow the support of our use case, we are open to participating in the project. We see two possible implementations:

1) Add new configuration flag to provide several ApiGroup/Kind to treat as Daemonsets
2) Introduce a new Annotation on Pod to allow the Autoscaler to consider the Pod as a Daemonset pod. cluster-autoscaler.kubernetes.io/is-daemonset-pod: "true" (annotation name TBD)

1) Add new configuration flag to provide several `Daemonset` ApiGroup/Kind

Advantages:
- Users don’t need to update their PodTemplate with a new annotation.
Cons:
- Need to know all ApiGroup/Kind that needs to be considered as Daemonset.
- Requires to change the configuration of the Autoscaler deployment, which is not always possible in managed Kubernetes clusters like GKE.

2) Introduce a new Annotation on Pod to make the Autoscaler consider a Pod as a Daemonset pod.

Advantages:
- Better genericity compares to solution 1.
- No Autoscaler configuration change needed. That allows better support for managed Kubernetes clusters.
Cons:
- The user needs to add this annotation on each PodTemplate/Pod that is used as Daemonset.

Solution 2 is our preferred solution because it opens the support of more use cases. Also, it seems aligned with other use cases logic already implemented in the Autoscaler like:

Annotation for detecting Static Pods
Annotation for ignoring pods in the eviction logic: cluster-autoscaler.kubernetes.io/safe-to-evict: "true"

Currently with fix it with the following workaround: https://github.com/kubernetes/autoscaler/commit/25754c89d4d6cdbc84a1e5bf9f5e37b1e97a39ef

Feel free to comment on this issue, if you want more information about our use case, if you see corner cases or you think of another possible solution.

Source

clamoriniere

👍3

Most helpful comment

I wouldn't mind solution 2 if the autoscaler could retain its current ability to detect _DaemonSet_-managed pods. The reason is that some daemons we run come from upstream manifests that we'd have to patch—perhaps using _kustomize_—in order to place such an annotation on those pods.

seh on 21 Oct 2019

👍2

All 5 comments

It seems to me that approach 2) is less invasive and I do not see any hard reasons not to implement it. @MaciekPytel WDYT?
If we go for that it would be nice to clean up the getRequiredPodsForNode code. How it uses drain.GetPodsForDeletionOnNodeDrain under the hood is pretty disgusting.

losipiuk on 21 Oct 2019

👍1

seh on 21 Oct 2019

👍2

Option 2 SGTM. As mentioned there is precedent for using annotations in similar way in CA and I don't really see any downsides to adding it.

edit: I assume that annotation would work in addition to current DS pod detection, not replace it.

MaciekPytel on 21 Oct 2019

👍1

I wouldn't mind solution 2 if the autoscaler could retain its current ability to detect _DaemonSet_-managed pods. The reason is that some daemons we run come from upstream manifests that we'd have to patch—perhaps using _kustomize_—in order to place such an annotation on those pods.

Yeah - that is the idea. To add to what we have right now. Not replace current logic.

losipiuk on 22 Oct 2019

Thanks all for your feedback,
I will start implementing "Option 2" soon, since everyone is agree with it.

clamoriniere on 22 Oct 2019

Was this page helpful?

0 / 5 - 0 ratings

Related issues

AWS cloud provider tests take 120 seconds

losipiuk · 7Comments

CA tries to add pods to master instead of scaling a new node

pkelleratwork · 5Comments

[feature request] Optionally allow VPA to manage pod limits

davidquarles · 7Comments

Helm chart doesn't add service account annotations

lexsys27 · 5Comments

Bug: Disagreement between real scheduler and CA's simulation on whether a pod is schedulable

bpinske · 5Comments