I believe that started around the time the switch to the new scale down processor happened.
The parameter goes into effect and puts scaledown in cooldown mode based on the lastScaleDownDeleteTime.
However, this is only set when the scale down status result is ScaleDownNodeDeleted.
And it seems with the switch to the async deletions, this is no longer set:
/kind bug
@MaciekPytel @towca
Should we be returning ScaleDownNodeDeleted instead of ScaleDownNodeDeleteStarted? It seems that the latter isn't used anywhere else, so that looks like the most straightforward fix.
Thanks for pointing this out Marwan! Although I'd stick to using ScaleDownNodeDeleteStarted since it better conveys what actually happens. I'll send out a fix shortly.
The bug should be fixed by #3570.
thanks @towca @marwanad for finding/fixing this issue! any idea when/if this will make it into a patch release for versions 1.16.x/1.17.x?
@MaciekPytel - maybe you know the answer to the above question?
@ryaneorth we can prepare the cherry-pick PRs. I'm guessing we'll have one set of patch releases before K8s 1.20 and another around 1.20.
Sounds great, thanks @marwanad . I'm happy to perform the cherry-picks if you'd like - let me know!
@ryaneorth that would be great, thanks!
Done!
All of the above cherry-picks are complete. @MaciekPytel - do you have any information as to when the next patches will be released?
@ryaneorth keep an 👀 on https://github.com/kubernetes/autoscaler/issues/3611