Cadvisor: Container restart count is a label

Created on 26 May 2016 · 15Comments · Source: google/cadvisor

The number of restarts is attached as a label to all metrics for a container. This causes all time series for a container to break as it restarts as identity is defined by the full set of labels.

The restart count is a time series itself and should only be exported as such.

Source

fabxc

👍5

Most helpful comment

Any news here? It would be useful to have container_restart_count metric to setup alerting when using bare docker engine without kubernetes.

caligo-mentis on 11 Oct 2019

👍5

All 15 comments

On Thu, May 26, 2016 at 9:05 AM, Fabian Reinartz [email protected]
wrote:

The number of restarts is attached as a label to all metrics for a
container. This causes all time series for a container to break as it
restarts as identity is defined by the full set of labels.

The restart count is a time series itself and should only be exported as
such.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
https://github.com/google/cadvisor/issues/1312

vishh on 26 May 2016

These are set by kubernetes: https://github.com/kubernetes/kubernetes/blob/b0ea89c2f6b954a27ba0f545da955db4f25009d0/pkg/kubelet/dockertools/labels.go#L37-L50

Any suggestions how to solve this? None of these labels make sense to export as metric labels. Filtering them in cAdvisor might be a bit awkward as it adds references back to Kubernetes. Changing this in Kubernetes might require quite some changes though.

grobie on 18 Aug 2016

I'm starting to understand what's going on here. Kubernetes attaches all these docker labels as a fallback to retrieve them later again. I guess the easiest would be to filter all labels which start with io.kubernetes.. What do you think @timstclair @vishh?

grobie on 18 Aug 2016

By the way, I believe cAdvisor shouldn't try to parse these labels and create a restart counter from it. This would be the job of Kubernetes, also as Docker containers don't have the concept of restarts.

grobie on 18 Aug 2016

Yes, the labels should all be treated as opaque by cAdvisor.

timstclair on 18 Aug 2016

Actually, this isn't a Kubernetes Label, but a Docker Label also (when using --restart=always directive):

docker inspect nginx1 |grep -i restartcount
        "RestartCount": 2,

This is a pure Docker 17.03 running, no Kubernetes. I'm going to check what's the impact to change cAdvisor code to also set a metric containing the RestartCount of the containers.

rikatz on 4 May 2017

kubectl describe po nginx1 | grep "Restart Count"
Restart Count: 0

Magikon on 12 Mar 2018

This is still not fixed for users who are using cadvisor directly. RestartCount is a label, which breaks everything and makes it pretty impossible to write alerts based on it (e.g. alert if restart happened more than X times in the last Y min)

fgrzadkowski on 26 Jun 2019

@fabxc @dashpole @vishh @timstclair

fgrzadkowski on 27 Jun 2019

Any news here? It would be useful to have container_restart_count metric to setup alerting when using bare docker engine without kubernetes.

caligo-mentis on 11 Oct 2019

👍5

Any news here?

OloBo-MSK on 22 Jan 2020

Any news?

Go-Pomegranate on 25 Mar 2020

cAdvisor is stateless, and does not track restarts of containers itself. Some container runtimes attach a restart count label to metrics, but labels are treated as opaque by cAdvisor as pointed out above. If the RestartCount label is causing problems, set --store_container_labels=false and use --whitelisted-container-labels to keep the ones you want.

dashpole on 25 Mar 2020

@dashpole I think the issue here is that folks want to use container_label_restartcount as a vector for alerting -- not that it is causing problems. It's just that as a label and not a vector, it is impossible to do so.

I understand Docker is what is exposing this, so maybe this is an issue to file on Docker itself? Or is there some future where cAdvisor could support remapping label values into vectors themselves?

slimsag on 14 Apr 2020

I have a very nasty hack for anyone wanting a container_restart_count metric that works today: https://gist.github.com/slimsag/85e06781eb0d4d35beee12916aefac5f

slimsag on 14 Apr 2020

Was this page helpful?

0 / 5 - 0 ratings