I found many discussions around healthcheck implementation, but I'm not sure whether it's already implemented.
Additional environment details (Operating System, Cloud provider, etc):
I'm running a v5 agent as a service on ECS. I'm going to upgrade to v6 and want to do things the right way.
Hi @oussemos, thanks for reaching out. Yes, the Docker image of agent v6 comes with a healthcheck, as you can see here. It calls the agent's health command, which you can test by running docker exec -it <agent-container-name> agent health.
Please let us know if you have any other questions.
@hkaj that link is now 404. Did it get moved?
I am facing a similar issue where the container running the datadog daemon is unhealthy. How can I debug and fix this?
Yeah, sorry @efenderbosch, I forgot to pin to a permalink. Here's the healthcheck definition, and its implementation.
@RohanKurane you can look at these files and run the health command yourself to check which component of the agent goes unhealthy. If that doesn't help, feel free to reach out to support!
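If you want to script that check rather than run it by hand, here is a minimal Python sketch. It assumes the docker CLI is on your PATH and that agent health exits non-zero when the agent is unhealthy (which is what a container healthcheck relies on). The function names and the injectable run parameter are hypothetical, added only for illustration; -it is dropped because a script has no TTY.

```python
import subprocess
from typing import Callable, List, Tuple


def build_health_command(container: str) -> List[str]:
    """Return the argv for running the agent's health command in a container."""
    # Same command as in the thread, minus -it (no TTY needed from a script).
    return ["docker", "exec", container, "agent", "health"]


def check_agent_health(container: str, run: Callable = subprocess.run) -> Tuple[bool, str]:
    """Run the health command and return (healthy, combined output).

    Assumption: a zero exit code means healthy, anything else unhealthy.
    `run` is injectable so the function can be exercised without Docker.
    """
    result = run(
        build_health_command(container),
        capture_output=True,
        text=True,
        timeout=30,
    )
    output = (result.stdout + result.stderr).strip()
    return result.returncode == 0, output
```

Calling check_agent_health("<agent-container-name>") returns a boolean plus whatever the command printed, which should point at the component that is failing.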
@RohanKurane if you are running AWS ECS, we just added this to our task definition:
"healthCheck": {
    "retries": 3,
    "command": [
        "CMD-SHELL",
        "agent health"
    ],
    "timeout": 5,
    "interval": 30,
    "startPeriod": 15
}
If the agent container fails this health check, ECS will just start a new one.
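If you generate task definitions from code, a small Python sketch like the one below can build the same healthCheck fragment and reject out-of-range values before you register the task. The helper name build_health_check is hypothetical, and the limits are the ranges I recall from the ECS documentation (interval 5-300 s, timeout 2-60 s, retries 1-10, startPeriod 0-300 s), so treat them as an assumption and double-check against the current docs.

```python
import json

# Assumed ECS limits for each field, as (minimum, maximum).
ECS_LIMITS = {
    "interval": (5, 300),
    "timeout": (2, 60),
    "retries": (1, 10),
    "startPeriod": (0, 300),
}


def build_health_check(command="agent health", interval=30, timeout=5,
                       retries=3, start_period=15):
    """Build a container healthCheck dict, validating values against ECS_LIMITS."""
    values = {
        "interval": interval,
        "timeout": timeout,
        "retries": retries,
        "startPeriod": start_period,
    }
    for key, (low, high) in ECS_LIMITS.items():
        if not low <= values[key] <= high:
            raise ValueError(f"{key}={values[key]} is outside {low}..{high}")
    # CMD-SHELL runs the command through the container's default shell.
    return {"command": ["CMD-SHELL", command], **values}


def to_json_fragment(health_check):
    """Serialize the fragment the way it appears in a task definition."""
    return json.dumps({"healthCheck": health_check}, indent=4)
```

With the defaults, build_health_check() reproduces the values from the snippet above, and to_json_fragment(build_health_check()) gives JSON you can paste into the container definition.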