What happened?
Cluster DNS resolution isn't working on a nodegroup:
I deployed an EKS cluster as follows:
eksctl create cluster --name foo --tags "key=val" --region us-east-1 --zones us-east-1a,us-east-1b --nodegroup-name foo --node-type m5.large --nodes-min 2 --nodes-max 4 --ssh-access --ssh-public-key=foo --node-ami auto --node-private-networking --node-labels "partition=foo" --asg-access --cfn-role-arn arn:aws:iam::xyz
Then deployed a nodegroup as follows:
eksctl create nodegroup --cluster foo --region us-east-1 --name foo-bar --node-type m5.large --nodes 1 --nodes-min 1 --nodes-max 2 --ssh-access --ssh-public-key=foo --node-ami auto --node-private-networking --node-labels "partition=bar" --asg-access --cfn-role-arn arn:aws:iam::xyz
Then ran kubectl create -f with this yaml.
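Roughly, the pod spec is a plain busybox sleeper along these lines (an illustrative sketch; the nodeSelector on the partition label is what pins it to one nodegroup):

apiVersion: v1
kind: Pod
metadata:
  name: busybox
spec:
  nodeSelector:
    partition: foo        # switched to "bar" for the failing case below
  containers:
  - name: busybox
    image: busybox:1.28   # pin a tag with a working nslookup
    command: ["sleep", "3600"]
  restartPolicy: Always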
Then ran kubectl exec -ti busybox -- nslookup kubernetes.default to test DNS on a partition: foo node and the output is OK:
Server: 10.100.0.10
Address 1: 10.100.0.10 ip-10-100-0-10.ec2.internal
Name: kubernetes.default
Address 1: 10.100.0.1 ip-10-100-0-1.ec2.internal
But after modifying the pod yaml to run on a partition: bar node, the above command fails:
Server: 10.100.0.10
Address 1: 10.100.0.10
nslookup: can't resolve 'kubernetes.default'
command terminated with exit code 1
What you expected to happen?
I expected the pod on the "bar" node to be able to resolve cluster DNS.
How to reproduce it?
See above
Anything else we need to know?
eksctl 0.1.17 (latest) installed via homebrew on OSX Mojave.
Versions
Please paste in the output of these commands:
$ eksctl version
[ℹ] version.Info{BuiltAt:"", GitCommit:"", GitTag:"0.1.17"}
$ uname -a
Darwin dschott-mbp.local 18.2.0 Darwin Kernel Version 18.2.0: Mon Nov 12 20:24:46 PST 2018; root:xnu-4903.231.4~2/RELEASE_X86_64 x86_64
$ kubectl version
Client Version: version.Info{Major:"1", Minor:"13", GitVersion:"v1.13.1", GitCommit:"eec55b9ba98609a46fee712359c7b5b365bdd920", GitTreeState:"clean", BuildDate:"2018-12-13T19:44:19Z", GoVersion:"go1.11.2", Compiler:"gc", Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"11+", GitVersion:"v1.11.5-eks-6bad6d", GitCommit:"6bad6d9c768dc0864dab48a11653aa53b5a47043", GitTreeState:"clean", BuildDate:"2018-12-06T23:13:14Z", GoVersion:"go1.10.3", Compiler:"gc", Platform:"linux/amd64"}
Also include your version of heptio-authenticator-aws
weaveworks/tap/eksctl-aws-iam-authenticator: stable 0.3.0
A tool to use AWS IAM credentials to authenticate to a Kubernetes cluster
https://github.com/kubernetes-sigs/aws-iam-authenticator
/usr/local/Cellar/eksctl-aws-iam-authenticator/0.3.0 (3 files, 17.5MB) *
Built from source on 2018-12-26 at 09:10:51
From: https://github.com/weaveworks/homebrew-tap/blob/master/Formula/eksctl-aws-iam-authenticator.rb
Logs
See above
For the future, I wouldn't recommend using busybox or Alpine for any DNS tests, as there are differences in glibc vs musl behaviour; I would recommend testing with e.g. Ubuntu instead (see the one-liner below). Did you try not using --node-ami auto and just using the default AMI? I can see there was an AMI update, and I will certainly open a PR to add those new AMIs.
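For instance, something along these lines (just an illustrative one-liner; dnsutils provides dig):

kubectl run dns-test --rm -ti --restart=Never --image=ubuntu -- \
  bash -c "apt-get update -qq && apt-get install -y -qq dnsutils && dig +short kubernetes.default.svc.cluster.local"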
I really don't see any difference between the two nodegroups you have. I'm going to try reproducing it, but I cannot see how this could happen, unless it's a random flake that has nothing to do with the fact that there are two nodegroups. Also, for the future, we have --config-file now, and there is a multi-nodegroup example you might want to look at; a rough sketch is below.
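Something like this (field names per the ClusterConfig schema; adjust the apiVersion to whatever your eksctl version expects):

apiVersion: eksctl.io/v1alpha5   # may differ depending on eksctl version
kind: ClusterConfig
metadata:
  name: foo
  region: us-east-1
nodeGroups:
  - name: foo
    instanceType: m5.large
    minSize: 2
    maxSize: 4
    privateNetworking: true
    labels:
      partition: foo
  - name: foo-bar
    instanceType: m5.large
    desiredCapacity: 1
    minSize: 1
    maxSize: 2
    privateNetworking: true
    labels:
      partition: bar

and then: eksctl create cluster --config-file=cluster.yaml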
I can confirm that there is an issue. I just ran an Ubuntu image as a DaemonSet (a rough sketch of the manifest is below, after the output), and this is what I'm seeing.
[0] >> kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE
dns-test-2jwl5 1/1 Running 0 1m 192.168.123.159 ip-192-168-110-65.ec2.internal <none>
dns-test-kjmzm 1/1 Running 0 1m 192.168.83.26 ip-192-168-88-246.ec2.internal <none>
dns-test-mcktz 1/1 Running 0 1m 192.168.95.23 ip-192-168-76-4.ec2.internal <none>
[0] >> for i in dns-test-2jwl5 dns-test-kjmzm dns-test-mcktz ; do kubectl exec -ti $i dig -- +short kubernetes.default.svc.cluster.local ; done
10.100.0.1
10.100.0.1
;; connection timed out; no servers could be reached
command terminated with exit code 9
[9] >>
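For anyone who wants to reproduce this, a DaemonSet along these lines should do (roughly what I used; installing dnsutils is just one way to get dig into an Ubuntu image):

apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: dns-test
spec:
  selector:
    matchLabels:
      app: dns-test
  template:
    metadata:
      labels:
        app: dns-test
    spec:
      containers:
      - name: dns-test
        image: ubuntu:18.04
        # install dig, then stay alive so we can kubectl exec into each pod
        command: ["bash", "-c", "apt-get update && apt-get install -y dnsutils && sleep infinity"]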
I believe this is due to the fact that nodegroups are isolated in separate security groups, so pods on one nodegroup cannot reach the kube-dns pods running on the other; we need to discuss and consider a few relatively significant changes. Thanks a lot for reporting this. We might also want to cover this in the integration tests.
I will try to come up with a fix as soon as I can, as this implies that any cluster with more than one nodegroup has broken DNS.
If anyone needs to fix a running cluster in the meantime, you can patch the security groups of each of the nodegroups to allow ingress on TCP & UDP port 53 (for example, with something like the commands below). Alternatively, you can use only one nodegroup (no matter which one), or convert the DNS deployment into a daemonset.
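Roughly, with the AWS CLI (the security group IDs here are placeholders; open the ports into the SG of the nodegroup hosting the kube-dns pods, from the SG of the other nodegroup, and repeat for each pair as needed):

aws ec2 authorize-security-group-ingress --group-id sg-aaaa1111 --protocol tcp --port 53 --source-group sg-bbbb2222
aws ec2 authorize-security-group-ingress --group-id sg-aaaa1111 --protocol udp --port 53 --source-group sg-bbbb2222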
That was quick! Thanks so much @errordeveloper for looking into it.
Aiming to cut the release tomorrow.
0.1.18 is out now.