-
Notifications
You must be signed in to change notification settings - Fork 41.5k
WIP: Don't assume that NodeName == Node host name #10663
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Fix bug I introduced when I renamed AWS nodes so that their names were their AWS instance ids, instance of a resolvable DNS name. Fix kubernetes#10612
Triage wrt 1.0? What happens if we set on this for a few weeks and merge On Wed, Jul 1, 2015 at 10:31 PM, Justin Santa Barbara <
|
GCE e2e build/test failed for commit f8a9211. |
So I definitely screwed up here. If we want logs/proxy/exec to work on AWS in 1.0, we should either merge this (or something like it), or revert #9728. I'm OK either way (and I'm so sorry; I don't understand how I didn't catch this earlier - I think I've been focused on the systemd work and incorrectly attributing some failures to that). This PR still needs a little more work because the 'proxy' command is currently using PodIP rather than the NodeName, but once this is complete I think this is the "more correct" approach. I'm working on that right now. But I think for 1.0 we can revert #9728 if that is less risky, though then there will be additional work needed in the AWS code to map names -> instances again. Another option would be to try to have this code only be triggered on AWS, although that feels like we're adding complexity. Long-term I like this PR because we might imagine a world where one day we have to use a tunnel to contact a node, or sometimes we can go direct via an internal IP, and sometimes we have to use an external IP. For the short-term (1.0) though, I don't think this is too risky (although I note that tests are currently failing), but I understand if we choose to revert #9728 instead. |
@justinsb: The pull request builder thinks you broke port forwarding and exec on GCE with this, so that's definitely your first gate. (Driveby) This seems somewhat risky to take at this point. |
I think the roll-forward is less risky than the roll-back at this point, but I am not sure it's worth the risk either way. I want second opinions, but I am inclined to just document that v1 is broken in some regards on AWS and that v1.0.1 will fix it. @quinton-hoole because we discussed the importance of AWS support. |
@bgrant0607 we should make a decision on this ASAP. |
I am going to vote against my own PR here. Although this PR fixes logs, it looks like it does not fix exec / proxy, because they fall foul of the SSL certificate (which is by node-name). Fixing that would be very invasive I think. I am preparing a patch that rolls back just the AWS renaming portion of #9728 (i.e. the minimal rollback). It will have also to map names to instance ids in a few places. I think this will be confined to AWS. Sorry for the mess. Hopefully we can clean up node-names in 1.1 |
Thanks Justin. Reasonable as always. On Thu, Jul 2, 2015 at 9:25 AM, Justin Santa Barbara <
|
I'm with @justinsb on this. The minimal roll-back seems the most sensible option right now. |
Will hang tight for a minimal rollback.. On Thu, Jul 2, 2015 at 11:06 AM, Nikhil Jindal [email protected]
|
We merged #10699 instead (thanks!). Closing this one. |
Fix bug I introduced when I renamed AWS nodes so that their names
were their AWS instance ids, instance of a resolvable DNS name.
Fix #10612