Need more visibility into agent issues when workspaces are running in kubernetes #17994
-
This is extremely similar to an issue that was already closed due to not enough information #15867. This yellow warning can appear for a number of reasons, and some of them are completely unrelated to the agent not being able to connect, but instead to do with the pod that the agent is connecting to being difficult to start up for a variety of issues. In the example I have screenshotted, the agent cannot connect to this pod (this is a k8s deployment) for 2 reasons:
The problem with this is that there is absolutely no visibility to this in the Coder UI, and the only way to know that this is happening is to have direct access to the cluster, and look at the events being raised against the pod. What would be ideal is to have a way to configure the expected startup of a workspace, based on internal knowledge about how long it could potentially take. Or - show the users exactly why the workspace is taking a while. Even just bubbling up the events on the pod the agent is trying to connect to would be an improvement. |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments
-
Part of #15423 |
Beta Was this translation helpful? Give feedback.
-
Hi @rhysduggan5, Thanks for submitting the issue
Both of these issues can be resolved by using the Kubernetes Logging integration. It's a small service that you deploy in the same cluster as the workspace pods. I agree that we need to improve the troubleshooting experience for other agent connection issues. It's a chicken-and-egg problem, as the agent is the one sending logs from the workspace to be displayed in the Dashboard. Without the agent starting, we have no visibility into what's happening |
Beta Was this translation helpful? Give feedback.
-
I have just installed the Kubernetes Logging Interaction service (that I found shortly after submitting this issue), and it has completely solved the problem. The good news here is that for my users, seeing a big orange box with no context is really poor, but seeing a black box with logs progressing is "mentally" a much more positive environment. Thanks for the tip! |
Beta Was this translation helpful? Give feedback.
-
I will move this to a discussion for others to discover. |
Beta Was this translation helpful? Give feedback.
Hi @rhysduggan5, Thanks for submitting the issue
Both of these issues can be resolved by using the Kubernetes Logging integration. It's a small service that you deploy in the same cluster as the workspace pods. I agree tβ¦