feat: Add workspace agent lifecycle state reporting #5785

mafredri · 2023-01-19T09:34:10Z

This PR adds workspace agent lifecycle state reporting as an additional property on the agent.

Ref: #5749

Relies on terraform-provider-coder v0.6.7 (feat: Add startup_script_timeout and delay_login_until_ready terraform-provider-coder#84)
Introduce new agent property describing workspace agent state, WorkspaceAgentLifecycle: created, starting, start_timeout, start_error, ready
Add new database fields to workspace_agents (startup_script_timeout_seconds, delay_login_until_ready, lifecycle_state)
Initial state is created, only changes once the agent reports a new state
- An agent that never connects will only ever return created status, since we simply don't know
- (Implementation detail) State changes can be missed if the agent loses connection and any state is allowed to move to any other state (e.g. ready -> agent crashes and restarts -> starting -> ...)
Plumbing for provider values -> database

Examples uses:

If state is starting and delay_login_until_ready = true, keep users waiting e.g. during ssh coder.workspace (loading indicator, stream startup log, etc.)
If state is start_timeout (or error), show users a warning
If startup script exits with non-zero status, the start_error state lets us direct users towards inspecting the startup log

agent/agent.go

coderd/workspaceagents_test.go

agent/agent.go

mtojek · 2023-01-24T10:12:56Z

agent/agent.go

+		if metadata.GitAuthConfigs > 0 {
+			err := gitauth.OverrideVSCodeConfigs(a.filesystem)
+			if err != nil {
+				a.logger.Warn(ctx, "failed to override vscode git auth configs", slog.Error(err))


nit: should the agent quit here?

We probably shouldn't, this could fail for a multitude of reasons, like a user setting chmod 0000 on a folder. The worst that can happen is a degraded user experience (git auth in vscode not working). Ultimately we might want to:

Continue executing startup script after this

Set lifecycle to start_error because this failed

Result (error message) must be visible in build/startup logs

The worst that can happen is a degraded user experience (git auth in vscode not working).

Yes, that's the situation I was considering. What can user do in this case? Restart the workspace until it works?

If the change was done before, failure here doesn't really matter either (btw). Restarting will most likely not help, unless it's a problem mounting FS or similar. They'll most likely need to resolve the issue in the workspace, then restart, or create a new one.

For now, I've only added a note. I'll revisit this behavior when I add startup log streaming 👍🏻.

coderd/database/migrations/000091_add_workspace_agent_state.up.sql

coderd/workspaceagents.go

mtojek

Ship it!

mafredri self-assigned this Jan 19, 2023

mafredri mentioned this pull request Jan 19, 2023

Change agent startup script behavior from being never-ending to indicating the workspace is ready on end #5749

Closed

13 tasks

mafredri added 7 commits January 23, 2023 11:15

feat: Add new migrations

40e9c87

Generate database changes

5abf555

feat: Add agent state reporting

a84e467

Generate docs

8119e71

WIP

515e342

Rename state -> lifecycle state

1e53635

Add tests, improve lifecycle reporting

534f954

mafredri force-pushed the mafredri/feat-add-workspace-agent-ready-status branch from ec6def8 to 534f954 Compare January 23, 2023 13:08

Fix lint

90192e5

mafredri changed the title ~~feat: Add workspace agent readyness state reporting~~ feat: Add workspace agent lifecycle state reporting Jan 23, 2023

mafredri added 3 commits January 23, 2023 14:05

test: Update terraform tests for provider v0.6.7

aaf19bc

test(site): Update agent mocks

bd1a87f

Add new columns to workspace agent table

df9944e

mafredri force-pushed the mafredri/feat-add-workspace-agent-ready-status branch from 9d03a6a to e2dda7d Compare January 23, 2023 15:47

Fix plumbing

60f414f

mafredri force-pushed the mafredri/feat-add-workspace-agent-ready-status branch from e2dda7d to 60f414f Compare January 23, 2023 15:56

Run OverrideVSCodeConfigs even on error

087070e

mafredri marked this pull request as ready for review January 23, 2023 16:42

mafredri requested a review from a team as a code owner January 23, 2023 16:42

mafredri requested review from Kira-Pilot, kylecarbs, BrunoQuaresma and a team and removed request for a team and Kira-Pilot January 23, 2023 16:42

Use consistent Seconds terminology

c6928e9

kylecarbs reviewed Jan 23, 2023

View reviewed changes

agent/agent.go Show resolved Hide resolved

agent/agent.go Outdated Show resolved Hide resolved

coderd/workspaceagents_test.go Outdated Show resolved Hide resolved

Fix s/continue/break/

c132d4e

mafredri added 2 commits January 23, 2023 17:06

Fix git auth override order

903e850

Fix nit

034a850

kylecarbs approved these changes Jan 23, 2023

View reviewed changes

Publish workspace update

f76247f

mtojek approved these changes Jan 24, 2023

View reviewed changes

mafredri added 4 commits January 24, 2023 10:43

Hide report lifecycle endpoint from apidocs

2ccdcac

Set all existing agents to ready

830f9f1

Add debug logging to lifecycle states

2b1569f

Add note about error during vscode git auth override

71706e0

mtojek self-requested a review January 24, 2023 11:41

Fix typo in tests

21a9d28

mtojek approved these changes Jan 24, 2023

View reviewed changes

Merge branch 'main' into mafredri/feat-add-workspace-agent-ready-status

da67c73

mafredri merged commit 138887d into main Jan 24, 2023

mafredri deleted the mafredri/feat-add-workspace-agent-ready-status branch January 24, 2023 12:24

github-actions bot locked and limited conversation to collaborators Jan 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add workspace agent lifecycle state reporting #5785

feat: Add workspace agent lifecycle state reporting #5785

Uh oh!

mafredri commented Jan 19, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mtojek Jan 24, 2023

Uh oh!

mafredri Jan 24, 2023

Uh oh!

mtojek Jan 24, 2023

Uh oh!

mafredri Jan 24, 2023

Uh oh!

mafredri Jan 24, 2023

Uh oh!

Uh oh!

Uh oh!

mtojek left a comment

Uh oh!

Uh oh!

feat: Add workspace agent lifecycle state reporting #5785

feat: Add workspace agent lifecycle state reporting #5785

Uh oh!

Conversation

mafredri commented Jan 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mtojek Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

mafredri Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

mtojek Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

mafredri Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

mafredri Jan 24, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mtojek left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mafredri commented Jan 19, 2023 •

edited

Loading