Workspace restart button/kira pilot #7070
Conversation
@Kira-Pilot I think your approach here can work, but ultimately I wonder if we should take a slightly different one.
My thought is that we would introduce a new transition state for workspaces called `restart` (coder/codersdk/workspacebuilds.go, lines 14 to 20 in 4dd5d79):

```go
type WorkspaceTransition string

const (
	WorkspaceTransitionStart  WorkspaceTransition = "start"
	WorkspaceTransitionStop   WorkspaceTransition = "stop"
	WorkspaceTransitionDelete WorkspaceTransition = "delete"
)
```
The motivation behind a new state is that right now, the restart action is dependent on the Coder API and can be interrupted. Consider what happens if we issue a `restart` and begin by stopping: while the workspace is stopping, the coder server is restarted (or updated), and the workspace then remains in the stopped state. The new restart state would allow the full transition to be registered up front so that we can ensure its completion.
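A minimal sketch of what that could look like in codersdk; the `WorkspaceTransitionRestart` constant is the proposed addition, not part of the current API:

```go
package codersdk

type WorkspaceTransition string

const (
	WorkspaceTransitionStart  WorkspaceTransition = "start"
	WorkspaceTransitionStop   WorkspaceTransition = "stop"
	WorkspaceTransitionDelete WorkspaceTransition = "delete"
	// WorkspaceTransitionRestart is the proposed addition: recorded up
	// front so a stop-then-start sequence survives a coderd restart and
	// can be resumed to completion.
	WorkspaceTransitionRestart WorkspaceTransition = "restart"
)
```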
Do you have any thoughts on this @kylecarbs?
Side note: the abort transaction error mentioned in the GitHub issue can be fixed by increasing the build number for the "start" job.
Regarding the overall design, the "restart" transition is purely virtual, and could do the trick if the provisionerserver doesn't acquire jobs that are conflicting. Otherwise, `stop` and `start` are executed at the same time, and the "stop" job either fails or does nothing (screenshots omitted).
Fun fact: a similar inconsistency can be observed for the "start" job. It can just add a new resource, or add the resource and destroy the old one, depending on the race with the "stop" job (screenshots omitted).
This inconsistency is kind of funny, because for some (all?) providers like Docker it's absolutely fine to just call `/workspacebuilds` with `transition=start`. I'm afraid that to fully solve the racing challenge, you need to implement mutual exclusion between provisioner jobs, which means tweaking the provisionerserver (see the sketch below). I strongly recommend doing that in a separate PR.
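To make the mutual-exclusion idea concrete, here is a minimal in-memory sketch. All names here are hypothetical; in the real provisionerserver, job acquisition goes through the database, so the guard would belong in the acquire query rather than in process memory:

```go
package provisionerserver

import "sync"

// workspaceLocks is a hypothetical per-workspace guard: a provisioner
// daemon may only acquire a job if no other job for the same workspace
// is currently in flight.
type workspaceLocks struct {
	mu   sync.Mutex
	busy map[string]bool // workspace ID -> job in flight
}

func newWorkspaceLocks() *workspaceLocks {
	return &workspaceLocks{busy: make(map[string]bool)}
}

// tryAcquire reports whether the workspace was free and is now claimed.
// A false result means a conflicting job is running and this job should
// stay queued.
func (l *workspaceLocks) tryAcquire(workspaceID string) bool {
	l.mu.Lock()
	defer l.mu.Unlock()
	if l.busy[workspaceID] {
		return false
	}
	l.busy[workspaceID] = true
	return true
}

// release frees the workspace once its job completes or fails.
func (l *workspaceLocks) release(workspaceID string) {
	l.mu.Lock()
	defer l.mu.Unlock()
	delete(l.busy, workspaceID)
}
```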
@mafredri Please double-check if this all makes sense 👍
@mtojek great insights. I like your proposal of limiting the acquisition of conflicting jobs in the provisioner(s). One thing this makes me think of, though, is provisioner job dependencies: a job could depend on another job, meaning we wouldn't pick up a job if the job it depends on hasn't completed. A toggle for requiring success or simply completion (to continue after failure) of the depended-on job could also be added. (Motivation for the toggle: the possibility to implement "force restart".) A sketch of the idea follows below.
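As an illustration of that dependency idea, here is a minimal sketch; the fields and statuses are hypothetical, not the current provisioner job schema:

```go
package provisionerserver

// provisionerJob is a hypothetical, trimmed-down job record used only
// to illustrate job dependencies.
type provisionerJob struct {
	ID             string
	DependsOn      string // ID of the job this one waits for ("" = none)
	RequireSuccess bool   // if false, completion (even failure) is enough
}

type jobStatus int

const (
	statusPending jobStatus = iota
	statusRunning
	statusSucceeded
	statusFailed
)

// canAcquire reports whether a job is eligible to run given the status
// of the job it depends on. Setting RequireSuccess to false is what
// would enable a "force restart": the start job runs even if the stop
// job failed.
func canAcquire(job provisionerJob, statusOf func(id string) jobStatus) bool {
	if job.DependsOn == "" {
		return true
	}
	switch statusOf(job.DependsOn) {
	case statusSucceeded:
		return true
	case statusFailed:
		return !job.RequireSuccess
	default:
		return false // dependency still pending or running
	}
}
```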
It is a valid point and good motivation for feature expansion. I'm a bit concerned about a provisioner job being left in the queue forever (if the job it depends on failed), so maybe provisionerserver can pick it up and simply discard it.
Yeah, that's about how I'd imagine it would behave as well. Or rather than discard, mark the dependent job as failed too.
@mafredri and @mtojek - thanks so much for your feedback! I spoke with @sreya and we've decided to close this PR for now in favor of #7137, which is an entirely front-end solution. The concern is that this feature is quite small and does not warrant a provisioner update. That might change in the future, in which case I'd resurrect this implementation. Let me know if you have any concerns!
@Kira-Pilot sounds good. That is what the CLI currently does too, so we should deal with it at some point, but it doesn't have to be now. 👍🏻
Trying to address #5800 and #6241.

Need some help figuring out the best approach here. We've added a new transition, `restart`, and attempted to amend `api.postWorkspaceBuilds` such that if a `restart` is requested, we insert two workspace builds: one for `stop` and one for `start` (a rough sketch of that shape follows below). I'm worried Provisioner isn't set up to handle this scenario: @mafredri pointed out that if we create two jobs, we may have separate daemons execute each simultaneously.
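For reference, a hypothetical sketch of the two-build insert described above; the names and types are stand-ins, not the real coderd database layer. Giving each build its own consecutive number is also what avoids the abort-transaction error on the build-number constraint mentioned earlier:

```go
package coderd

// buildInserter is a hypothetical stand-in for the transactional
// database layer used by postWorkspaceBuilds.
type buildInserter interface {
	InsertBuild(workspaceID, transition string, buildNumber int) error
}

// insertRestartBuilds sketches how a restart request could enqueue a
// stop build and a start build atomically. Reusing a build number would
// violate the unique (workspace_id, build_number) constraint and abort
// the transaction, hence the +1 and +2.
func insertRestartBuilds(tx buildInserter, workspaceID string, latestBuildNumber int) error {
	if err := tx.InsertBuild(workspaceID, "stop", latestBuildNumber+1); err != nil {
		return err
	}
	return tx.InsertBuild(workspaceID, "start", latestBuildNumber+2)
}
```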