
Conversation

@xpivarc
Member

@xpivarc xpivarc commented Sep 16, 2025

What this PR does

Signal delivery can actually drop a signal
if multiple signals are generated while the signal is blocked.
In this case only one signal is delivered after it is unblocked. Because we do a single Wait4 per signal, we can miss a process being terminated.
Sometimes the ordering happens to be unfortunate and the virt-launcher process is missed and never cleaned up.

This causes the virt-launcher to hang around indefinitely.

Therefore this commit tries to reap as many processes as possible per signal.

This was observed upon successful migration where both source and target Pods continued to be running, see:

ps aux
USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
qemu           1  0.0  0.0 1679844 13328 ?       Ssl  Aug21   0:00 /usr/bin/virt-launcher-monitor --qemu-timeout 248s --name rhel9-3195 --uid 0db4e8a0-c0e4-4e95-9b1c-bd7f2a16c8d4 --namespace vm-ns-32 --kubevirt-
qemu           8  0.0  0.0      0     0 ?        Z    Aug21  13:51 [virt-launcher] <defunct>
qemu         444  0.5  0.0   4452  2688 pts/0    Ss   09:19   0:00 bash
qemu         445  0.0  0.0   7032  2688 pts/0    R+   09:19   0:00 ps aux

and logs:

{"component":"virt-launcher","level":"info","msg":"Exiting...","pos":"virt-launcher.go:513","timestamp":"2025-08-29T18:18:58.885293Z"}
{"component":"virt-launcher-monitor","level":"info","msg":"Reaped pid 19 with status 9","pos":"virt-launcher-monitor.go:202","timestamp":"2025-08-29T18:18:58.886212Z"}
{"component":"virt-launcher-monitor","level":"info","msg":"Reaped pid 18 with status 9","pos":"virt-launcher-monitor.go:202","timestamp":"2025-08-29T18:18:58.889628Z"}

Links to places where the discussion took place:

Special notes for your reviewer

It is not clear to me whether the Go runtime suspends the signal (makes it non-blocking) before the signal is sent to the channel, and therefore whether there is no race.

Release note

Bug fix: virt-launcher is properly reaped

@kubevirt-bot kubevirt-bot added release-note Denotes a PR that will be considered when it comes time to generate release notes. dco-signoff: yes Indicates the PR's author has DCO signed all their commits. labels Sep 16, 2025

@sourcery-ai sourcery-ai bot left a comment


Hey there - I've reviewed your changes and they look great!



@fossedihelm
Contributor

Could it be a fix for #15373?

@Barakmor1
Member

Happy to see the additional logs and artifacts ended up being useful.

/lgtm

@kubevirt-bot kubevirt-bot added the lgtm Indicates that a PR is ready to be merged. label Sep 16, 2025
Comment on lines 128 to 130
if wpid == 0 {
log.Log.Infof("No more processes to be reaped")
break
Member


nit: maybe we should add a debug log in case wpid < 0

Member


tbh, I think when syscall.Wait4 returns -1 it also returns a non-nil err - and that is already logged...
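
For reference, a quick sketch of the three Wait4 outcomes being discussed here (illustrative only, assuming it runs inside the existing SIGCHLD handler):

var wstatus syscall.WaitStatus
wpid, err := syscall.Wait4(-1, &wstatus, syscall.WNOHANG, nil)
switch {
case err != nil:
	// wpid is -1; err is e.g. syscall.ECHILD when there are no children at all
case wpid == 0:
	// children still exist, but none have exited yet - nothing to reap right now
default:
	// wpid > 0: a child was reaped and wstatus holds its exit status
}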

@vladikr
Member

vladikr commented Sep 16, 2025

It is not clear to me whether the Go runtime suspends the signal (makes it non-blocking) before the signal is sent to the channel, and therefore whether there is no race.

I think the new loop already eliminates any possible race since it reaps all the waiting child processes in one go, even if only one SIGCHLD was sent.
/approve

@kubevirt-bot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: vladikr

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kubevirt-bot kubevirt-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 16, 2025
@vladikr
Member

vladikr commented Sep 16, 2025

/hold

@kubevirt-commenter-bot

Required labels detected, running phase 2 presubmits:
/test pull-kubevirt-e2e-k8s-1.31-windows2016
/test pull-kubevirt-e2e-kind-1.33-vgpu
/test pull-kubevirt-e2e-kind-sriov
/test pull-kubevirt-e2e-k8s-1.33-ipv6-sig-network
/test pull-kubevirt-e2e-k8s-1.32-sig-network
/test pull-kubevirt-e2e-k8s-1.32-sig-storage
/test pull-kubevirt-e2e-k8s-1.32-sig-compute
/test pull-kubevirt-e2e-k8s-1.32-sig-operator
/test pull-kubevirt-e2e-k8s-1.33-sig-compute-serial
/test pull-kubevirt-e2e-k8s-1.33-sig-network
/test pull-kubevirt-e2e-k8s-1.33-sig-storage
/test pull-kubevirt-e2e-k8s-1.33-sig-compute
/test pull-kubevirt-e2e-k8s-1.33-sig-operator

@kubevirt-bot kubevirt-bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 16, 2025
@Barakmor1
Member

It is not clear to me whether the Go runtime suspends the signal (makes it non-blocking) before the signal is sent to the channel, and therefore whether there is no race.

I'm not completely sure, but I think the race happens when multiple child processes finish before the first signal is received. Since signals are just flags and aren't queued, we only get one signal, even if more than one child has exited. That could explain the issue, though it's a rare case.

Signal delivery can actually drop a signal
if multiple signals are generated while the signal
is blocked.
In this case only one signal is delivered after it is unblocked.
Because we do a single Wait4 per signal we can miss
a process being terminated.
Sometimes the ordering happens to be unfortunate and
the virt-launcher process is missed and never cleaned up.

This causes the virt-launcher to hang around indefinitely.

Therefore this commit tries to reap as many processes as possible per
signal.

Signed-off-by: Luboslav Pivarc <[email protected]>
@kubevirt-bot kubevirt-bot removed the lgtm Indicates that a PR is ready to be merged. label Sep 17, 2025
@xpivarc
Member Author

xpivarc commented Sep 17, 2025

It is not clear to me whether the Go runtime suspends the signal (makes it non-blocking) before the signal is sent to the channel, and therefore whether there is no race.

I'm not completely sure, but I think the race happens when multiple child processes finish before the first signal is received. Since signals are just flags and aren't queued, we only get one signal, even if more than one child has exited. That could explain the issue, though it's a rare case.

Yes, this is exactly what is happening, but the thing I describe is the abstraction that Go adds. It matters whether the signal is processed first and then the sig is sent to the channel, or whether the sig can be sent to the channel before the signal is processed. In the latter case we can end the loop, a new child dies, and we miss it.

Anyway, I did a small test:

diff --git a/pkg/virt-launcher/monitor_test.go b/pkg/virt-launcher/monitor_test.go
index e2879b4963..97cfd3ecdb 100644
--- a/pkg/virt-launcher/monitor_test.go
+++ b/pkg/virt-launcher/monitor_test.go
@@ -21,7 +21,9 @@ package virtlauncher
 
 import (
 	"flag"
+	"os"
 	"os/exec"
+	"os/signal"
 	"path/filepath"
 	"strings"
 	"syscall"
@@ -29,6 +31,7 @@ import (
 
 	. "github.com/onsi/ginkgo/v2"
 	. "github.com/onsi/gomega"
+	"kubevirt.io/client-go/log"
 
 	"github.com/google/uuid"
 )
@@ -108,8 +111,58 @@ var _ = Describe("VirtLauncher", func() {
 	AfterEach(func() {
 		if processStarted {
 			stopProcess()
+			_ = cmd.Wait()
 		}
-		_ = cmd.Wait()
+
+	})
+
+	FIt("t", func() {
+		start := func() *exec.Cmd {
+			cmd := exec.Command(fakeQEMUBinary, "--uuid", uuid.New().String(), "--pidfile", filepath.Join(pidDir, "fakens_fakevmi.pid"))
+			err := cmd.Start()
+			ExpectWithOffset(1, err).ToNot(HaveOccurred(), "command failed to start")
+			return cmd
+		}
+
+		reap := make(chan bool, 10)
+		sigs := make(chan os.Signal, 10)
+		signal.Notify(sigs, syscall.SIGCHLD)
+		go func() {
+			for sig := range sigs {
+				switch sig {
+				case syscall.SIGCHLD:
+					for {
+						var wstatus syscall.WaitStatus
+						wpid, err := syscall.Wait4(-1, &wstatus, syscall.WNOHANG, nil)
+						if err != nil {
+							log.Log.Reason(err).Errorf("Failed to reap process %d", wpid)
+						}
+						if wpid == 0 {
+							log.Log.Infof("No more processes to be reaped")
+							break
+						}
+						reap <- true
+						log.Log.Infof("Reaped pid %d with status %d", wpid, int(wstatus))
+					}
+
+				default:
+					Panic()
+				}
+			}
+		}()
+
+		cmds := []*exec.Cmd{}
+		for range 10 {
+			cmds = append(cmds, start())
+		}
+
+		for i := range 10 {
+			go func(i int) {
+				Expect(cmds[i].Process.Kill()).To(Succeed())
+			}(i)
+		}
+		Eventually(reap).Should(HaveLen(10))
+
 	})
 
 	Describe("VirtLauncher", func() {

Without the loop, we missed 1-7 signals. With the loop I couldn't reproduce it, so I am pretty sure this helps, but I am still not confident that this completely fixes the issue.

exitStatus <- wstatus.ExitStatus()
for {
var wstatus syscall.WaitStatus
wpid, err := syscall.Wait4(-1, &wstatus, syscall.WNOHANG, nil)
Member Author


@vladikr maybe we can actually run a loop that will clean up children regardless of the signal

Member


on exit, right?

Member Author


All the time, as the syscall should block if nothing happens. But I think this is good enough as is.
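
A rough sketch of that always-running variant, purely for illustration (not part of this PR; it assumes the time and log packages are imported and would need care not to race with os/exec's own Wait calls):

go func() {
	for {
		var wstatus syscall.WaitStatus
		// No WNOHANG: block until some child changes state.
		wpid, err := syscall.Wait4(-1, &wstatus, 0, nil)
		if err != nil {
			if err != syscall.ECHILD {
				log.Log.Reason(err).Errorf("Failed to reap process %d", wpid)
			}
			// ECHILD means no children right now; back off briefly either way
			time.Sleep(time.Second)
			continue
		}
		log.Log.Infof("Reaped pid %d with status %d", wpid, int(wstatus))
	}
}()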

@qkfrksvl

big thanks @xpivarc

@qkfrksvl

@xpivarc I had the same issue, but it was a bit more serious. #15711

@xpivarc
Member Author

xpivarc commented Sep 23, 2025

@fossedihelm @vladikr PTAL

@fossedihelm
Contributor

/lgtm
Big thanks
@vladikr up to you to unhold :)

@kubevirt-bot kubevirt-bot added the lgtm Indicates that a PR is ready to be merged. label Sep 23, 2025
@kubevirt-commenter-bot

Required labels detected, running phase 2 presubmits:
/test pull-kubevirt-e2e-k8s-1.31-windows2016
/test pull-kubevirt-e2e-kind-1.33-vgpu
/test pull-kubevirt-e2e-kind-sriov
/test pull-kubevirt-e2e-k8s-1.33-ipv6-sig-network
/test pull-kubevirt-e2e-k8s-1.32-sig-network
/test pull-kubevirt-e2e-k8s-1.32-sig-storage
/test pull-kubevirt-e2e-k8s-1.32-sig-compute
/test pull-kubevirt-e2e-k8s-1.32-sig-operator
/test pull-kubevirt-e2e-k8s-1.33-sig-network
/test pull-kubevirt-e2e-k8s-1.33-sig-storage
/test pull-kubevirt-e2e-k8s-1.33-sig-compute
/test pull-kubevirt-e2e-k8s-1.33-sig-operator

@vladikr
Member

vladikr commented Oct 2, 2025

/unhold

@kubevirt-bot kubevirt-bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 2, 2025
@xpivarc
Member Author

xpivarc commented Oct 2, 2025

/retest-required

@kubevirt-bot
Contributor

kubevirt-bot commented Oct 2, 2025

@xpivarc: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name | Commit | Details | Required | Rerun command
pull-kubevirt-e2e-k8s-1.33-sig-compute-serial | 7fa127f | link | true | /test pull-kubevirt-e2e-k8s-1.33-sig-compute-serial
pull-kubevirt-e2e-k8s-1.33-sig-compute-arm64 | 72229df | link | false | /test pull-kubevirt-e2e-k8s-1.33-sig-compute-arm64

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@kubevirt-bot kubevirt-bot merged commit 71fc271 into kubevirt:main Oct 3, 2025
47 checks passed
@xpivarc
Member Author

xpivarc commented Oct 3, 2025

/cherry-pick release-1.6 release-1.5 release-1.4 release-1.3 release-1.2 release-1.1 release-1.0

@kubevirt-bot
Contributor

@xpivarc: new pull request created: #15816


In response to this:

/cherry-pick release-1.6 release-1.5 release-1.4 release-1.3 release-1.2 release-1.1 release-1.0

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.


Labels

approved - Indicates a PR has been approved by an approver from all required OWNERS files.
area/launcher
dco-signoff: yes - Indicates the PR's author has DCO signed all their commits.
lgtm - Indicates that a PR is ready to be merged.
release-note - Denotes a PR that will be considered when it comes time to generate release notes.
sig/compute
size/S
