FAILED [0.0016s] external-builds\pytorch\pytorch\test\test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_rocm_cuda - RuntimeError: input tensor has spatial dimension larger than the kernel capacity
= 1 failed, 15541 passed, 24534 skipped, 301 deselected, 44 xfailed, 2 subtests passed in 783.13s (0:13:03) =
Overview
PyTorch unit tests are hanging on multiple test runners, across multiple torch versions. This does not appear to be a recent regression.
Symptoms / evidence / details
Workflow run: https://github.com/ROCm/TheRock/actions/runs/26707885453, using ROCm version 7.14.0a20260531
gfx1151:
CS-RORDMZ-DT244 runner: https://github.com/ROCm/TheRock/actions/runs/26738630672/job/78812375372
gfx110X-all:
azure-windows-11-gfx1101 runners
index: https://rocm.nightlies.amd.com/v2-staging/gfx110X-all/
torch version 2.9, tests segfaulted: https://github.com/ROCm/TheRock/actions/runs/26707885453/job/78783769908#step:13:3270
torch version 2.10, tests completed: https://github.com/ROCm/TheRock/actions/runs/26707885453/job/78783770046#step:13:40191
Observed on torch version e.g. 2.10.0+rocm7.14.0a20260531
Jobs and log snippets: