kaladron · 2026-04-23T17:07:51Z

Under CTest, LIBC_GPU_TEST_JOBS controlled a ninja job pool that limited concurrent GPU test processes. The AMD GPU buildbot sets this to 4 to avoid overloading the GPU driver.

When running tests via lit, this constraint was lost because lit uses its own -j flag (defaulting to nproc, or set to 64 on the AMD bot via LLVM_LIT_ARGS). All GPU loader processes launched simultaneously, leading to hangs from GPU resource exhaustion.

Propagated LIBC_GPU_TEST_JOBS into the lit site config as a parallelism group so lit throttles GPU test concurrency independently of the global -j setting.
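
The throttling idea can be sketched in plain Python. This is only an illustration of a per-group concurrency cap sitting underneath a larger worker pool; it is not lit's code, and the names `run_with_group_limit`, `fake_gpu_test`, `workers`, and `group_limit` are hypothetical stand-ins for the global -j value and the LIBC_GPU_TEST_JOBS value.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def run_with_group_limit(num_tests, workers, group_limit):
    """Run dummy tests on `workers` threads while letting at most
    `group_limit` of them execute at the same time.

    Returns the peak number of tests observed running concurrently.
    """
    # The semaphore plays the role of the parallelism group (assumption:
    # a simple counting gate is enough to illustrate the behaviour).
    gate = threading.BoundedSemaphore(group_limit)
    lock = threading.Lock()
    running = 0
    peak = 0

    def fake_gpu_test(_index):
        nonlocal running, peak
        with gate:  # blocks once `group_limit` tests are already inside
            with lock:
                running += 1
                peak = max(peak, running)
            time.sleep(0.01)  # stand-in for the GPU loader doing work
            with lock:
                running -= 1

    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(fake_gpu_test, range(num_tests)))
    return peak


if __name__ == "__main__":
    # 16 workers are available, but no more than 4 tests run at once.
    print(run_with_group_limit(num_tests=32, workers=16, group_limit=4))
```

The worker count and the group limit are independent knobs here, which mirrors the intent described above: raising the global job count does not raise the number of GPU tests in flight.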

llvmbot · 2026-04-23T17:09:45Z

@llvm/pr-subscribers-libc

Author: Jeff Bailey (kaladron)

Full diff: https://github.com/llvm/llvm-project/pull/193797.diff

2 Files Affected:

(modified) libc/cmake/modules/prepare_libc_gpu_build.cmake (+1)
(modified) libc/test/lit.site.cfg.py.in (+5)

diff --git a/libc/cmake/modules/prepare_libc_gpu_build.cmake b/libc/cmake/modules/prepare_libc_gpu_build.cmake
index c87a1df926c85..554c6c49b0435 100644
--- a/libc/cmake/modules/prepare_libc_gpu_build.cmake
+++ b/libc/cmake/modules/prepare_libc_gpu_build.cmake
@@ -29,6 +29,7 @@ if(LIBC_GPU_TEST_JOBS)
   set_property(GLOBAL PROPERTY JOB_POOLS LIBC_GPU_TEST_POOL=${LIBC_GPU_TEST_JOBS})
   set(LIBC_HERMETIC_TEST_JOB_POOL JOB_POOL LIBC_GPU_TEST_POOL)
 else()
+  set(LIBC_GPU_TEST_JOBS 1)
   set_property(GLOBAL PROPERTY JOB_POOLS LIBC_GPU_TEST_POOL=1)
   set(LIBC_HERMETIC_TEST_JOB_POOL JOB_POOL LIBC_GPU_TEST_POOL)
 endif()
diff --git a/libc/test/lit.site.cfg.py.in b/libc/test/lit.site.cfg.py.in
index 3668a491cd05c..bc8d0e3e31713 100644
--- a/libc/test/lit.site.cfg.py.in
+++ b/libc/test/lit.site.cfg.py.in
@@ -40,3 +40,8 @@ if hasattr(config, "llvm_tools_dir") and config.llvm_tools_dir:
         [config.llvm_tools_dir, config.environment.get("PATH", "")]
     )
 
+# Limit concurrent GPU tests to avoid overloading the GPU driver.
+libc_gpu_test_jobs = "@LIBC_GPU_TEST_JOBS@"
+if libc_gpu_test_jobs:
+    lit_config.parallelism_groups["libc-gpu"] = int(libc_gpu_test_jobs)
+    config.parallelism_group = "libc-gpu"
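
As a side note on the site-config hunk: the configured value arrives as a string, so an empty substitution skips the group entirely, while a non-numeric value would make `int()` raise. A minimal sketch of that parsing step follows, assuming a hypothetical helper named `resolve_gpu_test_jobs` that is not part of the patch. The extra check for values below 1 is an addition for illustration and is not something the diff does.

```python
def resolve_gpu_test_jobs(raw):
    """Convert the configure-time string into a job count.

    Returns None when the value is empty, mirroring the `if` guard in the
    site config, so no parallelism group would be registered.
    """
    text = raw.strip()
    if not text:
        return None
    jobs = int(text)  # raises ValueError for non-numeric input
    if jobs < 1:
        # Hypothetical extra validation, not present in the patch.
        raise ValueError(f"GPU test job count must be positive, got {jobs}")
    return jobs


if __name__ == "__main__":
    for raw in ("4", "1", ""):
        print(repr(raw), "->", resolve_gpu_test_jobs(raw))
```

With the CMake change above defaulting the variable to 1 inside the GPU build module, the empty case would presumably only occur when that module did not run.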

michaelrj-google

Approving to avoid blocking lit switchover

jhuber6

I'll need to reevaluate this, it's a bit better than it was in the past, but still possible to exhaust scratch.

kaladron · 2026-04-23T18:37:30Z

> I'll need to reevaluate this, it's a bit better than it was in the past, but still possible to exhaust scratch.

My best guess is that what was taking out the AMD fmul test was running 64 GPU tests simultaneously. I think it did remarkably well. =) The NV tests on my machine happily handled 32.

kaladron requested a review from jhuber6 April 23, 2026 17:07

kaladron marked this pull request as ready for review April 23, 2026 17:09

llvmbot added the libc label Apr 23, 2026

kaladron mentioned this pull request Apr 23, 2026

[libc] Switch check-libc from CTest to lit #193798

Open

michaelrj-google approved these changes Apr 23, 2026

jhuber6 approved these changes Apr 23, 2026

[libc] Honour LIBC_GPU_TEST_JOBS in lit test runs #193797

kaladron wants to merge 1 commit into llvm:main from kaladron:lit-gpu-jobs
