use constraints-dev.txt in e2e tests
#3320
Conversation
For instructlab, "pip install ." does not install vllm, but it does install an uncapped torch (2.7.0 currently). When we install vllm later, we compile a binary flash_attn wheel against torch 2.7.0. vllm 0.8.4 requires torch==2.6.0, so we downgrade torch, and then we use that with the incompatible flash_attn binary wheel. To resolve this, use constraints-dev.txt in the first pip install operation. This restricts torch to 2.6.0 immediately when we first install instructlab, so that we will compile flash_attn against that torch version. Signed-off-by: Ken Dreyer <[email protected]>
E2E (NVIDIA L40S x4) workflow launched on this PR: View run
e2e workflow failed on this PR: View run, please investigate.
This fixes the flash-attn problem. The e2e tests get further, but they still fail with NCCL timeouts. I've filed #3321 to track that separately.
Force merging since Ken confirmed this improves the situation even if it doesn't make CI green yet.
@mergify backport release-v0.26
✅ Backports have been created
…-3320 use `constraints-dev.txt` in e2e tests (backport #3320)
This is a port of instructlab/instructlab#3320 over to the SDG repository. While doing so, I noticed we were also not using "-DGGML_CUDA=ON", so I updated that as well, since it's the same pip install line in the file. Signed-off-by: Ben Browning <[email protected]> (cherry picked from commit 225612c)
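For context, a minimal sketch of what the combined SDG change could look like, assuming the e2e script builds llama-cpp-python from source via the standard CMAKE_ARGS environment variable; the exact script and any extra pip flags are assumptions, not the real file:

```bash
# Enable CUDA when llama-cpp-python is compiled, and apply the constraints file
# on the same pip install line so torch is pinned before any wheels are built.
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -c constraints-dev.txt .
```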
For instructlab, `pip install .` does not install `vllm`, but it does install an uncapped `torch` (`2.7.0` currently). When we install `vllm` later, we compile a binary `flash_attn` wheel against `torch 2.7.0`. `vllm 0.8.4` requires `torch==2.6.0`, so we downgrade `torch`, and then we use that with the incompatible `flash_attn` binary wheel, which then fails to import with an `ImportError`. To resolve this, use `constraints-dev.txt` in the first `pip install` operation. This restricts `torch` to `2.6.0` immediately when we first install instructlab, so that we will compile `flash_attn` against that `torch` version.
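As a quick sanity check (generic, not part of the CI scripts), one can confirm after the constrained install that torch reports the pinned version and that the compiled flash_attn wheel still imports against it:

```bash
# Both imports should succeed and torch should report 2.6.0; an ABI mismatch
# between the wheels typically shows up here as an ImportError.
python -c "import torch, flash_attn; print(torch.__version__)"
```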