Change tiny model dtype from float16 to bfloat16 to fix CUDA error #4745

Merged
albertvillanova merged 1 commit into huggingface:main from albertvillanova:fix-4741 on Dec 24, 2025

albertvillanova (Member) commented Dec 23, 2025

Change tiny model dtype from float16 to bfloat16 to fix CUDA error.

Fix #4741.

Regenerate tiny models: I have checked that CI tests pass with the Hub PR branch before merging to main
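For readers unfamiliar with the two dtypes named above: this PR does not state why float16 triggered the device-side assert, so the following is only background and not a diagnosis. One well-known difference is that float16 has a much narrower exponent range than bfloat16 (its largest finite value is 65504), while bfloat16 keeps the float32 exponent range at the cost of mantissa precision. The sketch below illustrates this with the Python standard library only; `to_float16` and `to_bfloat16` are hypothetical helper names introduced for illustration and are not part of TRL, Transformers, or PyTorch.

```python
import math
import struct


def to_float16(x: float) -> float:
    """Round x to IEEE 754 half precision; out-of-range values become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack values beyond the float16 range;
        # hardware casts would typically produce infinity instead.
        return math.copysign(math.inf, x)


def to_bfloat16(x: float) -> float:
    """Simulate bfloat16 by keeping the top 16 bits of the float32 encoding.

    Assumes x is finite and within float32 range. Uses round-to-nearest-even
    on the 16 discarded bits.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFFFFFF
    bits &= 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


if __name__ == "__main__":
    for value in (1.001, 65504.0, 70000.0, 1e30):
        print(value, to_float16(value), to_bfloat16(value))
```

In this sketch, 70000.0 overflows to infinity in float16 but stays finite in bfloat16, while 1.001 is represented more precisely in float16 than in bfloat16. Whether overflow of this kind was the actual trigger in the CI failures is not confirmed by this conversation.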

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

albertvillanova (Member, Author) commented Dec 24, 2025

I have already used this PR to regenerate the "trl-internal-testing/tiny-LlavaNextForConditionalGeneration" model and merged it to main. All the CUDA errors disappeared from our CI:

  • from 192 failed
  • to 50 failed

@albertvillanova albertvillanova merged commit 51113d1 into huggingface:main Dec 24, 2025
8 of 9 checks passed

Development

Successfully merging this pull request may close these issues.

CI fails with test dependencies: torch.AcceleratorError: CUDA error: device-side assert

3 participants