Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Tags: vosen/ZLUDA

Tags

v6-preview.54

Toggle v6-preview.54's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Implement extended precision integer addition (#607)

* Refactor emit_intrinsic to allow struct return type

* Implement extended precision integer addition

Uses `llvm.*add.with.overflow.*`. That intrinsic does not take a carry argument, so handling carry in requires multiple additions and combining the carry out, but the AMDGPU target is able to translate that pattern into a single instruction.

These four PTX instructions:

```
    add.cc.u32      r0, a0, b0;
    addc.cc.u32     r1, a1, b1;
    addc.cc.u32     r2, a2, b2;
    addc.u32        r3, a3, b3;
```

are translated into four RDNA3 instructions:

```
    v_add_co_u32 v0, vcc_lo, v0, v4
    v_add_co_ci_u32_e32 v1, vcc_lo, v1, v5, vcc_lo
    v_add_co_ci_u32_e32 v2, vcc_lo, v2, v6, vcc_lo
    v_add_co_ci_u32_e32 v3, vcc_lo, v7, v3, vcc_lo
```

* cargo fmt

* Rename to match convention

* cargo fmt

v6-preview.53

Toggle v6-preview.53's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Host functions for vLLM (#606)

v6-preview.52

Toggle v6-preview.52's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add sad and dp2a instructions (#605)

v6-preview.51

Toggle v6-preview.51's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Use partial parsing result in release mode (#603)

Fix for a regression in #569 and some minor fixes for llama.cpp on Windows

v6-preview.50

Toggle v6-preview.50's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Update docs (add llama.cpp, zluda_precompile sections) (#602)

v6-preview.49

Toggle v6-preview.49's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Stop failing on bf16 uint_to_fp on amdgpu < gfx11 (#601)

v6-preview.48

Toggle v6-preview.48's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add CUDA 13.1 compatibility (#599)

Also fix all the warnings

v6-preview.47

Toggle v6-preview.47's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
When building ZLUDA in CI, make sure we build Linux binaries compatib…

…le with both ROCm 6 and ROCm 7 (#589)

v6-preview.46

Toggle v6-preview.46's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
When building ZLUDA in CI, make sure we build Linux binaries compatib…

…le with both ROCm 6 and ROCm 7 (#589)

v6-preview.45

Toggle v6-preview.45's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Allow implicit conversion from bit scalar to vec for st (#585)