v6-preview.54

Implement extended precision integer addition (#607)

* Refactor emit_intrinsic to allow struct return type

* Implement extended precision integer addition

Uses the `llvm.*add.with.overflow.*` intrinsics. These intrinsics do not take a carry argument, so handling carry-in requires multiple additions and combining their carry-outs, but the AMDGPU target is able to translate that pattern into a single instruction.

These four PTX instructions:

```
    add.cc.u32      r0, a0, b0;
    addc.cc.u32     r1, a1, b1;
    addc.cc.u32     r2, a2, b2;
    addc.u32        r3, a3, b3;
```

are translated into four RDNA3 instructions:

```
    v_add_co_u32 v0, vcc_lo, v0, v4
    v_add_co_ci_u32_e32 v1, vcc_lo, v1, v5, vcc_lo
    v_add_co_ci_u32_e32 v2, vcc_lo, v2, v6, vcc_lo
    v_add_co_ci_u32_e32 v3, vcc_lo, v7, v3, vcc_lo
```

* cargo fmt

* Rename to match convention

* cargo fmt

Jan 20, 2026
0faa099

v6-preview.53

Host functions for vLLM (#606)

Jan 15, 2026
5901264

v6-preview.52

Add sad and dp2a instructions (#605)

Jan 15, 2026
8e1f995

v6-preview.51

Use partial parsing result in release mode (#603)

Fix for a regression in #569 and some minor fixes for llama.cpp on Windows

Jan 13, 2026
2beaee4

v6-preview.50

Update docs (add llama.cpp, zluda_precompile sections) (#602)

Jan 13, 2026
b3ede5d

v6-preview.49

Stop failing on bf16 uint_to_fp on amdgpu < gfx11 (#601)

Jan 12, 2026
0d7f3fd

v6-preview.48

Add CUDA 13.1 compatibility (#599)

Also fix all the warnings

Jan 9, 2026
d9a5304

v6-preview.47

When building ZLUDA in CI, make sure we build Linux binaries compatible with both ROCm 6 and ROCm 7 (#589)

Jan 8, 2026
7fe40d3

v6-preview.46

When building ZLUDA in CI, make sure we build Linux binaries compatible with both ROCm 6 and ROCm 7 (#589)

Jan 8, 2026
7fe40d3

v6-preview.45

Allow implicit conversion from bit scalar to vec for st (#585)

Jan 3, 2026
294f236

Tags: vosen/ZLUDA