Adds support for loading F8 e5m2 weights #460

LostRuins · 2024-11-13T03:33:42Z

Adds support for loading F8 e5m2 weights, an alternative to F8 e4m3. Implemented in a similar manner to #359 by @Green-Sky, by converting to f16.

Tested and seems to work.


stduhpf · 2024-11-13T17:46:12Z

model.cpp

+uint16_t f8_e5m2_to_f16(uint8_t fp8) {
+    uint8_t sign = (fp8 >> 7) & 0x1;
+    uint8_t exponent = (fp8 >> 2) & 0x1F;
+    uint8_t mantissa = fp8 & 0x3;
+
+    uint16_t fp16_sign = sign << 15;
+    uint16_t fp16_exponent;
+    uint16_t fp16_mantissa;
+
+    if (exponent == 0 && mantissa == 0) { //zero
+        return fp16_sign;
+    }
+
+    if (exponent == 0x1F) { //NAN and INF
+        fp16_exponent = 0x1F;
+        fp16_mantissa = mantissa ? (mantissa << 8) : 0;
+        return fp16_sign | (fp16_exponent << 10) | fp16_mantissa;
+    }
+
+    if (exponent == 0) { //subnormal numbers
+        fp16_exponent = 0;
+        fp16_mantissa = (mantissa << 8);
+        return fp16_sign | fp16_mantissa;
+    }
+
+    //normal numbers
+    int16_t true_exponent = (int16_t)exponent - 15 + 15;
+    if (true_exponent <= 0) {
+        fp16_exponent = 0;
+        fp16_mantissa = (mantissa << 8);
+    } else if (true_exponent >= 0x1F) {
+        fp16_exponent = 0x1F;
+        fp16_mantissa = 0;
+    } else {
+        fp16_exponent = (uint16_t)true_exponent;
+        fp16_mantissa = mantissa << 8;
+    }
+
+    return fp16_sign | (fp16_exponent << 10) | fp16_mantissa;
+}
+


I might be mistaken, but can't this whole thing be replaced with just `return (uint16_t)fp8 << 8;`, since fp8_e5m2 is basically a truncated fp16?

(or rather `return static_cast<uint16_t>(fp8) << 8;`)
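
Since there are only 256 possible inputs, the suggestion can be checked exhaustively. The following is a minimal sketch, not the code as merged: `f8_e5m2_to_f16_fields`, `f8_e5m2_to_f16_shift`, and `count_mismatches` are hypothetical names, and the field-by-field version is a condensed restatement of the function in the diff above.

```cpp
#include <cstdint>

// Field-by-field conversion, condensed from the diff (hypothetical name).
// e5m2 and fp16 share the same 5-bit exponent and the same bias (15), so the
// exponent is copied unchanged and the 2 mantissa bits become the top 2 of
// fp16's 10 mantissa bits. Zero, subnormals, Inf, and NaN follow the same
// mapping, so no special cases are needed.
uint16_t f8_e5m2_to_f16_fields(uint8_t fp8) {
    uint16_t sign     = (fp8 >> 7) & 0x1;
    uint16_t exponent = (fp8 >> 2) & 0x1F;
    uint16_t mantissa = fp8 & 0x3;
    return static_cast<uint16_t>((sign << 15) | (exponent << 10) | (mantissa << 8));
}

// The reviewer's suggestion: place the 8 bits in the high byte of the fp16.
uint16_t f8_e5m2_to_f16_shift(uint8_t fp8) {
    return static_cast<uint16_t>(static_cast<uint16_t>(fp8) << 8);
}

// Counts how many of the 256 possible inputs give different results.
int count_mismatches() {
    int mismatches = 0;
    for (int i = 0; i < 256; ++i) {
        uint8_t b = static_cast<uint8_t>(i);
        if (f8_e5m2_to_f16_fields(b) != f8_e5m2_to_f16_shift(b)) {
            ++mismatches;
        }
    }
    return mismatches;
}
```

If `count_mismatches()` returns 0, the two versions agree bit for bit on every input, which is what the "truncated fp16" observation predicts.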

Adds support for loading F8 e5m2 weights, which is an alternative of f8 e4m3.

5ba7cc9

LostRuins added a commit to LostRuins/koboldcpp that referenced this pull request Nov 13, 2024

add e5m2 support for use in Kobo, also made a separate contribution PR …

dd95f88

…leejet/stable-diffusion.cpp#460

stduhpf reviewed Nov 13, 2024


leejet merged commit 8f94efa into leejet:master Nov 23, 2024
