Fix incorrect CPU float16 implementation by eliasachermann · Pull Request #2395 · spcl/dace

eliasachermann · 2026-06-06T15:07:08Z

Problem

The dace::half fallback in dace/runtime/include/dace/types.h is broken in two independent ways:

No default constructor.
Generated code that declares an uninitialized dace::float16 fails to compile:
```
error: no matching function for call to 'dace::half::half()'
```
Incorrect conversions.
The float↔half conversion is wrong for zero, subnormals, Inf/NaN, and does no rounding.

Fix

Replace both conversions with IEEE-754 round-to-nearest-even routines, correctly handling zero, subnormals, overflow→Inf and Inf/NaN.
Add a default constructor.

Tests

Adds tests/half_cpu_test.py, to test the conversion against NumPy as the reference.

Remove print statement indicating all tests passed.

ThrudPrimrose

We need subnormal, inf and NaN in roundtrip tests to ensure we handle them correctly.

ThrudPrimrose · 2026-06-08T16:20:22Z

+            1e-3,
+            6e-5,
+            6e-8,
+            -6e-8,


Can you add a subnormal number to your tests?

Added inf and nan tests

acalotoiu

LGTM

Merges PR #2395 (head e33d5b0) into extended: replaces the buggy CPU half-precision conversion in dace/runtime/include/dace/types.h with the round-to-nearest-even implementation from the PR, and adds tests/half_cpu_test.py. Conflict resolution: extended had clang-format-reformatted types.h, so the PR's surrounding context conflicted. Kept extended's formatting and swapped in only the corrected half struct (the PR's functional change). half_cpu_test.py 2/2 pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

ahmadbelb · 2026-06-29T22:29:07Z

@eliasachermann @ThrudPrimrose Quick follow up. On ARMv8.2-A targets built with the FP16 arithmetic extension (+fp16 / FEAT_FP16, which Apple Silicon has), _Float16 lowers to native fp16 instructions. So we could use it directly there and keep the software half as the fallback for x86 and anything without native fp16.

#elif (defined(__aarch64__) || defined(__arm64__)) && defined(__FLT16_MANT_DIG__)
    typedef std::complex<float> complex64;
    typedef std::complex<double> complex128;
    typedef _Float16 float16;   // native ARM hardware fp16
#else

@ThrudPrimrose Not sure if ARM is something you want to support here, but if it is I'm happy to open a
separate PR for this, or add it here if that's easier. What do you think?

ThrudPrimrose · 2026-06-30T08:39:14Z

@eliasachermann @ThrudPrimrose Quick follow up. On ARMv8.2-A targets built with the FP16 arithmetic extension (+fp16 / FEAT_FP16, which Apple Silicon has), _Float16 lowers to native fp16 instructions. So we could use it directly there and keep the software half as the fallback for x86 and anything without native fp16.
#elif (defined(__aarch64__) || defined(__arm64__)) && defined(__FLT16_MANT_DIG__)
    typedef std::complex<float> complex64;
    typedef std::complex<double> complex128;
    typedef _Float16 float16;   // native ARM hardware fp16
#else
@ThrudPrimrose Not sure if ARM is something you want to support here, but if it is I'm happy to open a separate PR for this, or add it here if that's easier. What do you think?

This would be a nice extension in my opinion.

Once this PR is merged, I would propose that you open a new PR on top of what has merged such that we have native fp16 support for corresponding ARM target. I also know that latest avx512 instruction sets also have native fp16, could be nice to also support them.

correct CPU half (float16) conversion, add default constructor

334cbf7

eliasachermann changed the title ~~correct CPU half (float16) conversion, add default constructor~~ Fix incorrect CPU float16 implementation Jun 6, 2026

ThrudPrimrose reviewed Jun 8, 2026

View reviewed changes

Comment thread tests/half_cpu_test.py Outdated

Remove success message from test script

463d6b0

Remove print statement indicating all tests passed.

ThrudPrimrose self-requested a review June 8, 2026 16:18

ThrudPrimrose requested changes Jun 8, 2026

View reviewed changes

eliasachermann added 2 commits June 8, 2026 18:55

Add inf and nan tests

a3ce049

Update Copyright

e33d5b0

ThrudPrimrose approved these changes Jun 8, 2026

View reviewed changes

acalotoiu approved these changes Jun 9, 2026

View reviewed changes

ThrudPrimrose and others added 2 commits June 19, 2026 15:35

Merge branch 'main' into fix/cpu-half-conversion

46d46da

Merge branch 'spcl:main' into fix/cpu-half-conversion

0c202be

This was referenced Jun 28, 2026

Fix: Add default constructor to dace::half (fixes float16 CPU compilation) #2416

Closed

float16 arrays fail to compile on the CPU backend, dace::half has no default constructor #2415

Closed

Merge branch 'main' into fix/cpu-half-conversion

8f9b212

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix incorrect CPU float16 implementation#2395

Fix incorrect CPU float16 implementation#2395
eliasachermann wants to merge 7 commits into
spcl:mainfrom
eliasachermann:fix/cpu-half-conversion

eliasachermann commented Jun 6, 2026

Uh oh!

Uh oh!

ThrudPrimrose left a comment

Uh oh!

ThrudPrimrose Jun 8, 2026

Uh oh!

eliasachermann Jun 8, 2026

Uh oh!

acalotoiu left a comment

Uh oh!

ahmadbelb commented Jun 29, 2026

Uh oh!

ThrudPrimrose commented Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

+e-3,
+e-5,
+e-8,
+                          -6e-8,

Uh oh!

Conversation

eliasachermann commented Jun 6, 2026

Problem

Fix

Tests

Uh oh!

Uh oh!

ThrudPrimrose left a comment

Choose a reason for hiding this comment

Uh oh!

ThrudPrimrose Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

eliasachermann Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

acalotoiu left a comment

Choose a reason for hiding this comment

Uh oh!

ahmadbelb commented Jun 29, 2026

Uh oh!

ThrudPrimrose commented Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants