FP16 isNaN, isFinite, isInfinite intrinsics without Reinterpret nodes #1242

Bhavana-Kilambi · 2024-09-12T07:48:22Z

This patch removes the ReinterpretS2HF nodes in the mid-end during the generation of the isNaNHF, isFiniteHF and isInfiniteHF nodes.
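For readers unfamiliar with the FP16 layout, the three predicates can be described directly on the raw 16-bit pattern held in a `short`. The sketch below shows only those semantics under the IEEE 754 binary16 encoding (1 sign bit, 5 exponent bits, 10 fraction bits). It is not the C2 or aarch64 code from this patch, and the class and method names are hypothetical.

```java
// Illustrative only: Fp16BitChecks and its methods are hypothetical names,
// not code from this patch. They classify a binary16 value from its raw bits.
final class Fp16BitChecks {
    private static final int ABS_MASK = 0x7FFF; // clears the sign bit
    private static final int EXP_MASK = 0x7C00; // all five exponent bits set

    // NaN: exponent all ones and a non-zero fraction.
    static boolean isNaN(short bits) {
        return (bits & ABS_MASK) > EXP_MASK;
    }

    // Infinity: exponent all ones and a zero fraction (either sign).
    static boolean isInfinite(short bits) {
        return (bits & ABS_MASK) == EXP_MASK;
    }

    // Finite: exponent not all ones.
    static boolean isFinite(short bits) {
        return (bits & ABS_MASK) < EXP_MASK;
    }

    public static void main(String[] args) {
        short negInf = (short) 0xFC00;    // negative infinity
        short quietNaN = (short) 0x7E00;  // a quiet NaN
        short maxFinite = (short) 0x7BFF; // largest finite binary16 value
        System.out.println(isInfinite(negInf));
        System.out.println(isNaN(quietNaN));
        System.out.println(isFinite(maxFinite));
    }
}
```

Because the checks need only the integer bit pattern, no value-level conversion between `short` and a floating-point type is required to express them.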

Performance results for this patch on an aarch64 machine:

Benchmark               Gain over baseline      Gain over default
FP16Ops.isFiniteHF      1.29                    1.85
FP16Ops.isInfiniteHF    1.28                    1.90
FP16Ops.isNaNHF         1.45                    1.58

The baseline patch generates FP16 floating-point instructions; the default is the case where no FP16 intrinsics are used and FP32 instructions are generated instead.

Gain = throughput of this patch / throughput of the baseline or the default, respectively.

Also tested the FP16Ops.isInfiniteHF benchmark on x86, where this patch performs 2.6x better than the default case (which converts FP16 to FP32 and uses the vfpclass instruction).
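As a rough source-level picture of the default case described above, the sketch below widens the FP16 bits to FP32 and then classifies the resulting `float`. It assumes JDK 20 or later for `Float.float16ToFloat`. The class name is hypothetical, and this shows only the shape of the convert-then-check path, not the instructions the JIT actually emits.

```java
// Illustrative only: Fp16ViaFloat is a hypothetical name. It models the
// "default" path: convert binary16 bits to float, then classify the float.
final class Fp16ViaFloat {
    static boolean isNaN(short bits) {
        return Float.isNaN(Float.float16ToFloat(bits));
    }

    static boolean isInfinite(short bits) {
        return Float.isInfinite(Float.float16ToFloat(bits));
    }

    static boolean isFinite(short bits) {
        return Float.isFinite(Float.float16ToFloat(bits));
    }

    public static void main(String[] args) {
        short posInf = (short) 0x7C00; // positive infinity in binary16
        short one = (short) 0x3C00;    // 1.0 in binary16
        System.out.println(isInfinite(posInf));
        System.out.println(isFinite(one));
    }
}
```

The extra widening step on every call is the overhead that an FP16-specific intrinsic avoids.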

The JMH tests are added in this patch.

Progress

Change must not contain extraneous whitespace

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/valhalla.git pull/1242/head:pull/1242
$ git checkout pull/1242

Update a local copy of the PR:
$ git checkout pull/1242
$ git pull https://git.openjdk.org/valhalla.git pull/1242/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 1242

View PR using the GUI difftool:
$ git pr show -t 1242

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/valhalla/pull/1242.diff


bridgekeeper · 2024-09-12T07:49:05Z

👋 Welcome back bkilambi! A progress list of the required criteria for merging this PR into lworld+fp16 will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2024-09-12T07:49:32Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

Bhavana-Kilambi added 2 commits September 11, 2024 08:35

8339473: Add support for FP16 isFinite, isInfinite and isNaN

1d1dfb3

This patch adds intrinsic support for FP16 isNaN, isFinite and isInfinite methods and also adds aarch64 backend for these intrinsics. Tested all FP16 related tests successfully on aarch64.

Bhavana-Kilambi changed the title ~~JDK-8339473: FP16 isNaN, isFinite, isInfinite intrinsics without Reinterpret nodes~~ FP16 isNaN, isFinite, isInfinite intrinsics without Reinterpret nodes Sep 12, 2024
