FP16 isNaN, isFinite, isInfinite intrinsics without Reinterpret nodes #1242
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This patch removes the ReinterpretS2HF nodes in the mid-end during the generation of isNaNHF,isFiniteHF and isInfiniteHF nodes.
Performance results for this patch on an aarch64 machine -
The baseline patch generates floating point FP16 instructions and the default is where no FP16 intrinsics are used and FP32 instructions are generated.
Gain : thrpt of this patch / thrpt of either baseline or default
Tested FP16Ops.isInfiniteHF test on x86 and the performance is 2.6x better over the default case (which converts FP16 to FP32 and uses the vfpclass instruction).
The JMH tests are added in this patch.
Progress
Reviewing
Using
git
Checkout this PR locally:
$ git fetch https://git.openjdk.org/valhalla.git pull/1242/head:pull/1242
$ git checkout pull/1242
Update a local copy of the PR:
$ git checkout pull/1242
$ git pull https://git.openjdk.org/valhalla.git pull/1242/head
Using Skara CLI tools
Checkout this PR locally:
$ git pr checkout 1242
View PR using the GUI difftool:
$ git pr show -t 1242
Using diff file
Download this PR as a diff file:
https://git.openjdk.org/valhalla/pull/1242.diff