Lower conditional traps in backend #9072

amartosch · 2024-08-02T21:22:48Z

This PR moves conditional traps to backends from legalization, as described in #6055. Hope this is what was meant in the issue and is still needed. There are two things I'm not sure about:

1. Sequences for s390x. I'm not familiar with the ISA, but conditional trap generation has already been implemented in the backend, so I've just blessed the tests.
2. On RISC-V, traps are emitted inline; not trapping means jumping over the trap. On aarch64 and x86_64, they are emitted in islands. I wonder if there is a deeper reason for this. If not, should the approach be unified with the other backends?

This is my first contribution, but I didn't ask any questions because this seemed straightforward. Please feel free to simply discard it if that assumption was wrong.

fitzgen · 2024-08-02T22:19:25Z

Excited to see this come in! Will take a closer look on Monday, thanks!

fitzgen

Thanks! This is indeed exactly the kind of thing we were imagining in #6055. Overall the PR looks good, just a few nitpicks inline below.

> Sequences for s390x. I'm not familiar with the ISA, but conditional trap generation has already been implemented in the backend, so I've just blessed the tests.

This is generally fine. It would be good to double check whether we have run tests and fuzzgen support for conditional traps and add them if they are missing. Then we would be exercising actual execution behavior, which is going to give us a lot more confidence in the correctness of the implementation than blessing output will.

I don't see any trap[n]z.clif file in https://github.com/bytecodealliance/wasmtime/tree/main/cranelift/filetests/filetests/runtests nor does a grep show anything, so it would definitely be good to add as part of this PR.

I think CLIF interpreter and fuzzgen support could happen in follow ups, though.

> On RISC-V, traps are emitted inline; not trapping means jumping over the trap. On aarch64 and x86_64, they are emitted in islands. I wonder if there is a deeper reason for this. If not, should the approach be unified with the other backends?

I believe the answer is that we can implement them inline on RISC-V without bloating code size (compared to branching out of line; s390x is inline as well, fwiw) but not on the other architectures. So, for those other architectures, we push the actual trapping instruction out of line to improve icache usage (since traps are extremely rare and effectively terminate the program).
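The two placements discussed above can be sketched as a toy layout function in Rust. This is only an illustration under stated assumptions: `TrapPlacement`, `layout`, and the pseudo-assembly strings are hypothetical names invented here, not Cranelift's emission code.

```rust
/// Where the trapping instruction of a conditional trap is placed.
/// Hypothetical type for illustration only.
#[derive(Clone, Copy)]
enum TrapPlacement {
    /// The trap sits in the instruction stream; the non-trapping path
    /// branches over it.
    Inline,
    /// The trap is deferred to an island after the hot code.
    Island,
}

/// Lays out pseudo-assembly for "check a condition, then run `hot_path`"
/// under either placement.
fn layout(placement: TrapPlacement, hot_path: &[&str]) -> Vec<String> {
    let mut out = Vec::new();
    match placement {
        TrapPlacement::Inline => {
            out.push("branch_if_ok skip".to_string());
            out.push("trap".to_string());
            out.push("skip:".to_string());
            out.extend(hot_path.iter().map(|s| s.to_string()));
        }
        TrapPlacement::Island => {
            out.push("branch_if_fail island".to_string());
            out.extend(hot_path.iter().map(|s| s.to_string()));
            out.push("island:".to_string());
            out.push("trap".to_string());
        }
    }
    out
}
```

Calling `layout(TrapPlacement::Island, &["add", "ret"])` keeps the hot instructions contiguous and places `trap` last, which is the icache argument in miniature; the inline variant interleaves the trap with the hot path.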

cranelift/codegen/src/isa/aarch64/lower.isle

cranelift/codegen/src/isa/x64/lower.isle

afonso360 · 2024-08-06T08:23:22Z

> This is generally fine. It would be good to double check whether we have run tests and fuzzgen support for conditional traps and add them if they are missing. Then we would be exercising actual execution behavior, which is going to give us a lot more confidence in the correctness of the implementation than blessing output will.

We don't have great support for this in runtests (see #4781), so we might only be able to test the non-trapping path.

> On RISC-V, traps are emitted inline; not trapping means jumping over the trap. On aarch64 and x86_64, they are emitted in islands. I wonder if there is a deeper reason for this. If not, should the approach be unified with the other backends?

I attempted to fix this a while ago, but with the conditional branch instruction that we use, we only have an effective jump range of ±4 KiB, which is quite restrictive. So we opted to leave the traps inline.

This is somewhat mitigated by using the compressed instruction extension, which lets us emit conditional traps using only 2 compressed instructions (4 bytes), so it ends up not being a big deal.
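To make the range constraint concrete, here is a minimal Rust sketch of the check a backend would need before pointing a RISC-V conditional branch at an out-of-line trap. It assumes the standard B-type encoding, whose immediate is a 13-bit signed byte offset with an implicit zero low bit (so -4096 to +4094). The name `fits_b_type_branch` is hypothetical and is not part of Cranelift.

```rust
/// Returns true if `offset` (in bytes, relative to the branch instruction)
/// can be encoded in a RISC-V B-type conditional branch.
///
/// Assumption: the standard B-type format, a 13-bit signed immediate whose
/// lowest bit is always zero.
fn fits_b_type_branch(offset: i64) -> bool {
    const MIN: i64 = -(1 << 12); // -4096
    const MAX: i64 = (1 << 12) - 2; // +4094
    offset % 2 == 0 && (MIN..=MAX).contains(&offset)
}
```

A trap island placed after a function body larger than about 4 KiB would fail this check, which is why an out-of-line scheme would need either a longer branch sequence or islands inserted mid-function.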

amartosch · 2024-08-06T18:41:18Z

Also added very basic runtests that only check the non-trapping path for now.

fitzgen

Thanks! Looks great!

fitzgen · 2024-09-20T23:04:08Z

Strange, for some reason CI hung and this never merged.

cranelift/codegen/src/isa/aarch64/inst.isle

fitzgen · 2024-09-20T23:06:31Z

Nice, looks like adding a commit (just rewording a comment) triggered CI.

fitzgen · 2024-09-20T23:07:12Z

Sorry for not catching this sooner!

Instead of legalizing `trapz` and `trapnz` in the mid-end, we now take them all the way to the backend. This allows us to GVN them and remove redundant trap checks. This also allows us to avoid creating new blocks in the legalizer and otherwise invalidating the control-flow graph.
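As a rough illustration of why keeping conditional traps as ordinary instructions helps, the following Rust sketch removes repeated trap checks on the same condition value, in the spirit of GVN. Everything here (`TrapCheck`, `dedup_trap_checks`, the numeric ids) is hypothetical and far simpler than Cranelift's actual mid-end. It also assumes the checks sit in straight-line code, so an earlier check always dominates a later identical one.

```rust
use std::collections::HashSet;

/// A toy conditional trap: trap with `code` if value `cond` is nonzero.
/// Hypothetical type for illustration only.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
struct TrapCheck {
    cond: u32, // id of the SSA value being tested
    code: u8,  // trap code
}

/// Keeps the first occurrence of each distinct check, preserving order.
/// A later identical check is redundant: if the first one did not trap,
/// the second cannot trap either.
fn dedup_trap_checks(checks: &[TrapCheck]) -> Vec<TrapCheck> {
    let mut seen = HashSet::new();
    checks.iter().copied().filter(|c| seen.insert(*c)).collect()
}
```

If the legalizer had already expanded each check into a branch plus a new block, the duplicates would no longer look like identical pure-ish instructions, and this kind of deduplication would be much harder to express.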

amartosch requested a review from a team as a code owner August 2, 2024 21:22

amartosch requested review from fitzgen and removed request for a team August 2, 2024 21:22

github-actions bot added the cranelift, cranelift:area:aarch64, and cranelift:area:x64 labels Aug 3, 2024

fitzgen reviewed Aug 6, 2024

cranelift/codegen/src/isa/aarch64/lower.isle (outdated)

cranelift/codegen/src/isa/x64/lower.isle (outdated)

fitzgen approved these changes Aug 6, 2024

fitzgen enabled auto-merge August 6, 2024 20:09

fitzgen disabled auto-merge September 20, 2024 23:04

fitzgen reviewed Sep 20, 2024

cranelift/codegen/src/isa/aarch64/inst.isle (outdated)

fitzgen enabled auto-merge September 20, 2024 23:06

fitzgen force-pushed the cond-traps-in-backend branch from 4199ac3 to 62a8701 Compare September 20, 2024 23:13

fitzgen disabled auto-merge September 21, 2024 01:20

fitzgen requested a review from a team as a code owner September 21, 2024 01:29

fitzgen requested review from alexcrichton and removed request for a team September 21, 2024 01:29

fitzgen force-pushed the cond-traps-in-backend branch from 3bb1841 to 745c604 Compare September 21, 2024 01:40

Lower trap[n]z clif instructions in Pulley

3a6bf4f

fitzgen force-pushed the cond-traps-in-backend branch from 745c604 to e9cb9e6 Compare September 21, 2024 01:54

Update disas tests

3886384

fitzgen force-pushed the cond-traps-in-backend branch from e9cb9e6 to 3886384 Compare September 21, 2024 01:55

alexcrichton approved these changes Sep 23, 2024

alexcrichton added this pull request to the merge queue Sep 23, 2024

Merged via the queue into bytecodealliance:main with commit eb0428e Sep 23, 2024
71 checks passed
