
cuda implementation of magnetic reflectivity #135

Draft: wants to merge 2 commits into master
Conversation

pkienzle
Member

Here's a version of magnetic reflectivity using numba.cuda; it currently runs about 4x slower.

Run using:

python run.py doc/examples/magrough/model.py --profile --steps=50 | less -S

Feel free to play and try to make it run faster.

@pkienzle
Member Author

Note that this implementation is single precision with no stability correction. If it were fast enough then it might be worth trying to improve it. Basically, divide each matrix entry by the maximum on the diagonal and multiply the final result by that product. At least, that's how I was able to compute reflectivity from 10 km thick samples for the non-polarized case.
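The rescaling idea above can be sketched with plain NumPy (a minimal sketch on a chain of generic 2x2 matrices, not the actual transfer matrices used here; the 1e3-scaled random chain is just a stand-in whose unscaled product would overflow):

```python
import numpy as np

def stable_matrix_product(matrices):
    """Multiply a chain of matrices, dividing each factor by the maximum
    magnitude on its diagonal so partial products stay O(1).  Returns
    (product, log_scale), where product * exp(log_scale) recovers the
    unscaled result."""
    result = np.eye(matrices[0].shape[0], dtype=matrices[0].dtype)
    log_scale = 0.0
    for m in matrices:
        scale = np.abs(np.diag(m)).max()
        result = result @ (m / scale)
        log_scale += np.log(scale)
    return result, log_scale

# Stand-in chain: 500 matrices with diagonals near 1e3, whose raw
# product (~1e3**500) overflows float64 without rescaling.
rng = np.random.default_rng(0)
chain = [np.eye(2) * 1e3 + rng.normal(size=(2, 2)) for _ in range(500)]
product, log_scale = stable_matrix_product(chain)
```

Since the reflectivity amplitude is a ratio of elements of the final matrix, the accumulated scale cancels there and only the normalized product is needed.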

Modern gaming cards are 30x slower for double precision than for single; maybe I'm accidentally promoting floats to double and killing performance that way.
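The promotion trap is easy to demonstrate on the host side with NumPy (array names and values here are illustrative, not the actual kernel inputs): one float64 operand silently promotes the whole expression, whereas keeping every operand float32 preserves the narrow type.

```python
import numpy as np

# A float64 operand silently promotes a float32 computation.
kz = np.linspace(0.0, 0.5, 400, dtype=np.float32)
rho = np.full(400, 2.07e-6)                  # defaults to float64
promoted = kz**2 - 4.0e-6 * np.pi * rho      # whole expression is now double

# Keeping every operand in float32 avoids the promotion.
rho32 = rho.astype(np.float32)
kept = kz**2 - np.float32(4.0e-6 * np.pi) * rho32
```

Checking `.dtype` on intermediate arrays before they are copied to the device is a quick way to find where doubles sneak in.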

@pkienzle
Member Author

Part of the problem appears to be communication overhead with the card. Changing the q length from 400 to 4000 changes execution time very little (the RTX 2080 card has 4300 processors, so up to that many q values can run concurrently). For 40000 q values execution time increases 10x, which makes sense given the number of processors.

Changing the layer data to fill a single matrix might help with the communication overhead, but that's a more involved code modification. The new memory layout may help performance on the card, making it easier to place it into shared memory so that access patterns don't matter so much.
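Packing the per-layer parameters into a single contiguous matrix might look like the following (a sketch; the `pack_layers` helper and the column layout are assumptions, not the current code):

```python
import numpy as np

def pack_layers(d, rho, irho, sigma):
    """Pack per-layer parameters into one contiguous float32 matrix so the
    whole profile moves to the GPU in a single host-to-device transfer.
    Assumed column layout: thickness, SLD, absorption, interface width."""
    layers = np.column_stack([d, rho, irho, sigma]).astype(np.float32)
    return np.ascontiguousarray(layers)

# Hypothetical 4-layer profile (substrate and vacuum have zero thickness).
d = [0.0, 100.0, 200.0, 0.0]
rho = [0.0, 4.0, 2.0, 6.8]
irho = [0.0, 0.1, 0.0, 0.0]
sigma = [5.0, 5.0, 5.0, 0.0]
layers = pack_layers(d, rho, irho, sigma)
```

A single (nlayers, 4) array replaces four separate transfers, and rows land adjacent in memory, which also makes it straightforward to stage the profile in shared memory on the device.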

Making the reflectivity calculation "asynchronous" so that you can compose the next layer matrix while the current calculation is running on the card would also help.

In any case, convolution dominates as the number of q values increases, so that, too, needs to be moved onto the card.
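For reference, the resolution convolution is embarrassingly parallel over q, so it should port well. A CPU sketch of a per-point Gaussian smearing (an illustration of the structure, not the project's actual resolution routine):

```python
import numpy as np

def gaussian_convolve(q, dq, qi, ri):
    """Resolution-smear reflectivity: for each measurement point q[k],
    average the theory curve (qi, ri) with a Gaussian of width dq[k].
    Each output point is independent, so this maps naturally onto one
    CUDA thread per q value."""
    out = np.empty(len(q), dtype=np.float64)
    for k in range(len(q)):
        w = np.exp(-0.5 * ((qi - q[k]) / dq[k]) ** 2)
        out[k] = np.sum(w * ri) / np.sum(w)
    return out

# Stand-in theory curve and measurement grid.
qi = np.linspace(0.0, 0.5, 1000)
ri = np.exp(-10.0 * qi)
q = np.linspace(0.05, 0.45, 50)
dq = np.full_like(q, 0.005)
smeared = gaussian_convolve(q, dq, qi, ri)
```

The loop body is exactly what one thread would compute, so the kernel version is mostly a matter of replacing the outer loop with the thread index.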

@pkienzle pkienzle changed the base branch from magnetic_reflectivity_py to master May 3, 2022 20:46
@pkienzle pkienzle marked this pull request as draft May 3, 2022 20:46