Think about how to pull patterns from GPU devices #176

jyoung3131 · 2024-02-02T16:25:14Z

jyoung3131
Feb 2, 2024
Maintainer

Right now gs_patterns uses Intel PIN to pull CPU traces and bin them into appropriate Spatter inputs. This workflow works well, but it does not provide a true "fair" comparison for GPU Spatter benchmarking runs since the inputs are targeted towards a CPU cache-based architecture. To provide a better comparison, we should use an Intel PIN equivalent that traces directly from GPU applications (CUDA/HIP/OneAPI).

Our understanding is that NVBit could be used in a similar way as Intel Pin or DynamoRio to pull traces from CUDA applications that could be used as inputs to the gs_patterns tool. This is likely a large project that would require significant effort, but we should consider and flesh out the steps to do so.

One noted caveat is that NVBit does not currently support H100 GPUs (sm_90), so we'd need to test with A100 or earlier.

plavin · 2024-07-24T17:50:54Z

plavin
Jul 24, 2024
Maintainer

This feature has been added to LANL's GS Patterns tool. Here is a link to the HPC Garage fork, which will hopefully be merged upstream soon.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Think about how to pull patterns from GPU devices #176

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Think about how to pull patterns from GPU devices #176

jyoung3131 Feb 2, 2024 Maintainer

Replies: 1 comment

plavin Jul 24, 2024 Maintainer

jyoung3131
Feb 2, 2024
Maintainer

plavin
Jul 24, 2024
Maintainer