[Benchmark] Add parquet read benchmark #1371

rjzamora · 2024-07-30T19:03:02Z

Adds new benchmark for parquet read performance using a LocalCUDACluster. The user can pass in --key and --secret options to specify S3 credentials.

E.g.

$ python ./local_read_parquet.py --devs 0,1,2,3,4,5,6,7 --filesystem fsspec --type gpu --file-count 48 --aggregate-files

Parquet read benchmark
--------------------------------------------------------------------------------
Path                      | s3://dask-cudf-parquet-testing/dedup_parquet
Columns                   | None
Backend                   | cudf
Filesystem                | fsspec
Blocksize                 | 244.14 MiB
Aggregate files           | True
Row count                 | 372066
Size on disk              | 1.03 GiB
Number of workers         | 8
================================================================================
Wall clock                | Throughput
--------------------------------------------------------------------------------
36.75 s                   | 28.78 MiB/s
21.29 s                   | 49.67 MiB/s
17.91 s                   | 59.05 MiB/s
================================================================================
Throughput                | 41.77 MiB/s +/- 7.81 MiB/s
Bandwidth                 | 0 B/s +/- 0 B/s
Wall clock                | 25.32 s +/- 8.20 s
================================================================================
...

Notes:

S3 Performance generally scales with the number of workers (multiplied the number of threads per worker)
The example shown above was not executed from an EC2 instance
The example shown above should perform better after Multi-file and Parquet-aware prefetching from remote storage cudf#16657
Using --filesystem arrow together with --type gpu performs well, but depends on Add experimental filesystem="arrow" support in dask_cudf.read_parquet cudf#16684

pentschev · 2024-07-30T20:43:13Z

Performance generally scales with the number of workers (multiplied the number of threads per worker)

I'm assuming this apply to CPU-only operations, or are there CUDA kernels executed as part of this as well?

rjzamora · 2024-07-30T21:00:01Z

I'm assuming this apply to CPU-only operations, or are there CUDA kernels executed as part of this as well?

This benchmark is entirely IO/CPU bound. There is effectively no CUDA compute - we are just transferring remote data into host memory and moving it into device memory (when the default --type gpu is used). Therefore, increasing threads_per_worker * n_workers typically improves performance (because we have more threads making connections and sending requests to S3).

dask_cuda/benchmarks/custom/parquet.py

dask_cuda/benchmarks/remote_parquet.py

rjzamora · 2024-08-29T16:44:31Z

Update: I've generalized this benchmark. It's easy to use with S3 storage, but is also a useful benchmark for local-storage performance.

pentschev

Thanks @rjzamora , I've left some comments.

dask_cuda/benchmarks/local_read_parquet.py

madsbk

Nice @rjzamora, looks good. I only have a minor suggestion.

dask_cuda/benchmarks/utils.py

pentschev

+1 to Mads' suggestion, otherwise LGTM. Thanks @rjzamora !

Co-authored-by: Mads R. B. Kristensen <[email protected]>

dask_cuda/benchmarks/read_parquet.py

rjzamora · 2024-08-30T13:20:56Z

/merge

add new remote-io benchmark

0b48642

rjzamora added 2 - In Progress Currently a work in progress feature request New feature or request non-breaking Non-breaking change labels Jul 30, 2024

rjzamora self-assigned this Jul 30, 2024

github-actions bot added the python python code needed label Jul 30, 2024

use fragment_parallelism

1505890

pentschev reviewed Jul 30, 2024

View reviewed changes

dask_cuda/benchmarks/custom/parquet.py Outdated Show resolved Hide resolved

wence- reviewed Aug 1, 2024

View reviewed changes

dask_cuda/benchmarks/custom/parquet.py Outdated Show resolved Hide resolved

rjzamora added 4 commits August 12, 2024 16:14

Merge branch 'branch-24.10' into remote-io-bench

3860705

Merge remote-tracking branch 'upstrea/branch-24.10' into remote-io-bench

4aa53be

remove custom arrow code path in favor of proper dask-cudf support

ad8df90

fix typo

5b24195

rjzamora commented Aug 29, 2024

View reviewed changes

dask_cuda/benchmarks/remote_parquet.py Outdated Show resolved Hide resolved

rjzamora added 2 commits August 29, 2024 08:49

rename file and generalize

4b045de

fix benchmark name

31f5a8a

rjzamora changed the title ~~[WIP][Benchmark] Add new remote parquet benchmark~~ [Benchmark] Add new remote parquet benchmark Aug 29, 2024

rjzamora changed the title ~~[Benchmark] Add new remote parquet benchmark~~ [Benchmark] Add parquet read benchmark Aug 29, 2024

rjzamora added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Aug 29, 2024

rjzamora marked this pull request as ready for review August 29, 2024 16:02

rjzamora requested a review from a team as a code owner August 29, 2024 16:02

fix case where path ends in slash

a15aa35

pentschev reviewed Aug 29, 2024

View reviewed changes

rjzamora added 2 commits August 29, 2024 17:25

address code review

6f3e5c5

fix output

8745e1d

madsbk approved these changes Aug 30, 2024

View reviewed changes

dask_cuda/benchmarks/utils.py Outdated Show resolved Hide resolved

pentschev approved these changes Aug 30, 2024

View reviewed changes

Update dask_cuda/benchmarks/utils.py

5306575

Co-authored-by: Mads R. B. Kristensen <[email protected]>

rjzamora commented Aug 30, 2024

View reviewed changes

dask_cuda/benchmarks/read_parquet.py Outdated Show resolved Hide resolved

Update dask_cuda/benchmarks/read_parquet.py

5f2937b

rjzamora added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Aug 30, 2024

rapids-bot bot merged commit 1cc4d0b into rapidsai:branch-24.10 Aug 30, 2024
23 checks passed

rjzamora deleted the remote-io-bench branch August 30, 2024 14:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Benchmark] Add parquet read benchmark #1371

[Benchmark] Add parquet read benchmark #1371

rjzamora commented Jul 30, 2024 •

edited

Loading

pentschev commented Jul 30, 2024

rjzamora commented Jul 30, 2024

rjzamora commented Aug 29, 2024

pentschev left a comment

madsbk left a comment

pentschev left a comment

rjzamora commented Aug 30, 2024

[Benchmark] Add parquet read benchmark #1371

[Benchmark] Add parquet read benchmark #1371

Conversation

rjzamora commented Jul 30, 2024 • edited Loading

pentschev commented Jul 30, 2024

rjzamora commented Jul 30, 2024

rjzamora commented Aug 29, 2024

pentschev left a comment

Choose a reason for hiding this comment

madsbk left a comment

Choose a reason for hiding this comment

pentschev left a comment

Choose a reason for hiding this comment

rjzamora commented Aug 30, 2024

rjzamora commented Jul 30, 2024 •

edited

Loading