Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Unified memory for Polars GPU Engine #16850

Open
beckernick opened this issue Sep 19, 2024 · 0 comments
Open

[FEA] Unified memory for Polars GPU Engine #16850

beckernick opened this issue Sep 19, 2024 · 0 comments
Labels
cudf.polars Issues specific to cudf.polars feature request New feature or request

Comments

@beckernick
Copy link
Member

We now use unified memory + prefetching by default for cuDF Pandas, which has materially improved the user experience while still providing large speedups to workflows -- particularly on the lower-memory GPUs where users are frequently trying to process e.g., 5-20GB datasets.

In contrast, users of the Polars GPU engine are currently at high risk of experiencing out-of-memory errors when processing the medium-sized datasets for which they're hoping accelerated computing can help them handle.

A brief series of ad-hoc investigations exploring UVM for Polars on PDS-H on 100-200GB datasets on higher-memory GPUs like the A100 and H100 have showed promise but run into some performance challenges in selected scenarios. We should formalize this investigation to target enabling UVM for the Polars GPU engine.

@beckernick beckernick added cudf.polars Issues specific to cudf.polars feature request New feature or request labels Sep 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cudf.polars Issues specific to cudf.polars feature request New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant