[jvm-packages] Support Ranker #10823

wbo4958 · 2024-09-16T13:26:21Z

No description provided.

wbo4958 · 2024-09-16T13:28:30Z

Hi @trivialfis, @eordentlich, could you help review this? Thanks.

eordentlich · 2024-09-17T16:40:35Z

Would be good to explain how this resolves the issue raised in this comment and issue linked therein:
#10639 (comment)

eordentlich

Some questions re plugin and preprocess.

eordentlich · 2024-09-17T22:03:15Z

jvm-packages/xgboost4j-spark/src/main/scala/ml/dmlc/xgboost4j/scala/spark/XGBoostRanker.scala

+   */
+  override private[spark] def preprocess(dataset: Dataset[_]): (Dataset[_], ColumnIndices) = {
+    val (output, columnIndices) = super.preprocess(dataset)
+    (output.sortWithinPartitions(getGroupCol), columnIndices)


How does this operation interact with spark-rapids plugin if enabled? Any implications on GPU memory?

Does this preprocess even get called if the plugin is enabled? If not, the partitions might not be sorted.

wbo4958 · 2024-09-18

My bad, this issue is fixed now. Please help review it again. Thanks very much.
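
The concern in this review thread can be illustrated with a minimal sketch in plain Scala (no Spark, no spark-rapids). It assumes the ranker needs rows of the same query group to be adjacent inside each partition, and shows that sorting each partition by the group column makes them adjacent without moving rows between partitions. The `Row` type and the helpers `sortWithinPartitions` and `groupSizes` are hypothetical names for illustration only; they are not XGBoost4J-Spark or Spark APIs.

```scala
object SortWithinPartitionsSketch {
  // Hypothetical row type for illustration; not part of XGBoost4J-Spark.
  final case class Row(qid: Int, label: Double)

  // Sort every partition by qid independently. Rows never cross partitions,
  // which mirrors the idea behind Dataset.sortWithinPartitions.
  def sortWithinPartitions(partitions: Seq[Seq[Row]]): Seq[Seq[Row]] =
    partitions.map(_.sortBy(_.qid))

  // Sizes of consecutive runs of identical qid within one partition.
  // If a group is not contiguous, it shows up as several short runs.
  def groupSizes(partition: Seq[Row]): Seq[Int] =
    partition.foldLeft(List.empty[(Int, Int)]) {
      case ((q, n) :: rest, row) if q == row.qid => (q, n + 1) :: rest
      case (acc, row)                            => (row.qid, 1) :: acc
    }.reverse.map(_._2)

  def main(args: Array[String]): Unit = {
    val partitions = Seq(
      Seq(Row(2, 1.0), Row(1, 0.0), Row(2, 0.0), Row(1, 1.0)),
      Seq(Row(3, 1.0), Row(4, 0.0), Row(3, 0.0))
    )
    // Before sorting: interleaved groups break into runs of length 1.
    println(partitions.map(groupSizes))
    // After sorting: each qid forms exactly one run per partition.
    println(sortWithinPartitions(partitions).map(groupSizes))
  }
}
```

If the sort is skipped on some code path (the situation raised above for the plugin case), the run lengths no longer match the real group sizes, which is why the sort has to happen on both the CPU and GPU paths.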

wbo4958 · 2024-09-19T02:52:16Z

Hi @eordentlich @trivialfis, could you help take a look at it? Thanks very much.

eordentlich

This resolves the issue with the plugin and sorted partitions (and it's nice to see the test for this case too), but I'm still wondering how that partition sort is computed by the spark-rapids plugin when enabled. Is it done on the GPU?

Also, does this PR resolve the issue I reference in an earlier comment?

@trivialfis should take a look as well.

wbo4958 · 2024-09-19T13:46:34Z

Hi @eordentlich,

I just tried the case below, which follows the same pattern as XGBoost:

df.repartition(2).sortWithinPartitions("class").collect()

and got the following physical plan:

== Physical Plan ==
AdaptiveSparkPlan (10)
+- == Final Plan ==
   GpuColumnarToRow (6)
   +- GpuSort (5)
      +- GpuShuffleCoalesce (4)
         +- ShuffleQueryStage (3), Statistics(sizeInBytes=6.4 KiB, rowCount=150)
            +- GpuColumnarExchange (2)
               +- GpuScan parquet  (1)
+- == Initial Plan ==
   Sort (9)
   +- Exchange (8)
      +- Scan parquet  (7)

XGBoost leverages ColumnarRdd to extract the cuDF table. ColumnarRdd does some case matching to filter out the RDDs that involve row-wise operations. So after the conversion, we get the RDDs that correspond to the GPU plan below. You can see that the final cuDF table comes from GpuSort, which runs on the GPU:

   +- GpuSort (5)
      +- GpuShuffleCoalesce (4)
         +- ShuffleQueryStage (3), Statistics(sizeInBytes=6.4 KiB, rowCount=150)
            +- GpuColumnarExchange (2)
               +- GpuScan parquet  (1)

eordentlich

👍

trivialfis · 2024-09-19T19:18:56Z

Will look into this tomorrow.

trivialfis · 2024-09-20T21:17:15Z

> Also, does this PR resolve the issue I reference in an earlier comment?

Thank you for raising that. I will be looking into it along with other LTR issues and feature requests after sorting out some of the work on external memory. I still think a within-partition sort is sufficient for most use cases. The worst case is adding qid-based partitioning, which might be as expensive as a global sort.
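
The trade-off described above can be sketched in plain Scala (no Spark). Under the assumption that upstream partitioning is arbitrary (modeled here as round-robin), a within-partition sort makes each group contiguous locally but can still leave one qid spread over several partitions; partitioning by qid first keeps every group in a single partition, at the cost of an extra shuffle. All names below (`Row`, `roundRobin`, `byQid`, `splitQids`) are hypothetical and only for illustration; they are not part of XGBoost4J-Spark.

```scala
object QidPartitioningSketch {
  // Hypothetical row type for illustration only.
  final case class Row(qid: Int, label: Double)

  // Assumed stand-in for an arbitrary upstream partitioning: round-robin by position.
  def roundRobin(rows: Seq[Row], numPartitions: Int): Seq[Seq[Row]] =
    (0 until numPartitions).map { p =>
      rows.zipWithIndex.collect { case (row, i) if i % numPartitions == p => row }
    }

  // qid-based partitioning: every row of a query group lands in the same partition.
  def byQid(rows: Seq[Row], numPartitions: Int): Seq[Seq[Row]] =
    (0 until numPartitions).map { p =>
      rows.filter(row => java.lang.Math.floorMod(row.qid, numPartitions) == p)
    }

  // qids that appear in more than one partition, i.e. groups that were split.
  def splitQids(partitions: Seq[Seq[Row]]): Set[Int] =
    partitions
      .flatMap(_.map(_.qid).distinct)
      .groupBy(identity)
      .collect { case (qid, hits) if hits.size > 1 => qid }
      .toSet

  def main(args: Array[String]): Unit = {
    val rows = Seq(
      Row(1, 1.0), Row(1, 0.0), Row(2, 1.0),
      Row(2, 0.0), Row(3, 1.0), Row(3, 0.0)
    )
    // Within-partition sort only: groups are contiguous locally but may be split.
    println(splitQids(roundRobin(rows, 2).map(_.sortBy(_.qid))))
    // qid-based partitioning plus within-partition sort: no group is split.
    println(splitQids(byQid(rows, 2).map(_.sortBy(_.qid))))
  }
}
```

Whether split groups matter in practice depends on the data and objective, which is the open question deferred in the comment above.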

trivialfis

Looks good to me, assuming all tests can pass.

wbo4958 added 2 commits September 13, 2024 09:48

Support ranker

ece0b9b

test the group col which should be sorted in each partition

eeca573

eordentlich reviewed Sep 18, 2024


wbo4958 added 2 commits September 18, 2024 15:38

sort partition for gpu

1c8c3b9

Merge remote-tracking branch 'upstream/master' into ranker

da969b7

wbo4958 requested a review from eordentlich September 19, 2024 02:51

eordentlich reviewed Sep 19, 2024


eordentlich approved these changes Sep 19, 2024


Merge branch 'master' into ranker

b62bb8c

trivialfis approved these changes Sep 20, 2024


trivialfis merged commit 19b55b3 into dmlc:master Sep 21, 2024
30 checks passed

wbo4958 deleted the ranker branch September 21, 2024 23:01
