You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We haven't supported polars yet. I'm hoping #10452 can help XGBoost manage the variety of dataframe inputs. We currently have cuDF, pandas, modin, pyarrow (no categorical support yet due to missing feature in pyarrow the last time we check), and datatable. The dispatching code along with the CI is getting out of hand now, especially with the many extensions that pandas has.
Some positive news regarding polars adoption: Kaggle has made a push in that direction. The API in some visible competitions accept both pandas and polars df as solution (see here - in that case the pandas version seems bugged, making polars the only viable solution).
From what I understand polars is not directly supported yet. What would be the best alternative to make it works ? (avoiding memory duplicates + categorical support)
XGBoost raises a
ValueError
when trying to train a model with a Polars dataframe with categorical data.Error:
Expected result: That it works like pandas
The text was updated successfully, but these errors were encountered: