Version 0.14
IMPORTANT: This release paves the way towards a breaking update of OpenML-Python. From version 0.15, functions that had the option to return a pandas DataFrame will return a pandas DataFrame by default. This version (0.14) emits a warning if you still use the old access functionality.
More concretely:
- In 0.15 we will drop the ability to return dictionaries in listing calls and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using `output_format="dataframe"`).
- In 0.15 we will drop the ability to return datasets as numpy arrays and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using `dataset_format="dataframe"`).
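The two changes above can be sketched as follows. This is a minimal illustration, not an official migration recipe: the helper name `load_as_dataframes` is hypothetical, the calls assume the 0.14 signatures of `openml.datasets.list_datasets`, `openml.datasets.get_dataset` and `OpenMLDataset.get_data`, and running it requires the `openml` package plus network access.

```python
def load_as_dataframes(dataset_id):
    """Fetch the dataset listing and one dataset, both as pandas DataFrames.

    Hypothetical helper for illustration; not part of OpenML-Python.
    """
    import openml  # imported lazily: needs the openml package and network access

    # Listing call: explicitly request a DataFrame instead of a dictionary.
    listing = openml.datasets.list_datasets(output_format="dataframe")

    # The download flags are set explicitly as well, because 0.14 also warns
    # when they are left unspecified.
    dataset = openml.datasets.get_dataset(
        dataset_id,
        download_data=True,
        download_qualities=True,
        download_features_meta_data=True,
    )

    # Data access: explicitly request DataFrames instead of numpy arrays.
    X, y, categorical_indicator, attribute_names = dataset.get_data(
        target=dataset.default_target_attribute,
        dataset_format="dataframe",
    )
    return listing, X, y
```

Passing both format arguments explicitly keeps the same code working once 0.15 makes DataFrames the only option.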
Furthermore, from version 0.15, OpenML-Python will no longer download datasets and dataset metadata by default. This version (0.14) emits a warning if you don't explicitly specify the desired behavior.
Please see the pull requests #1258 and #1260 for further information.
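To locate call sites that still rely on the old defaults, one option is to escalate the warnings to errors while running a test suite. The sketch below uses only the standard library and assumes the 0.14 warnings are issued with the `FutureWarning` category; `legacy_call` is a hypothetical stand-in for an OpenML-Python call, not a real API.

```python
import warnings


def legacy_call():
    # Hypothetical stand-in for a call that relies on an implicit default.
    warnings.warn(
        "implicit default will change in 0.15",
        category=FutureWarning,
        stacklevel=2,
    )
    return "ok"


def fails_on_future_warning(func):
    """Return True if func emits a FutureWarning when warnings are escalated."""
    with warnings.catch_warnings():
        # Inside this block every FutureWarning is raised as an exception.
        warnings.simplefilter("error", category=FutureWarning)
        try:
            func()
        except FutureWarning:
            return True
    return False
```

Here `fails_on_future_warning(legacy_call)` returns `True`, while a function that emits no warning returns `False`. The same escalation can be applied to a whole run with Python's `-W error::FutureWarning` command-line option.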
- ADD #1081: New flag that allows disabling downloading dataset features.
- ADD #1132: New flag that forces a redownload of cached data.
- FIX #1244: Fixes a rare bug where task listing could fail when the server returned invalid data.
- DOC #1229: Fixes a comment string for the main example.
- DOC #1241: Fixes a comment in an example.
- MAINT #1124: Improve naming of helper functions that govern the cache directories.
- MAINT #1223, #1250: Update tools used in pre-commit to the latest versions (`black==23.3.0`, `mypy==1.3.0`, `flake8==6.0.0`).
- MAINT #1253: Update the citation request to the JMLR paper.
- MAINT #1246: Warn the user that checking for duplicate runs on the server cannot be done without an API key.
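The flags added in #1081 and #1132 can be combined with the other download switches, roughly as sketched below. The changelog does not name the parameters; `download_features_meta_data` and `force_refresh_cache` are assumed here from the 0.14 `get_dataset` signature and should be checked against the API documentation. The helper name `fetch_lazily` is hypothetical.

```python
def fetch_lazily(dataset_id, refresh=False):
    """Fetch a dataset description without downloading data, qualities or features.

    Hypothetical helper for illustration; parameter names are assumptions.
    """
    import openml  # imported lazily: needs the openml package and network access

    return openml.datasets.get_dataset(
        dataset_id,
        download_data=False,                # skip the data file
        download_qualities=False,           # skip dataset qualities
        download_features_meta_data=False,  # assumed name of the #1081 flag
        force_refresh_cache=refresh,        # assumed name of the #1132 flag
    )
```

Setting all three download flags to `False` mirrors the lazy behavior announced for 0.15, and passing them explicitly also avoids the 0.14 warning.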