Skip to content

Version 0.14

Compare
Choose a tag to compare
@mfeurer mfeurer released this 05 Jul 06:40
· 25 commits to main since this release
2791074

IMPORTANT: This release paves the way towards a breaking update of OpenML-Python. From version 0.15, functions that had the option to return a pandas DataFrame will return a pandas DataFrame by default. This version (0.14) emits a warning if you still use the old access functionality.

More concretely:

  • In 0.15 we will drop the ability to return dictionaries in listing calls and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using output_format="dataframe").
  • In 0.15 we will drop the ability to return datasets as numpy arrays and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using dataset_format="dataframe").

Furthermore, from version 0.15, OpenML-Python will no longer download datasets and dataset metadata by default. This version (0.14) emits a warning if you don't explicitly specify the desired behavior.

Please see the pull requests #1258 and #1260 for further information.

  • ADD #1081: New flag that allows disabling downloading dataset features.
  • ADD #1132: New flag that forces a redownload of cached data.
  • FIX #1244: Fixes a rare bug where task listing could fail when the server returned invalid data.
  • DOC #1229: Fixes a comment string for the main example.
  • DOC #1241: Fixes a comment in an example.
  • MAINT #1124: Improve naming of helper functions that govern the cache directories.
  • MAINT #1223, #1250: Update tools used in pre-commit to the latest versions (black==23.30, mypy==1.3.0, flake8==6.0.0).
  • MAINT #1253: Update the citation request to the JMLR paper.
  • MAINT #1246: Add a warning that warns the user that checking for duplicate runs on the server cannot be done without an API key.