Openml

Latest version: v0.15.1

2.0.20241110

0.15.1

Highlights (the release notes will be cleaned up later):

* Fix usage of environment variables for locating the default cache and configuration directories by eddiebergman in https://github.com/openml/openml-python/pull/1359
* Allow skipping parquet file downloads by setting the `OPENML_SKIP_PARQUET` environment variable to `true` by PGijsbers in https://github.com/openml/openml-python/pull/1388
* A lot of maintenance work by eddiebergman and LennartPurucker
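
A minimal sketch of setting the `OPENML_SKIP_PARQUET` variable from within Python rather than from the shell. It assumes the variable should be in place before openml-python attempts any download; the notes above do not say exactly when the library reads it.

```python
import os

# Ask openml-python to skip parquet downloads, as described in PR 1388.
# Assumption: set this before importing or using openml so the setting
# is visible whenever the library checks the environment.
os.environ["OPENML_SKIP_PARQUET"] = "true"

print(os.environ["OPENML_SKIP_PARQUET"])
```

Setting the variable in the shell before starting Python (for example with `export OPENML_SKIP_PARQUET=true`) should have the same effect.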

Thanks to everyone who contributed in any way ❤️

What's Changed
* [pre-commit.ci] pre-commit autoupdate by pre-commit-ci in https://github.com/openml/openml-python/pull/1329
* Bump codecov/codecov-action from 3 to 4 by dependabot in https://github.com/openml/openml-python/pull/1328
* Disable docker release on PR by LennartPurucker in https://github.com/openml/openml-python/pull/1360
* fix(datasets): Add code `111` for dataset description not found error by eddiebergman in https://github.com/openml/openml-python/pull/1356
* Test Fixes for v0.15.1 by LennartPurucker in https://github.com/openml/openml-python/pull/1358
* fix: Avoid Random State and Other Test Bug by LennartPurucker in https://github.com/openml/openml-python/pull/1362
* fix/maint: Make Docs Work Again and Stop Progress.rst Usage by LennartPurucker in https://github.com/openml/openml-python/pull/1365
* doc: README Rework by LennartPurucker in https://github.com/openml/openml-python/pull/1361
* doc: make all examples use names instead of IDs as reference. by LennartPurucker in https://github.com/openml/openml-python/pull/1367
* fix: avoid stripping whitespaces for feature names by LennartPurucker in https://github.com/openml/openml-python/pull/1368
* fix: workaround for git test workflow for Python 3.8 by LennartPurucker in https://github.com/openml/openml-python/pull/1369
* add: test for dataset comparison and ignore fields by LennartPurucker in https://github.com/openml/openml-python/pull/1370
* fix: github workflows and pytest issue by LennartPurucker in https://github.com/openml/openml-python/pull/1373
* feat: support for loose init model from run by LennartPurucker in https://github.com/openml/openml-python/pull/1371
* fix/maint: avoid exit code (which kills the docs building) by LennartPurucker in https://github.com/openml/openml-python/pull/1374
* ux: Provide helpful link to documentation when error due to missing API token by eddiebergman in https://github.com/openml/openml-python/pull/1364
* ci: Docker/build-push-action from 5 to 6 by dependabot in https://github.com/openml/openml-python/pull/1357
* ci: Bump peter-evans/dockerhub-description from 3 to 4 by dependabot in https://github.com/openml/openml-python/pull/1326
* fix: resolve Sphinx style error by LennartPurucker in https://github.com/openml/openml-python/pull/1375
* docs: fix broken links after openml.org rework by LennartPurucker in https://github.com/openml/openml-python/pull/1376
* [pre-commit.ci] pre-commit autoupdate by pre-commit-ci in https://github.com/openml/openml-python/pull/1380
* [pre-commit.ci] pre-commit autoupdate by pre-commit-ci in https://github.com/openml/openml-python/pull/1381
* Mark test as production by PGijsbers in https://github.com/openml/openml-python/pull/1384
* Patch release bump by PGijsbers in https://github.com/openml/openml-python/pull/1389


**Full Changelog**: https://github.com/openml/openml-python/compare/v0.15.0...v0.15.1

0.15.0

What's Changed

* ADD 1335: Improve MinIO support.
* Add a progress bar for downloading MinIO files. Enable it by setting `show_progress` to true in either `openml.config` or the configuration file.
* When using `download_all_files`, files are only downloaded if they do not yet exist in the cache.
* FIX 1338: Read the configuration file without overwriting it.
* MAINT 1340: Add Numpy 2.0 support. Update tests to work with scikit-learn <= 1.5.
* ADD 1342: Add HTTP header to requests to indicate they are from openml-python.
* ADD 1345: `task.get_dataset` now takes the same parameters as `openml.datasets.get_dataset` to allow fine-grained control over file downloads.
* MAINT 1346: The ARFF file of a dataset is now only downloaded if parquet is not available.
* MAINT 1349: Removed usage of the `distutils` module, which allows for Py3.12 compatibility.
* MAINT 1351: Image archives are now automatically deleted after they have been downloaded and extracted.
* MAINT 1352, 1354: When fetching tasks and datasets, file download parameters now default to not downloading the file. Files will be downloaded only when a user tries to access properties which require them (e.g., `dataset.qualities` or `dataset.get_data`).
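
The lazy download behaviour and the progress bar can be sketched as follows. This is an illustration under assumptions, not a definitive recipe: the function name `fetch_dataset_lazily` and its `dataset_id` argument are hypothetical, and the keyword arguments are those of `openml.datasets.get_dataset` as generally documented for openml-python. Calling the function requires the `openml` package and network access.

```python
def fetch_dataset_lazily(dataset_id: int):
    """Fetch dataset metadata first and download the data only on access."""
    # Imported inside the function so the sketch can be defined even
    # where openml is not installed.
    import openml

    # Show a progress bar for MinIO downloads (see ADD 1335 above).
    openml.config.show_progress = True

    # Spell out the 0.15 defaults: nothing is downloaded up front.
    dataset = openml.datasets.get_dataset(
        dataset_id,
        download_data=False,
        download_qualities=False,
        download_features_meta_data=False,
    )

    # Accessing the data triggers the actual file download.
    X, y, categorical_indicator, attribute_names = dataset.get_data()
    return X
```

Passing the flags explicitly is optional in 0.15, but it documents the intent and keeps the call stable if the defaults change again.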

New Contributors
* BrunoBelucci made their first contribution in https://github.com/openml/openml-python/pull/1338
* knyazer made their first contribution in https://github.com/openml/openml-python/pull/1345

**Full Changelog**: https://github.com/openml/openml-python/compare/v0.14.2...v0.15.0

0.14.2

This is a minor release that ships several hotfixes and addresses technical debt.

* MAINT 1280: Use the server-provided ``parquet_url`` instead of ``minio_url`` to determine the location of the parquet file.
* ADD 716: add documentation for remaining attributes of classes and functions.
* ADD 1261: more annotations for type hints.
* MAINT 1294: update tests to new tag specification.
* FIX 1314: Update fetching a bucket from MinIO.
* FIX 1315: Make class label retrieval more lenient.
* ADD 1316: add support for feature description ontologies.
* MAINT 1310/1307: switch to ruff and resolve all mypy errors.

0.14.0

**IMPORTANT:** This release paves the way towards a breaking update of OpenML-Python. From version 0.15, functions that had the option to return a pandas DataFrame will return a pandas DataFrame by default. This version (0.14) emits a warning if you still use the old access functionality.

More concretely:

* In 0.15 we will drop the ability to return dictionaries in listing calls and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using `output_format="dataframe"`).
* In 0.15 we will drop the ability to return datasets as numpy arrays and only provide pandas DataFrames. To disable warnings in 0.14 you have to request a pandas DataFrame (using `dataset_format="dataframe"`).

Furthermore, from version 0.15, OpenML-Python will no longer download datasets and dataset metadata by default. This version (0.14) emits a warning if you don't explicitly specify the desired behavior.

Please see the pull requests 1258 and 1260 for further information.
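
The migration described above can be sketched like this for version 0.14. The function name `list_and_load_as_dataframes` and its `dataset_id` argument are hypothetical. The `output_format` and `dataset_format` arguments are the ones named in the notes, and the download flags are assumed to match the 0.14 signature of `openml.datasets.get_dataset`. Calling the function requires the `openml` package and network access.

```python
def list_and_load_as_dataframes(dataset_id: int):
    """Request pandas DataFrames and state the download behaviour explicitly."""
    # Imported inside the function so the sketch can be defined even
    # where openml is not installed.
    import openml

    # Listing call: ask for a DataFrame instead of the legacy dictionary.
    datasets = openml.datasets.list_datasets(output_format="dataframe")

    # State the download behaviour explicitly instead of relying on the
    # default, which changes in 0.15.
    dataset = openml.datasets.get_dataset(
        dataset_id,
        download_data=True,
        download_qualities=True,
        download_features_meta_data=True,
    )

    # Dataset access: ask for a DataFrame instead of a numpy array.
    X, y, categorical_indicator, attribute_names = dataset.get_data(
        dataset_format="dataframe"
    )
    return datasets, X
```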

* ADD 1081: New flag that allows disabling downloading dataset features.
* ADD 1132: New flag that forces a redownload of cached data.
* FIX 1244: Fixes a rare bug where task listing could fail when the server returned invalid data.
* DOC 1229: Fixes a comment string for the main example.
* DOC 1241: Fixes a comment in an example.
* MAINT 1124: Improve naming of helper functions that govern the cache directories.
* MAINT 1223, 1250: Update tools used in pre-commit to the latest versions (``black==23.3.0``, ``mypy==1.3.0``, ``flake8==6.0.0``).
* MAINT 1253: Update the citation request to the JMLR paper.
* MAINT 1246: Add a warning informing the user that checking for duplicate runs on the server cannot be done without an API key.

0.13.1

* ADD 1028: Add functions to delete runs, flows, datasets, and tasks (e.g., openml.datasets.delete_dataset).
* ADD 1144: Add locally computed results to the OpenMLRun object’s representation if the run was created locally and not downloaded from the server.
* ADD 1180: Improve the error message when the checksum of a downloaded dataset does not match the checksum provided by the API.
* ADD 1201: Make OpenMLTraceIteration a dataclass.
* DOC 1069: Add argument documentation for the OpenMLRun class.
* FIX 1197 559 1131: Fix the order of ground truth and predictions in the OpenMLRun object and in format_prediction.
* FIX 1198: Support numpy 1.24 and higher.
* FIX 1216: Allow unknown task types on the server. This is only relevant when new task types are added to the test server.
* MAINT 1155: Add dependabot github action to automatically update other github actions.
* MAINT 1199: Obtain pre-commit’s flake8 from github.com instead of gitlab.com.
* MAINT 1215: Support latest numpy version.
* MAINT 1218: Test Python3.6 on Ubuntu 20.04 instead of the latest Ubuntu (which is 22.04).
* MAINT 1221 1212 1206 1211: Update github actions to the latest versions.
