Datasets

Latest version: v3.1.0

Safety actively analyzes 681775 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 6

4.9.7

Added

- New datasets.

Changed

- `CroissantBuilder`'s API to generate TFDS datasets from Croissant files.

Deprecated

Removed

Fixed

- Versions for existing datasets.

Security

4.9.6

Added

- Full support for Python 3.12.

4.9.5

Added

- Support to download and prepare datasets using the
[Parquet](https://parquet.apache.org) data format.
python
builder = tfds.builder('fashion_mnist', file_format='parquet')
builder.download_and_prepare()
ds = builder.as_dataset(split='train')
print(next(iter(ds)))


- [`tfds.data_source`](https://www.tensorflow.org/datasets/api_docs/python/tfds/data_source)
is pickable, thus working smoothly with
[PyGrain](https://github.com/google/grain). Learn more by following the
[tutorial](https://www.tensorflow.org/datasets/data_source).

- TFDS plays nicely with
[Croissant](https://mlcommons.org/working-groups/croissant). Learn more by
following the
[recipe](https://colab.research.google.com/github/mlcommons/croissant/blob/main/python/mlcroissant/recipes/tfds_croissant_builder.ipynb).

Changed

Deprecated

Removed

Fixed

Security

4.9.4

Added

- A new [CroissantBuilder](https://www.tensorflow.org/datasets/format_specific_dataset_builders#croissantbuilder)
which initializes a DatasetBuilder based on a [Croissant](https://github.com/mlcommons/croissant)
metadata file.
- New conversion options between different bounding boxes formats.
- Better support for `HuggingfaceDatasetBuilder`.
- A [script](https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/scripts/convert_format.py)
to convert a dataset from one format to another.

Changed

Deprecated

- Python 3.9 support. TFDS now uses Python 3.10

Removed

Fixed

Security

4.9.3

Added

- [Segment Anything](https://ai.facebook.com/datasets/segment-anything-downloads)
(SA-1B) dataset.

Changed

- Hugging Face datasets accept `None` values for any features. TFDS has no
`tfds.features.Optional`, so `None` values are converted to default values.
Those default values used to be `0` and `0.0` for int and float. Now, it's
`-inf` as defined by NumPy (e.g., `np.iinfo(np.int32).min` or
`np.finfo(np.float32).min`). This avoids ambiguous values when `0` and `0.0`
exist in the values of the dataset. The roadmap is to implement
`tfds.features.Optional`.

Deprecated

- Python 3.8 support. As per
[NEP 29](https://numpy.org/neps/nep-0029-deprecation_policy.html), TFDS now
uses Python>=3.9.

Removed

Fixed

Security

4.9.2

Added

- [Experimental] A list of freeform text tags can now be attached to a
`BuilderConfig`. For example:
py
BUILDER_CONFIGS = [
tfds.core.BuilderConfig(name="foo", tags=["foo", "live"]),
tfds.core.BuilderConfig(name="bar", tags=["bar", "old"]),
]

The tags are recorded with the dataset metadata and can later be retrieved
using the info object:
py
builder.info.config_tags ["foo", "live"]

This feature is experimental and there are no guidelines on tags format.

Changed

Deprecated

Removed

Fixed

- Fixed generated proto files (see issue [4858](https://github.com/tensorflow/datasets/issues/4858)).

Security

Page 1 of 6

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.