Datasets

Latest version: v3.5.0

Safety actively analyzes 723158 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 6

4.9.2

Added

- [Experimental] A list of freeform text tags can now be attached to a
`BuilderConfig`. For example:
py
BUILDER_CONFIGS = [
tfds.core.BuilderConfig(name="foo", tags=["foo", "live"]),
tfds.core.BuilderConfig(name="bar", tags=["bar", "old"]),
]

The tags are recorded with the dataset metadata and can later be retrieved
using the info object:
py
builder.info.config_tags ["foo", "live"]

This feature is experimental and there are no guidelines on tags format.

Changed

Deprecated

Removed

Fixed

- Fixed generated proto files (see issue [4858](https://github.com/tensorflow/datasets/issues/4858)).

Security

4.9.1

Added

Changed

Deprecated

Removed

Fixed

- The installation on macOS now works (see issues
[4805](https://github.com/tensorflow/datasets/issues/4805) and
[4852](https://github.com/tensorflow/datasets/issues/4852)). The ArrayRecord
dependency is lazily loaded, so the
[TensorFlow-less path](https://www.tensorflow.org/datasets/tfless_tfds) is
not possible at the moment on macOS. A fix for this will follow soon.

Security

4.9.0

Added

- Native support for JAX and PyTorch. TensorFlow is no longer a dependency for
reading datasets. See the
[documentation](https://www.tensorflow.org/datasets/tfless_tfds).
- Added minival split to
[LVIS dataset](https://www.tensorflow.org/datasets/catalog/lvis).
- [Mixed-human](https://www.tensorflow.org/datasets/catalog/robomimic_mh) and
[machine-generated](https://www.tensorflow.org/datasets/catalog/robomimic_mg)
robomimic datasets.
- WebVid dataset.
- ImagenetPI dataset.
- [Wikipedia](https://www.tensorflow.org/datasets/catalog/wikipedia) for
20230201.

Changed

- Support for `tensorflow=2.12`.

Deprecated

Removed

Fixed

Security

4.8.3

Added

Changed

Deprecated

- Python 3.7 support: this version and future version use Python 3.8.

Removed

Fixed

- Flag `ignore_verifications` from Hugging Face's `datasets.load_dataset` is
deprecated, and used to cause errors in `tfds.load(huggingface:foo)`.

Security

4.8.2

Deprecated

- Python 3.7 support: this is the last version of TFDS supporting Python 3.7.
Future versions will use Python 3.8.

Fixed

- `tfds new` and `tfds build` better support the new recommended datasets
organization, where individual datasets have their own package under
`datasets/`, builder class is called `Builder` and is defined within module
`${dsname}_dataset_builder.py`.

Security

4.8.1

Changed

- Added file `valid_tags.txt` to not break builds.
- TFDS no longer relies on TensorFlow DTypes. We chose NumPy DTypes to keep the
typing expressiveness, while dropping the heavy dependency on TensorFlow. We
migrated all our internal datasets. Please, migrate accordingly:
- `tf.bool`: `np.bool_`
- `tf.string`: `np.str_`
- `tf.int64`, `tf.int32`, etc: `np.int64`, `np.int32`, etc
- `tf.float64`, `tf.float32`, etc: `np.float64`, `np.float32`, etc

Page 2 of 6

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.