Rdt

Latest version: v1.15.1

Page 5 of 10

1.4.2

This release fixes a bug that caused datetime and numerical transformers to crash if a column was all NaNs. Additionally, it adds support for Pandas 2.0!

Bugs

* Numerical & datetime transformers crash if the entire column is null - Issue [637](https://github.com/sdv-dev/RDT/issues/637) by frances-h

Maintenance

* Remove upper bound for pandas - Issue [633](https://github.com/sdv-dev/RDT/issues/633) by pvk-developer

1.4.1

This release patches an issue that prevented the `RegexGenerator` from working with regexes that had a very large number of possible combinations.
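To illustrate why a very large number of possible combinations is a problem, here is a minimal sketch using only the standard library. It is not RDT's implementation: the alphabet, the slot count, and both helper functions are hypothetical stand-ins for a pattern such as `[A-Za-z0-9]{12}`. The point is that the size of the space can be computed arithmetically, and values can be drawn lazily, without ever building the full list of matches.

```python
import itertools
import string

# Hypothetical stand-in for a pattern like [A-Za-z0-9]{12}:
# twelve slots, each drawn from the same 62-character alphabet.
ALPHABET = string.ascii_letters + string.digits
SLOTS = 12


def count_combinations(alphabet, slots):
    """Return the number of distinct strings without generating any of them."""
    return len(alphabet) ** slots


def first_values(alphabet, slots, n):
    """Lazily walk the space and build only the first n strings."""
    combos = itertools.product(alphabet, repeat=slots)
    return ["".join(chars) for chars in itertools.islice(combos, n)]


total = count_combinations(ALPHABET, SLOTS)  # 62 ** 12, far too many to list
sample = first_values(ALPHABET, SLOTS, 3)
print(total)
print(sample)
```

Materializing all `62 ** 12` strings up front would exhaust memory, whereas the lazy iterator only ever holds the values that were requested.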

Bugs

* RegexGenerator continues to have problems if there are too many possibilities - Issue [635](https://github.com/sdv-dev/RDT/issues/635) by pvk-developer

1.4.0

This release adds the `OrderedLabelEncoder` and deprecates the `CustomLabelEncoder`. It also makes each generator-type transformer in the `HyperTransformer` use a different random seed.

Additionally, bugs were patched in the `RegexGenerator` that caused it to crash or take too long in certain cases. Finally, this release improved the detection of Faker functions in the `AnonymizedFaker`.

Bugs

* Find nested Faker provider submodules - PR [630](https://github.com/sdv-dev/RDT/pull/630) by frances-h
* RegexGenerator fails to generate values if there are too many possibilities - Issue [623](https://github.com/sdv-dev/RDT/issues/623) by R-Palazzo
* RegexGenerator takes too much time and runs out of memory if there are too many possibilities - Issue [624](https://github.com/sdv-dev/RDT/issues/624) by R-Palazzo

New Features

* Choose a different seed for each transformer - Issue [619](https://github.com/sdv-dev/RDT/issues/619) by fealho
* Rename CustomLabelEncoder to OrderedLabelEncoder - Issue [621](https://github.com/sdv-dev/RDT/issues/621) by R-Palazzo
* Add functionality to find version add-on - Issue [620](https://github.com/sdv-dev/RDT/issues/620) by frances-h

1.3.0

This release makes changes to the way that individual transformers are stored in the `HyperTransformer`. When accessing the config via `HyperTransformer.get_config()`, the transformers listed in the config are now the actual transformer instances used during fitting and transforming. These instances can now be accessed and used to examine their properties post fitting. For example, you can now view the mapping for a `PseudoAnonymizedFaker` instance using `PseudoAnonymizedFaker.get_mapping()` on the instance retrieved from the config.

Additionally, the output of `reverse_transform` no longer appends the `.value` suffix to every unnamed output column. Only output columns that are created from context extracted from the input columns will have suffixes (e.g. `.normalized` in the `ClusterBasedNormalizer`).

The `AnonymizedFaker` and `RegexGenerator` now have an `enforce_uniqueness` parameter, which controls whether the data returned by `reverse_transform` should be unique. The `HyperTransformer` now has a method called `create_anonymized_columns` that can be used to generate columns that are matched with anonymizing transformers like `AnonymizedFaker` and `RegexGenerator`. The method can be used as follows:
`HyperTransformer.create_anonymized_columns(num_rows=5, column_names=['email_optin', 'credit_card'])`
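The idea behind a uniqueness guarantee can be sketched in plain Python. This is a toy model, not RDT's code: the `UniqueSampler` class, its `sample` method, and the choice to raise a `ValueError` when the pool runs out are all assumptions made for illustration. Only the `enforce_uniqueness` parameter name comes from the release notes.

```python
import random


class UniqueSampler:
    """Toy generator that never repeats a value when enforce_uniqueness is set."""

    def __init__(self, choices, enforce_uniqueness=False, seed=0):
        self._choices = list(choices)
        self._enforce_uniqueness = enforce_uniqueness
        self._rng = random.Random(seed)
        self._used = set()  # values already handed out

    def sample(self, num_rows):
        if not self._enforce_uniqueness:
            # Repeats are allowed: draw each row independently.
            return [self._rng.choice(self._choices) for _ in range(num_rows)]

        remaining = [c for c in self._choices if c not in self._used]
        if num_rows > len(remaining):
            # Assumed behavior: fail loudly rather than silently repeat.
            raise ValueError(
                f"Requested {num_rows} unique values, only {len(remaining)} left."
            )
        picked = self._rng.sample(remaining, num_rows)
        self._used.update(picked)
        return picked


sampler = UniqueSampler(["a", "b", "c", "d"], enforce_uniqueness=True)
first_batch = sampler.sample(3)   # three distinct values
print(first_batch)
```

Because the sampler remembers what it has already returned, uniqueness holds across calls and not just within a single call.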

Another major change in this release is the ability to control randomization. Every time a `HyperTransformer` is initialized, its randomness is reset to the same seed, so it yields the same results for `reverse_transform` when given the same input. Every subsequent call to `reverse_transform` yields a different result. To reset the seed, call `HyperTransformer.reset_randomization`.
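The seeding behavior described above can be modeled with a small standard-library sketch. It is only an analogy for the described semantics, not the library's implementation: the `SeededGenerator` class, its `generate` method, and the seed value are hypothetical, and only the name `reset_randomization` is taken from the release notes.

```python
import random


class SeededGenerator:
    """Toy model: a fixed seed at construction, advancing state on each call."""

    DEFAULT_SEED = 42  # hypothetical value chosen for this sketch

    def __init__(self):
        self._rng = random.Random(self.DEFAULT_SEED)

    def generate(self, n):
        # Each call advances the internal state, so consecutive calls differ.
        return [self._rng.randint(0, 999) for _ in range(n)]

    def reset_randomization(self):
        # Return to the state of a freshly created instance.
        self._rng.seed(self.DEFAULT_SEED)


first = SeededGenerator()
second = SeededGenerator()

run_a = first.generate(5)
run_b = first.generate(5)          # differs from run_a
fresh_run = second.generate(5)     # same as run_a: new instance, same seed

first.reset_randomization()
after_reset = first.generate(5)    # same as run_a again
```

Two freshly created instances agree with each other, repeated calls on one instance diverge, and resetting restores the initial sequence.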

Finally, this release adds support for Python 3.10 and drops support for 3.6.

Bugs

* The reset_randomization should also apply to fit and transform - Issue [608](https://github.com/sdv-dev/RDT/issues/608) by amontanez24
* Cannot print CustomLabelEncoder: ValueError - Issue [607](https://github.com/sdv-dev/RDT/issues/607) by amontanez24
* Float formatter learn_rounding_scheme doesn't work on all digits - Issue [556](https://github.com/sdv-dev/RDT/issues/556) by fealho
* Warnings not showing on update_transformers_by_sdtype - Issue [582](https://github.com/sdv-dev/RDT/issues/582) by amontanez24
* OneHotEncoder doesn't work with boolean sdtype - Issue [583](https://github.com/sdv-dev/RDT/issues/583) by pvk-developer
* Setting config on HyperTransformer does not read supported_sdtypes - Issue [560](https://github.com/sdv-dev/RDT/issues/560) by pvk-developer
* https://github.com/sdv-dev/RDT/issues/545 - Issue [#545](https://github.com/sdv-dev/RDT/issues/545) by pvk-developer
* Add error to NullTransformer when data only contains nans - PR [567](https://github.com/sdv-dev/RDT/pull/567) by fealho
* Update update_transformers validation - PR [563](https://github.com/sdv-dev/RDT/pull/563) by fealho

Maintenance

* Support Python 3.10 - Issue [593](https://github.com/sdv-dev/RDT/issues/593) by pvk-developer
* RDT 1.3 Package Maintenance Updates - Issue [594](https://github.com/sdv-dev/RDT/issues/594) by pvk-developer

New Features

* Update errors - Issue [599](https://github.com/sdv-dev/RDT/issues/599) by amontanez24
* Add ability to control randomness - Issue [584](https://github.com/sdv-dev/RDT/issues/584) by amontanez24
* Printing and error improvements - Issue [581](https://github.com/sdv-dev/RDT/issues/581) by amontanez24
* Make RegexGenerator not to reset itself - Issue [558](https://github.com/sdv-dev/RDT/issues/558) by pvk-developer
* Add a reset_anonymization method - Issue [559](https://github.com/sdv-dev/RDT/issues/559) by pvk-developer
* Don't copy instances of transformer - Issue [541](https://github.com/sdv-dev/RDT/issues/541) by fealho
* Remove '.value' suffix - Issue [533](https://github.com/sdv-dev/RDT/issues/533) by fealho
* Change the NEXT_TRANSFORMERS logic - Issue [557](https://github.com/sdv-dev/RDT/issues/557) by fealho
* Add utility functions to AnonymizedFaker - Issue [561](https://github.com/sdv-dev/RDT/issues/561) by pvk-developer
* Update API for update_transformers_by_sdtype to be more explicit about instances vs. copies - Issue [540](https://github.com/sdv-dev/RDT/issues/540) by fealho
* Add create_anonymized_columns method to anonymize data from scratch - Issue [546](https://github.com/sdv-dev/RDT/issues/546) by pvk-developer
* Add parameter to AnonymizedFaker() and RegexGenerator() to generate only unique values - Issue [542](https://github.com/sdv-dev/RDT/issues/542) by pvk-developer

1.2.1

This release fixes a bug that caused the `UnixTimestampEncoder` to return data with the incorrect datetime format. It also fixes a bug that caused the null column not to be reverse transformed when using the `UnixTimestampEncoder` when the `missing_value_replacement` was not set.

Bugs

* Inconsistency in date format after reverse transform - Issue [515](https://github.com/sdv-dev/RDT/issues/515) by pvk-developer
* Fix calling null_transformer with model_missing_values. - PR [550](https://github.com/sdv-dev/RDT/pull/550) by pvk-developer

1.2.0

This release adds a new transformer called the `PseudoAnonymizedFaker`. This transformer enables the pseudo-anonymization of your data by mapping all of a column's original values to fake values that get returned during the reverse transformation process. Each original value is always mapped to the same fake value.
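The consistent-mapping idea can be shown with a short sketch. This is not the `PseudoAnonymizedFaker` itself: the `PseudoAnonymizer` class, its `anonymize` method, and the `user_NNNN` naming scheme are hypothetical, and a real transformer would draw fake values from Faker rather than from a counter. `get_mapping` mirrors the method name mentioned in the 1.3.0 notes.

```python
import itertools


class PseudoAnonymizer:
    """Toy mapper: each original value always maps to the same fake value."""

    def __init__(self):
        self._mapping = {}
        self._counter = itertools.count(1)

    def anonymize(self, values):
        result = []
        for value in values:
            if value not in self._mapping:
                # First sighting: assign a new fake value and remember it.
                self._mapping[value] = f"user_{next(self._counter):04d}"
            result.append(self._mapping[value])
        return result

    def get_mapping(self):
        # Return a copy so callers cannot mutate the internal state.
        return dict(self._mapping)


anonymizer = PseudoAnonymizer()
fake = anonymizer.anonymize(["alice", "bob", "alice"])
print(fake)                      # ['user_0001', 'user_0002', 'user_0001']
print(anonymizer.get_mapping())  # {'alice': 'user_0001', 'bob': 'user_0002'}
```

Because the mapping is stored, repeated occurrences of a value stay linked to one another after anonymization, which is what distinguishes pseudo-anonymization from generating an unrelated fake value for every row.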

Additionally, this release enables the `HyperTransformer` to use categorical transformers on boolean columns. It also introduces a new parameter called `computer_representation` to the `FloatFormatter` that will allow for values to be clipped to certain bounds based on the computer type used for a numerical column.
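Clipping to the bounds of a computer type can be sketched as follows. This is an illustration under assumptions, not the `FloatFormatter` code: the `INTEGER_BOUNDS` table, the representation names used as keys, and the `clip_to_representation` helper are hypothetical. The bounds themselves are the standard ranges of signed and unsigned fixed-width integers.

```python
# Standard ranges for fixed-width integers; the key names are illustrative.
INTEGER_BOUNDS = {
    "Int8": (-2**7, 2**7 - 1),
    "Int16": (-2**15, 2**15 - 1),
    "UInt8": (0, 2**8 - 1),
    "UInt16": (0, 2**16 - 1),
}


def clip_to_representation(values, representation):
    """Round each value to an integer, then clip it into the type's range."""
    low, high = INTEGER_BOUNDS[representation]
    return [min(max(round(value), low), high) for value in values]


data = [-300, -5.4, 12, 130.7]
print(clip_to_representation(data, "Int8"))   # [-128, -5, 12, 127]
print(clip_to_representation(data, "UInt8"))  # [0, 0, 12, 131]
```

Values inside the range pass through after rounding, while out-of-range values are pinned to the nearest bound so the result always fits the target type.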

Finally, this release patches a bug that caused unpredictable results from the `reverse_transform` method of the `FrequencyEncoder` when `add_noise` is enabled.

New Features

* Add PseudoAnonymizedFaker transformer - Issue [517](https://github.com/sdv-dev/RDT/issues/517) by pvk-developer
* Boolean columns should be able to use any of the categorical transformers - Issue [527](https://github.com/sdv-dev/RDT/issues/527) by pvk-developer
* Update FloatFormatter with parameters for the computer representation - Issue [521](https://github.com/sdv-dev/RDT/issues/521) by fealho

Bugs

* Unpredictable results for FrequencyEncoder(add_noise=True) - Issue [528](https://github.com/sdv-dev/RDT/issues/528) by fealho

Internal

* Performance Tests update - Issue [524](https://github.com/sdv-dev/RDT/issues/524) by pvk-developer
