Nannyml

Latest version: v0.12.1

Safety actively analyzes 683322 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 7

0.10.4

Changed

- We've changed the defaults for the `incomplete` parameter in the `SizeBasedChunker` and `CountBasedChunker`
to `keep` from the previous `append`. This means that from now on, by default, you might have an additional
"incomplete" final chunk. Previously these records would have been appended to the last "complete" chunk.
This change was required for some internal developments, and we also felt it made more sense when looking at
continuous monitoring (as the incomplete chunk will be filled up later as more data is appended). [(367)](https://github.com/NannyML/nannyml/issues/367)
- We've renamed the *Classifier for Drift Detection (CDD)* to the more appropriate *Domain Classifier*. [(368)](https://github.com/NannyML/nannyml/issues/368)
- Bumped the version of the `pyarrow` dependency to `^14.0.0` if you're running on Python 3.8 or up.
Congrats on your first contribution here [amrit110](https://github.com/amrit110), much appreciated!

Fixed

- Continuous distribution plots will now be scaled per chunk, as opposed to globally. [(369)](https://github.com/NannyML/nannyml/issues/369)

0.10.3

Fixed

- Handle median summary stat calculation failing due to NaN values
- Fix standard deviation summary stat sampling error calculation occasionally returning infinity [(363)](https://github.com/NannyML/nannyml/issues/363)
- Fix plotting confidence bands when value gaps occur [(364)](https://github.com/NannyML/nannyml/issues/364)

Added

- New multivariate drift detection method using a classifier and density ration estimation.

0.10.2

Changed

- Removed p-value based thresholds for Chi2 univariate drift detection [(349)](https://github.com/NannyML/nannyml/issues/349)
- Change default thresholds for univariate drift methods to standard deviation based thresholds.
- Add summary stats support to the Runner and CLI [(353)](https://github.com/NannyML/nannyml/issues/353)
- Add unique identifier columns to included datasets for better joining [(348)](https://github.com/NannyML/nannyml/issues/348)
- Remove unused `confidence_deviation` properties in CBPE metrics [(357)](https://github.com/NannyML/nannyml/issues/357)
- Improved error handling: failing metric calculation for a single chunk will no longer stop an entire calculator.

Added

- Add feature distribution calculators [(352)](https://github.com/NannyML/nannyml/issues/352)

Fixed

- Fix join column settings for CLI [(356)](https://github.com/NannyML/nannyml/issues/356)
- Fix crashes in `UnseenValuesCalculator`

0.10.1

- Various small fixes to the docs, thanks once again ghostwriter [NeoKish](https://github.com/NeoKish)! [(#345)](https://github.com/NannyML/nannyml/issues/345)
- Fixed an issue with estimated accuracy for multiclass classification in CBPE. [(346)](https://github.com/NannyML/nannyml/issues/346)

0.10.0

Changed

- Telemetry now detects AKS and EKS and NannyML Cloud runtimes. [(325)](https://github.com/NannyML/nannyml/issues/325)
- Runner was refactored, so it can be extended with premium NannyML calculators and estimators. [(325)](https://github.com/NannyML/nannyml/issues/325)
- Sped up telemetry reporting to ensure it doesn't hinder performance.
- Some love for the docs as [santiviquez](https://github.com/santiviquez) tediously standardized variable names. [(#338)](https://github.com/NannyML/nannyml/issues/338)
- Optimize calculations for L-infinity method. [[(340)](https://github.com/NannyML/nannyml/issues/340)]
- Refactored the `CalibratorFactory` to align with our other factory implementations. [[(341)](https://github.com/NannyML/nannyml/issues/341)]
- Updated the `Calibrator` interface with `*args` and `**kwargs` for easier extension.
- Small refactor to the `ResultComparisonMixin` to allow easier extension.

Added

- Added support for directly estimating the confusion matrix of multiclass classification models using CBPE.
Big thanks to our appreciated alumnus [cartgr](https://github.com/cartgr) for the effort (and sorry it took soooo long). [(#287)](https://github.com/NannyML/nannyml/issues/287)
- Added `DatabaseWriter` support for results from `MissingValuesCaclulator` and `UnseenValuesCalculator`. Some
excellent work by [bgalvao](https://github.com/bgalvao), thanks for being a long-time user and supporter!


Fixed

- Fix issues with calculation and filtering in performance calculation and estimation. [(321)](https://github.com/NannyML/nannyml/issues/321)
- Fix multivariate reconstruction error plot labels. [(323)](https://github.com/NannyML/nannyml/issues/323)
- Log a warning when performance metrics for a chunk will return `NaN` value. [(326)](https://github.com/NannyML/nannyml/issues/326)
- Fix issues with ReadTheDocs build failing
- Fix erroneous `specificity` calculation, both realized and estimated. Well spotted [nikml](https://github.com/nikml)! [(#334)](https://github.com/NannyML/nannyml/issues/334)
- Fix threshold computation when dealing with `NaN` values. Major thanks to the eagle-eyed [giodavoli](https://github.com/giodavoli). [(#333)](https://github.com/NannyML/nannyml/issues/333)
- Fix exports for confusion matrix metrics using the `DatabaseWriter`. An inspiring commit that lead to some other changes.
Great job [shezadkhan137](https://github.com/shezadkhan137)! [(#335)](https://github.com/NannyML/nannyml/issues/335)
- Fix incorrect normalization for the business value metric in realized and estimated performance. [(337)](https://github.com/NannyML/nannyml/issues/337)
- Fix handling `NaN` values when fitting univariate drift. [[(340)](https://github.com/NannyML/nannyml/issues/340)]

0.9.1

Changed

- Updated Mendable client library version to deal with styling overrides in the RTD documentation theme
- Removed superfluous limits for confidence bands in the CBPE class (these are present in the metric classes instead)
- Threshold value limiting behaviour (e.g. overriding a value and emitting a warning) will be triggered not only when
the value crosses the threshold but also when it is equal to the threshold value. This is because we interpret the
threshold as a theoretical maximum.

Added

- Added a new example notebook walking through a full use case using the NYC Green Taxi dataset, based on the blog of [santiviquez](https://github.com/santiviquez)

Fixed

- Fixed broken Docker container build due to changes in public Poetry installation procedure
- Fixed broken image source link in the README, thanks [NeoKish](https://github.com/NeoKish)!

Page 2 of 7

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.