Pybaselines

Latest version: v1.1.0

Safety actively analyzes 687767 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

1.1.0

==========================

This is a minor version with new features, deprecations,
and documentation improvements.

New Features
-------------

* Added two dimensional versions of various baseline correction algorithms,
with the focus on Whittaker-smoothing-based, spline, and polynomial methods.
These can be accessed using the new `Baseline2D` class.
* Added the `Baseline2D.individual_axes` method, which allows fitting each row and/or
column in two dimensional data with any one dimensional method in pybaselines.
* Added a version of the rubberband method to pybaselines.misc which allows fitting
individual segments within data to better fit concave-shaped data.
* Added the Customized Baseline Correction (custom_bc) method to
pybaselines.optimizers, which allows fitting baselines with controllable
levels of stiffness in different regions.
* Added a penalized version of mpls (pspline_mpls) to pybaselines.spline.
* Updated spline.mixture_model to use expectation-maximization rather than the previous
nieve approach of fitting the histogram of the residuals with the probability density
function. Should reduce calculation times.
* Added a function for penalized spline (P-spline) smoothing to pybaselines.utils,
`pybaselines.utils.pspline_smooth`, which will return a tuple of the smoothed input and
the knots, spline coefficients, and spline degree for any further use with
SciPy's BSpline.

Other Changes
-------------

* Officially list Python 3.12 as supported.
* Updated lowest supported Python version to 3.8
* Updated lowest supported dependency versions: NumPy 1.20, SciPy 1.5,
pentapy 1.1, and Numba 0.49
* Use SciPy's sparse arrays when the installed SciPy version is 1.12 or newer. This
only affects user codes if using functions from the pybaselines.utils module.
* Vendor SciPy's cwt and ricker functions, which were deprecated from SciPy in version 1.12.

Deprecations/Breaking Changes
-----------------------------

* Deprecated passing `num_bins` to spline.mixture_model. The keyword argument will
be removed in version 1.3.
* Removed the pybaselines.config module, which was simply used to set the pentapy solver.
The same behavior can be done by setting the `pentapy_solver` attribute of a `Baseline`
object after initialization.

Documentation/Examples
----------------------

* Added a section of the documentation explaining the extension of baseline correction for
two dimensional data.
* Added new examples for 2D baseline correction and for custom_bc.

1.0.0

==========================

This is a major version with new features, bug fixes, deprecations,
and documentation improvements.

New Features
------------

* Added a new class-based api for all algorithms, which can be accessed by using
the `pybaselines.api.Baseline` class. All algorithms are available as methods of
the `Baseline` class. The functional api from earlier versions is also maintained
for backwards compatibility.
* All functions now allow inputting an `x_data` keyword, even if it is not used within
the function, to allow for a more consistent api. Likewise, `pybaselines.misc.interp_pts`
added an unused `data` keyword. Now, all algorithms can be called with
the signature `baseline_algorithm(data=y_data, x_data=x_data, ...)`.
* Added a function for Whittaker smoothing to pybaselines.utils,
`pybaselines.utils.whittaker_smooth`.
* whittaker.iasls and spline.psline_iasls now allow inputting a `diff_order` parameter.

Bug Fixes
---------

* Fixed the addition of the penalty difference diagonals in spline.pspline_drpls, which
was incorrectly treating the penalty diagonals as lower banded rather than fully banded.

Other Changes
-------------

* Officially list Python 3.11 as supported.
* Added default `half_window` values for snip and noise_median.
* collab_pls accomodates `alpha` for aspls and pspline_aspls; the `alpha` parameter is
calculated for the entire dataset in the same way as the weights and is then fixed when
fitting each of the individual data entries.
* Improved input validation.
* Improved testing base classes to reduce copied code and improve test coverage.
* Improved code handling for banded systems and penalized splines to simplify internal code.

Deprecations/Breaking Changes
-----------------------------

* Removed the ability to pass addtional keyword arguments to algorithms in
pybaselines.optimizers, which was deprecated in version 0.8.0.
* Removed the deprecated pybaselines.window module, which was formally deprecated in version 0.8.
* Moved the `PENTAPY_SOLVER` constant from pybaselines.utils to the new pybaselines.config module.

Documentation/Examples
----------------------

* Added citation guidelines to make it easier to cite pybaselines.
* Added new examples showing how to use the new `Baseline` class.
* Added a new example examining the `beads` algorithm.

0.8.0

==========================

This is a minor version with new features, bug fixes, deprecations,
and documentation improvements.

New Features
------------

* Added more efficient ways for creating the spline basis, and now solve penalized
spline equations as a banded system rather than as a sparse system. Compared to
version 0.7.0, spline.mixture_model, spline.irsqr, and morphological.mpspline are
~60-90% faster when numba is installed and ~10-70% faster without numba.
* Made several calculations in spline.mixture_model more efficient, further reducing the
time by ~60-70% compared to the timings above without numba. The total time reduction
from version 0.7.0 for spline.mixture_model without numba is ~50-90%.
* Added penalized spline versions of all Whittaker-smoothing-based algorithms
(pspline_asls, pspline_iasls, pspline_airpls, pspline_arpls, pspline_drpls, pspline_iarpls,
pspline_aspls, pspline_psalsa, and pspline_derpsalsa) to pybaselines.spline.

Bug Fixes
---------

* Was not multiplying the penalty in whittaker.iasls by `lam_1`.
* The output weights for polynomial.quant_reg and polynomial.loess are now squared
before returning since the square root of the weights are used internally.
* The output weights and polynomial coefficients (if `return_coef` is True) for
polynomial.loess are now sorted to match the original order of the input x-values.
* The output weights for optimizers.optimize_extended_range are now truncated and
sorted before returning to match the original order and length of the input x-values.
* smooth.noise_median now works with a `smooth_half_window` value of 0 to give no smoothing.

Other Changes
-------------

* Officially list Python 3.10 as supported.
* pybaselines is now available to install using conda from the conda-forge channel.
* Changed a factor in the weighting for whittaker.aspls to better match the
implementation in literature.
* Allow inputting x-values for all penalized spline functions rather than assuming
evenly spaced measurements.
* optimizers.adaptive_minmax now allows separate `constrained_fraction` and
`constrained_weight` values for for the left and right edges.
* The error raised by optimizers.collab_pls if the input data is not 2-dimensional
is now more explicit.

Deprecations/Breaking Changes
-----------------------------

* No longer allow negative or array-like values for the penalty multipliers in
Whittaker-smoothing-based functions, penalized spline functions, morphological.jbcd,
or misc.beads. Array-like penalty values are technically valid; however, they change the
symmetry of the banded linear system, so additional code will have to be added in a
later version to ensure the setup is correct before re-allowing array-like values.
* Deprecated passing keyword arguments to all functions in pybaselines.optimizers.
Passing additional keyword arguments will raise an error starting in version 0.10.0
or 1.0.0, whichever comes first (the same deprecation for optimize_extended_range made
in version 0.7.0 is also pushed back to 0.10.0 or 1.0.0).
* For spline algorithms, the min and max x-values are now included as inner knots when
creating the spline basis rather than counting them as the first outer knots. To match
the number of knots from previous versions, the `num_knots` parameter should add 2 to
the `num_knots` used in previous versions.
* Formally deprecated pybaselines.window, which was replaced by pybaselines.smooth in
version 0.6.0. pybaselines.window will be removed in version 1.0.
* Removed optimize_window from pybaselines.morphological, which was deprecated in
version 0.6.0
* Removed the code for allowing array-like `half_window` or `smooth_half_window` values
for morphological.rolling_ball, which was deprecated in version 0.7.0.

Documentation/Examples
----------------------

* Added more examples to the documentation for fitting noisy data and exploring
penalized spline parameters.
* Added an introduction for the splines category in the algorithms section of the
documentation.

0.7.0

==========================

This is a minor version with new features, bug fixes, deprecations,
and documentation improvements.

Notice: beginning in version 0.8.0, a DeprecationWarning will be emitted
when using any function from the pybaselines.window module. Use the
pybaselines.smooth module instead.

New Features
------------

* Added the range independent algorithm (ria) to pybaselines.smooth, which extends
the left and/or right edges, similar to optimizers.optimize_extended_range, and
iteratively smooths until the area of the extended regions is recovered.
* Added the joint baseline correction and denoising algorithm (jbcd) to
pybaselines.morphological, which uses regularized least-squares fitting combined
with morphological operations to simultaneously obtain the baseline and denoised signal.
* Added the iterative polynomial smoothing algorithm (ipsa) to pybaselines.smooth, which
iteratively smooths the input data using a second-order Savitzky–Golay filter.
* Added the continuous wavelet transform baseline recognition algorithm (cwt_br) to
pybaselines.classification, which uses a continuous wavelet transform to classify
the baseline points and iterative polynomial fitting to create the baseline.
* Added the fully automatic baseline correction algorithm (fabc) to
pybaselines.classification, which is very similar to classification.dietrich, except
that it uses a continuous wavelet transform to estimate the derivative and fits the
baseline using Whittaker smoothing.
* Added a `min_length` parameter to most classification algorithms, which allows
discarding any values in the baseline mask where the number of consecutive points
designated as baseline is less than `min_length`, making the algorithms more robust.
* The `threshold` for polynomial.fastchrom can now be a Callable to allow the user to
define their own thresholding functions based on the rolling standard deviation
distribution.
* Allow optimizers.optimize_extended_range to use spline (mixture_model, irsqr)
and classification (dietrich, cwt_br, fabc) functions.
* Allow optimizers.collab_pls to use spline functions (mixture_model, irsqr).

Bug Fixes
---------

* Increased the minimum scipy version to 1.0 in order to use the BLAS function
gbmv (dot product of a banded matrix and vector) for misc.beads.
* Use stable sorting when sorting the x-values for polynomial.loess and
optimizers.optimize_extended_range to ensure that the sorting is correct.
* Fixed an issue when specifying `output` with scipy.ndimage.uniform_filter1d in scipy
versions before version 1.1.0.
* Fixed an issue using `dtype` with numpy.arange in a numba jit wrapped function, which
was not introduced until numba version 0.47.
* Fixed an indexing error in spline.corner_cutting which would give an erroneous index
at which the maximum area removal occurred.
* Fixed an issue that occurred when inputting weights into spline.mixture_model.
* If weights are input into optimizers.optimize_extended_range as keyword arguments,
the weights are now correctly sorted to match the sorting of the x-values and padded
to account for the added portions on the left and/or right edges before using in the
fitting function.
* Fixed the output of utils.padded_convolve when the kernel was even shaped (which
never happens in actual application in pybaselines) or larger than the data.
* Fixed an issue caused by using an `extrapolate_window` of 1 for utils.pad_edges,
or an `extrapolate_window` of 0 or 1 for utils._get_edges (called by
optimizers.optimize_extended_range).

Other Changes
-------------

* Use scipy's expit function for whittaker.arpls and aspls, which does not emit the
warning for exponential overflow. The warning was not needed since the overflow
ultimately makes weights of 0 for the two functions.
* Use np.gradient for the computed derivatives in derpsalsa and dietrich, which gives
slightly less noisy derivatives than the finite difference used by np.diff.
* Only sort x-values if they are given for polynomial.loess and
optimizers.optimize_extended_range, which saves a little time otherwise.
* Made whittaker.airpls error handling more robust in order to catch errors from the
solvers as well, which should catch any errors not prevented by checking the residual's
length.
* Allow the `mode` for utils.pad_edges to be a callable padding function,
matching numpy.pad's behavior.
* Added `tol_history` to the output parameters of classification.dietrich.
* Switched to using Scipy's convolve over Numpy's. Scipy's convolve can choose between
the direct convolution, which is always used by Numpy, or an FFT based convolution,
which is significantly faster for large arrays.
* Added testing for the minimum supported versions of all dependencies to
the project's continuous integration in order to ensure that the minimum
stated dependencies actually work.
* Allow specifying two separate extrapolate windows when padding using
utils.pad_edges to allow better flexibility for fitting the edges.

Deprecations/Breaking Changes
-----------------------------

* Deprecated allowing passing additional keyword arguments to optimizers.optimize_extended_range
since the `pad_kwargs` parameter is used by both the optimize_extended_range function
and the internal functions it supports. Now, all keyword arguments should be placed in
the `method_kwargs` dictionary. Passing additional keyword arguments will raise
an error starting in version 0.9.0.
* Deprecated allowing an array for the `half_window` or `smooth_half_window` parameters in
morphological.rolling_ball. While the array-based moving min/max functions were valid,
when combined for the morphological opening, the output would produce invalid results
where the opening values were greater than the input data, which should not be allowed by
the actual morphological opening. Using an array `half_window` will raise an error in
version 0.8.0.

Documentation/Examples
----------------------

* Added several new examples that explore different aspects of pybaselines.
* Use sphinx-gallery to display the example programs' code and outputs within
the documentation.

0.6.0

==========================

This is a minor version with new features, bug fixes, deprecations,
and documentation improvements.

New Features
------------

* Added goldindec to pybaselines.polynomial, which uses a non-quadratic cost
function with a shrinking threshold to fit the baseline.
* Added the morphological penalized spline (mpspline) algorithm to
pybaselines.morphological, which uses morphology to identify baseline points
and then fits the points using a penalized spline.
* Added the derivative peak-screening asymmetric least squares algorithm (derpsalsa)
to pybaselines.whittaker, which includes additional weights based on the first and
second derivatives of the data.
* Added the fastchrom algorithm to pybaselines.classification, which identifies baseline
points as where the rolling standard deviation is less than the specified threshold.
* Added the module pybaselines.spline, which contains algorithms that use splines
to create the baseline.
* Added the mixture model algorithm (mixture_model) to pybaselines.spline, which uses
a weighted penalized spline to fit the baseline, where weights are calculated based
on the probability each point belongs to the noise.
* Added iterative reweighted spline quantile regression (irsqr) to pybaselines.spline,
which uses penalized splines and iterative reweighted least squares to perform
quantile regression on the data.
* Added the corner-cutting algorithm (corner_cutting) to pybaselines.spline, which
iteratively removes corner points and then fits a quadratic Bezier spline with the
remaining points.

Bug Fixes
---------

* Fixed an issue with utils.pad_edges when `mode` was "extrapolate" and `extrapolate_window`
was 1.

Other Changes
-------------

* Increased the minimum SciPy version to 0.17 in order to use bounds with
scipy.optimize.curve_fit.
* Changed the default `extrapolate_window` value in pybaselines.utils.pad_edges to
the input window length, rather than ``2 * window length + 1``.
* Slightly sped up pybaselines.optimizers.adaptive_minmax when `poly_order` is
None by using the numpy array's min and max methods rather than the built-in
functions.

Deprecations/Breaking Changes
-----------------------------

* Renamed pybaselines.window to pybaselines.smooth to make its usage more
clear. Using pybaselines.window will still work for now, but will begin emitting
a DeprecationWarning in a later version (maybe version 0.8 or 0.9) and will
be removed shortly thereafter.
* Removed the constant utils.PERMC_SPEC that was deprecated in version 0.4.1.
* Deprecated the function pybaselines.morphological.optimize_window, which will
be removed in version 0.8.0. Use pybaselines.utils.optimize_window instead.

Documentation/Examples
----------------------

* Fixed the plot for morphological.mpls in the documentation.
* Fixed the weighting formula for whittaker.arpls in the documentation.
* Fixed a typo for the cost function in the docstring of misc.beads.
* Updated the example program for all of the newly added algorithms.

0.5.1

==========================

This is a minor patch with bug fixes and minor changes.

Bug Fixes
---------

* Added classification to the main pybaselines namespace so that calling
pybaselines.classification works correctly.

Other Changes
-------------

* Changed the default `tol` for pybaselines.polynomial.quant_reg to 1e-6
to get better results.
* Directly use the input `eps` value for pybaselines.polynomial.quant_reg
rather than its square.

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.