Diive

Latest version: v0.86.0

Safety actively analyzes 723683 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 10 of 16

0.57.0

Changes

- Relaxed conditions a bit when inferring time resolution of time
series (`core.times.times.timestamp_infer_freq_progressively`, `core.times.times.timestamp_infer_freq_from_timedelta`)

Additions

- When reading parquet files, the TimestampSanitizer is applied by default to detect e.g. the time resolution
of the time series. Parquet files do not store info on time resolution like it is stored in pandas dataframes
(e.g. `30T` for 30MIN time resolution), even if the dataframe containing that info was saved to a parquet file.

Bugfixes

- Fixed bug where interactive time series plot did not show in Jupyter notebooks (`core.plotting.timeseries.TimeSeries`)
- Fixed bug where certain parts of the flux processing chain could not be used for the sensible heat flux `H`.
The issue was that `H` is calculated from sonic temperature (`T_SONIC` in EddyPro `_fluxnet_` output files),
which was not considered in function `pkgs.flux.common.detect_flux_basevar`.
- Fixed bug: interactive plotting in notebooks using `bokeh` did not work. The reason was that the `bokeh` plot
tools (controls) `ZoomInTool()` and `ZoomOutTool()` do not seem to work anymore. Both tools are now deactivated.

Notebooks

- Added new notebook for simple (interactive) time series plotting `notebooks/Plotting/TimeSeries.ipynb`
- Updated notebook `notebooks/FluxProcessingChain/FluxProcessingChain.ipynb` to version 3

0.55.0

This update focuses on the flux processing chain, in particular the creation of the extended
quality flags, the flux storage correction and the creation of the overall quality flag `QCF`.

New Features

- Added new class `StepwiseOutlierDetection` that can be used for general outlier detection in
time series data. It is based on the `StepwiseMeteoScreeningDb` class introduced in v0.50.0,
but aims to be more generally applicable to all sorts of time series data stored in
files (`pkgs.outlierdetection.stepwiseoutlierdetection.StepwiseOutlierDetection`)
- Added new outlier detection class that identifies outliers based on seasonal-trend decomposition
and z-score calculations (`pkgs.outlierdetection.seasonaltrend.OutlierSTLRZ`)
- Added new outlier detection class that flags values based on absolute limits that can be defined
separately for daytime and nighttime (`pkgs.outlierdetection.absolutelimits.AbsoluteLimitsDaytimeNighttime`)
- Added small functions to directly save (`core.io.files.save_as_parquet`) and
load (`core.io.files.load_parquet`) parquet files. Parquet files offer fast loading and saving in
combination with good compression. For more information about the Parquet format
see [here](https://parquet.apache.org/)

Additions

- **Angle-of-attack**: The angle-of-attack test can now be used during QC flag creation
(`pkgs.fluxprocessingchain.level2_qualityflags.FluxQualityFlagsLevel2.angle_of_attack_test`)
- Various smaller additions

Changes

- Renamed class `FluxQualityFlagsLevel2` to `FluxQualityFlagsLevel2EddyPro` because it is directly based
on the EddyPro output (`pkgs.fluxprocessingchain.level2_qualityflags.FluxQualityFlagsLevel2EddyPro`)
- Renamed class `FluxStorageCorrectionSinglePoint`
to `FluxStorageCorrectionSinglePointEddyPro` (
`pkgs.fluxprocessingchain.level31_storagecorrection.FluxStorageCorrectionSinglePointEddyPro`)
- Refactored creation of flux quality
flags (`pkgs.fluxprocessingchain.level2_qualityflags.FluxQualityFlagsLevel2EddyPro`)
- **Missing storage correction terms** are now gap-filled using random forest before the storage terms are
added to the flux. For some records, the calculated flux was available but the storage term was missing, resulting
in a missing storage-corrected flux (example: 97% of fluxes had storage term available, but for 3% it was missing).
The gap-filling makes sure that each flux values has a corresponding storage term and thus more values are
available for further processing. The gap-filling is done solely based on timestamp information, such as DOY
and hour. (`pkgs.fluxprocessingchain.level31_storagecorrection.FluxStorageCorrectionSinglePoint`)
- The **outlier detection using z-scores for daytime and nighttime data** uses latitude/longitude settings to
calculate daytime/nighttime via `pkgs.createvar.daynightflag.nighttime_flag_from_latlon`. Before z-score
calculation, the time resolution of the time series is now checked and assigned automatically.
(`pkgs.outlierdetection.zscore.zScoreDaytimeNighttime`)
- Removed `pkgs.fluxprocessingchain.level32_outlierremoval.FluxOutlierRemovalLevel32` since flux outlier
removal is now done in the generally applicable class `StepwiseOutlierDetection` (see new features)
- Various smaller changes and refactorings

Environment

- Updated `poetry` to newest version `v1.5.1`. The `lock` files have a new format since `v1.3.0`.
- Created new `lock` file for `poetry`.
- Added new package `pyarrow`.
- Added new package `pymannkendall` (see [GitHub](https://pypi.org/project/pymannkendall/)) to analyze
time series data for trends. Functions of this package are not yet implemented in `diive`.

Notebooks

- Added new notebook for loading and saving parquet files in `notebooks/Formats/LoadSaveParquetFile.ipynb`
- **Flux processing chain**: Added new notebook for flux post-processing
in `notebooks/FluxProcessingChain/FluxProcessingChain.ipynb`.

0.54.0

New Features

- Identify critical heat days for ecosytem flux NEE (net ecosystem exchange, based on air temperature and VPD
(`pkgs.flux.criticalheatdays.FluxCriticalHeatDaysP95`)
- Calculate z-aggregates in classes of x and y (`pkgs.analyses.quantilexyaggz.QuantileXYAggZ`)
- Plot heatmap from pivoted dataframe, using x,y,z values (`core.plotting.heatmap_xyz.HeatmapPivotXYZ`)
- Calculate stats for time series and store results in dataframe (`core.dfun.stats.sstats`)
- New helper function to load and merge files of a specific filetype (`core.io.files.loadfiles`)

Additions

- Added more parameters when formatting EddyPro _fluxnet_ file for FLUXNET
(`pkgs.formats.fluxnet.FormatEddyProFluxnetFileForUpload`)

Changes

- Removed left-over code
- Multiple smaller refactorings

Notebooks

- Added new notebook for calculating VPD in `notebooks/CalculateVariable/Calculate_VPD_from_TA_and_RH.ipynb`
- Added new notebook for calculating time series stats `notebooks/Stats/TimeSeriesStats.ipynb`
- Added new notebook for formatting EddyPro output for upload to
FLUXNET `notebooks/Formats/FormatEddyProFluxnetFileForUpload.ipynb`

0.53.3

Notebooks

- Added new notebooks for reading data files (ICOS BM files)
- Added additional output to other notebooks
- Added new notebook section `Workbench` for practical use cases

Additions

- New filetype `configs/filetypes/ICOS_H1R_CSVZIP_1MIN.yml`

0.53.2

Changes

- Added more output for detecting frequency from timeseries index (`core.times.times.DetectFrequency`)
- The associated functions have been updated accordingly: `core.times.times.timestamp_infer_freq_from_fullset`,
`core.times.times.timestamp_infer_freq_progressively`, `core.times.times.timestamp_infer_freq_from_timedelta`
- Added new notebook (`notebooks/TimeStamps/Detect_time_resolution.ipynb` )
- Added new unittest (`tests/test_timestamps.py`)

0.53.1

Changes

- **GapFinder** now gives by default sorted output, i.e. the output dataframe shows start and
end date for the largest gaps first (`pkgs.analyses.gapfinder.GapFinder`)

Notebooks

- Added new notebook for **finding gaps in time series** in `notebooks/Analyses/GapFinder.ipynb`
- Added new notebook for **time functions** in `notebooks/TimeFunctions/times.ipynb`

Other

- New repository branch `indev` is used as developement branch from now on
- Branch `main` will contain code from the most recent release

Page 10 of 16

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.