Malpolon

Latest version: v2.1.2

Safety actively analyzes 706267 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.2.0

New features
- **Datasets**
- Added a new dataset `geolifeclef2024_pre_extracted` following 2024 edition of Kaggle challenge [GeoLifeCLEF](https://www.kaggle.com/competitions/geolifeclef-2024/data/)
- Computed rolling `mean` and rolling `std` values of GeoLifeCLEF2024 dataset for each modality. These values are stored in this dataset's transform functions

- **Models**
- Added a new model "MultimodalEnsemble" in `geolifeclef2024_multimodal_ensemble` based on picekl work on [GeoLifeCLEF2024](https://www.kaggle.com/code/picekl/sentinel-landsat-bioclim-baseline-0-31626)

- **Scripts**
- Added new scripts `split_obs_spatially.py`, `sort_files_glc_fashion.sh`
- `split_obs_spatially.py`: splits a CSV observation dataset into a _training_ and a _val_ subset where _val_ observation plots are spatially separated from _training_ ones. This scripts uses new **`verde`** package.
- `sort_files_glc_fashion.sh`:
> This script re-organizes files in one folder into folders and sub-folders in the same way as for the GeoLifeCLEF challenge.
> That is to say in the following manner.
>
> Each file is re-arranged in folders and sub-folders in the following way:
> A file named 'ABCDWXYZ.pt' located at 'root_path/' will be moved to
> 'root_path/YZ/WX/ABCDWXYZ.pt'.
>
> Each file name must be at least 3 characters long. For instance:
> A file named 'XYZ.pt' located at 'root_path/' will be moved to
> 'root_path/YZ/X/XYZ.pt'.
- `split_obs_per_species_frequency`: splits a CSV observation dataset into a _training_ and a _val_ subset based on species frequency
- Added `split_obs_spatially.py` and `split_obs_per_species_frequency.py` scripts to Malpolon as modules in `malpolon.data.utils`

Changes
- Renamed `scripts` folder to `toolbox`
- Renamed scenarios from {"Ecologists", "Inference", "Kaggle"} to {"Custom_train", "Inference", "Benchmarks"} and re-organized experiments
- Fixed examples-related bugs, file links, duplicate files and cleaned config files
- Updated code documentation, repository READMEs and examples tutorial files

1.1.0

New features
- New dataset `ConcatPatchRasterDataset` to handle both satellite image patches and geolocalized rasters in the same model
- Added example using this new dataset
- Added standalone scripts
- `crop_rasters.py`: This script crops a window from raster files based on coordinates and outputs it as a new file.
- `split_obs_per_species_frequency.py`: This script splits an obs csv in val/train based on the frequency of occurrences in the whole dataset. It does NOT perform a spatial split.
- `split_obs_spatially.py`: This script splits an obs csv in val/train based on the observations' geographic locations using the Verde package.
- `sort_files_glc_fashion.sh`:
This script re-organizes files in one folder into folders and sub-folders in the same way as for the GeoLifeCLEF challenge. That is to say in the following manner.\
Each file is re-arranged in folders and sub-folders in the following way:
A file named 'ABCDWXYZ.pt' located at 'root_path/' will be moved to 'root_path/YZ/WX/ABCDWXYZ.pt'.\
Each file name must be at least 3 characters long. For instance:
A file named 'XYZ.pt' located at 'root_path/' will be moved to 'root_path/YZ/X/XYZ.pt'.
- Added CIFAR-10 example


Changes
- Harmonized datasets class arguments and kwargs
- Reduced examples config files values redundancy by using variable interpolation
- Changed metric logging parameters for tensorboard logger to include more details
- Fixed multilabel inference export for test_dataset

1.0.3

First release of Malpolon's framework.

Try it out now !
https://pypi.org/project/malpolon/

(Versions 1.0.0 to 1.0.2 do not exist)

Page 2 of 2

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.