Dataiter

Latest version: v0.99

Safety actively analyzes 688792 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 10

0.30

=========================

* Use keyword-only arguments where appropriate – the general principle
is that mandatory arguments are allowed as positional, but optional
modifiers are keyword only
* Rename all instances of "missing" to "na", such as `Vector.is_missing`
to `Vector.is_na`, the only exception being
`ListOfDicts.fill_missing`, which becomes
`ListOfDicts.fill_missing_keys`
* Truncate data frame object and string columns at
`PRINT_TRUNCATE_WIDTH` (default 32) for printing

0.29.2

===========================

* Fix aggregation functions to work with all main data types:
boolean, integer, float, date, datetime and string
* Fix aggregation functions to handle all missing values (NaN, NaT,
blank string) correctly, the same as implemented in Vector
* Rename aggregation functions' `dropna` arguments to `drop_missing`
* `first`, `last`, `nth`: Add `drop_missing` argument
* `Vector.drop_missing`: New method

0.29.1

===========================

* `mode`: Fix to return first in case of ties (requires Python >= 3.8)
* `std`, `var`: Add `ddof` argument (defaults to 0 on account of Numba limitations)
* Don't try to dropna for non-float vectors in aggregation functions

0.29

=========================

* Add shorthand helper functions for use with `DataFrame.aggregate`,
optionally using Numba JIT-compiled code for speed
- https://dataiter.readthedocs.io/en/latest/aggregation.html
- https://dataiter.readthedocs.io/en/latest/data-frame.html#dataiter.DataFrame.aggregate
- https://dataiter.readthedocs.io/en/latest/dataiter.html
* `DataFrame.map`: New method
* `ncol`: Removed
* `nrow`: Deprecated in favor of `dataiter.count`
* `read_csv`: New alias for `DataFrame.read_csv`
* `read_geojson`: New alias for `GeoJSON.read`
* `read_json`: New alias for `ListOfDicts.read_json`
* `read_npz`: New alias for `DataFrame.read_npz`

0.28

=========================

* `DataFrame`: Make object columns work in various operations
* `DataFrame.from_json`: Add arguments `columns` and `dtypes`
* `DataFrame.from_pandas`: Add argument `dtypes`
* `DataFrame.full_join`: Speed up
* `DataFrame.read_csv`: Add argument `dtypes`
* `DataFrame.read_json`: Add arguments `columns` and `dtypes`
* `GeoJSON.read`: Add arguments `columns` and `dtypes`
* `ListOfDicts.fill_missing`: New method
* `ListOfDicts.from_json`: Add arguments `keys` and `types`
* `ListOfDicts.full_join`: Speed up
* `ListOfDicts.read_csv`: Add argument `types`, rename `columns` to `keys`
* `ListOfDicts.read_json`: Add arguments `keys` and `types`

0.27

=========================

* `DataFrame`: Fix error message when column not found
* `DataFrame.aggregate`: Speed up
* `DataFrame.full_join`: Fix to join all possible columns
* `DataFrame.read_csv`: Try to avoid mixed types
* `ListOfDicts.full_join`: Fix to join all possible keys
* `ListOfDicts.write_csv`: Use minimal quoting
* `Vector.get_memory_use`: New method
* `Vector.rank`: Rewrite, add `method` argument
* `*.read_*`: Rename `fname` argument `path`
* `*.write_*`: Rename `fname` argument `path`
* Add comparison table dplyr vs. Dataiter vs. Pandas to documentation:
<https://dataiter.readthedocs.io/en/latest/comparison.html>

Page 5 of 10

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.