Breaking Changes:
1. There are now two kinds of random column generation methods:
   1. With a column reference. These methods are renamed from sample_xxx to rand_xxx. They behave the same as before and must be called with a reference column.
   2. Without a column reference. These new methods generate random columns without any reference, so they do not respect nulls and do not use the reference's statistics. In exchange, they are easier to use.

   Using the new methods to get a random df:
```python
import polars as pl
import polars_ds as pds

df = pds.random_data(size=100_000, n_cols=1).select(
    pds.random(0.0, 12.0).alias("uniform_1"),
    pds.random(0.0, 1.0).alias("uniform_2"),
    pds.random_exp(0.5).alias("exp"),
    pds.random_normal(0.0, 1.0).alias("normal"),
    pds.random_normal(0.0, 1000.0).alias("fat_normal"),
)
df.head()
```
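To make the difference between the two kinds of methods concrete, here is a minimal pure-Python sketch. It does not use polars_ds and is not how the library is implemented. Both helper names are hypothetical, and it assumes that "using the reference's statistics" means borrowing the reference's mean and standard deviation.

```python
import random
import statistics

def rand_normal_with_reference(ref, seed=None):
    """Hypothetical helper: one draw per non-null entry of `ref`,
    using the mean/std of `ref` (an assumption about "reference statistics")."""
    rng = random.Random(seed)
    valid = [x for x in ref if x is not None]
    mu = statistics.fmean(valid)
    sigma = statistics.stdev(valid)
    # Null positions in the reference stay null in the output.
    return [None if x is None else rng.gauss(mu, sigma) for x in ref]

def random_normal_no_reference(n, mean=0.0, std=1.0, seed=None):
    """Hypothetical helper: n draws with explicit parameters.
    No reference column, so no nulls and no borrowed statistics."""
    rng = random.Random(seed)
    return [rng.gauss(mean, std) for _ in range(n)]

ref = [1.0, None, 3.0, 5.0, None]
with_ref = rand_normal_with_reference(ref, seed=0)
no_ref = random_normal_no_reference(len(ref), seed=0)
print(with_ref)  # nulls appear at the same positions as in `ref`
print(no_ref)    # always fully populated
```

The reference-free version needs only a length and distribution parameters, which is why it is simpler to call.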
What's Changed
* Knn entropy by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/104
* diagonsis basics by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/105
* Better stats by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/106
* Add profile by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/107
* added exclude in dependency plots by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/108
* add Transfer Entropy and related measures by remiadon in https://github.com/abstractqqq/polars_ds_extension/pull/84
New Contributors
* remiadon made their first contribution in https://github.com/abstractqqq/polars_ds_extension/pull/84
**Full Changelog**: https://github.com/abstractqqq/polars_ds_extension/compare/v0.3.4-fix-release...v0.3.5
v0.3.4-fix-release
Highlights in v0.3.4 (Re-upload to PyPI)
import polars_ds as pds
You can now access the following queries by calling them directly from pds (e.g. pds.query_lstsq), and your linter will recognize them!
1. All knn related queries
2. All lstsq (linear regression) related queries
3. All graph related queries
4. Most common metrics
5. Miscellaneous queries, whose names are self-explanatory

More will be added if appropriate. In most cases, arguments to pds.func() accept either a str (column name) or a pl.Expr. For example:
```python
df.select(
    pds.query_lstsq_report(
        pl.col("x1"), "x2",  # each feature can be a str or a pl.Expr
        target="y",
        add_bias=False,
    ).alias("report")
).unnest("report")
```
```
shape: (2, 5)
┌──────────┬───────┬────────────┬────────────┬───────┐
│ feat_idx ┆ coeff ┆ std_err    ┆ t          ┆ p>|t| │
│ ---      ┆ ---   ┆ ---        ┆ ---        ┆ ---   │
│ u16      ┆ f64   ┆ f64        ┆ f64        ┆ f64   │
╞══════════╪═══════╪════════════╪════════════╪═══════╡
│ 0        ┆ 2.0   ┆ 2.3854e-16 ┆ 8.3842e15  ┆ 0.0   │
│ 1        ┆ -1.0  ┆ 9.0158e-17 ┆ -1.1092e16 ┆ 0.0   │
└──────────┴───────┴────────────┴────────────┴───────┘
```
See more in [examples/basics.ipynb](https://github.com/abstractqqq/polars_ds_extension/blob/v0.3.4/examples/basics.ipynb)
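For readers wondering what the coeff, std_err, and t columns of the report mean, the following is a rough pure-Python sketch of ordinary least squares with two features and no bias term. It is not the polars_ds implementation. The helper name and the sample data are hypothetical, and the p-value column is omitted because it needs a t-distribution CDF that the standard library does not provide.

```python
import math

def lstsq_report_2feat(x1, x2, y):
    """Hypothetical helper: OLS of y on (x1, x2) without a bias term."""
    n = len(y)
    # Entries of X^T X and X^T y for the two-column design matrix.
    s11 = sum(a * a for a in x1)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s22 = sum(b * b for b in x2)
    t1 = sum(a * c for a, c in zip(x1, y))
    t2 = sum(b * c for b, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    # Inverse of the symmetric 2x2 matrix X^T X.
    inv11, inv12, inv22 = s22 / det, -s12 / det, s11 / det
    coeffs = [inv11 * t1 + inv12 * t2, inv12 * t1 + inv22 * t2]
    # Residual variance with n - 2 degrees of freedom (two fitted coefficients).
    rss = sum((c - coeffs[0] * a - coeffs[1] * b) ** 2
              for a, b, c in zip(x1, x2, y))
    sigma2 = rss / (n - 2)
    std_errs = [math.sqrt(sigma2 * inv11), math.sqrt(sigma2 * inv22)]
    return [
        {"feat_idx": i, "coeff": c, "std_err": se, "t": c / se}
        for i, (c, se) in enumerate(zip(coeffs, std_errs))
    ]

# Made-up data: y = 2 * x1 - x2 plus a little noise, so std_err is not zero.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
noise = [0.1, -0.1, 0.05, -0.05, 0.02, -0.02]
y = [2 * a - b + e for a, b, e in zip(x1, x2, noise)]

report = lstsq_report_2feat(x1, x2, y)
for row in report:
    print(row)
```

With noise-free data the residuals vanish, which is why the table above shows standard errors near machine precision and very large t statistics.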
BREAKING
* Graph queries are refactored to take two u32 columns, node and link. A row with node = 10 and link = 2 means that node 10 has an out-edge to node 2.
* Eigenvector Centrality is temporarily removed.
* KNN's default index is now of type u32 instead of u64.
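
The node/link layout described in the first item is a plain edge list: row i encodes one directed edge node[i] -> link[i]. A small pure-Python sketch of that encoding follows. It uses made-up data and no polars_ds calls, and it checks the u32 range by hand since Python integers are unbounded.

```python
from collections import defaultdict

# Two parallel "columns": row i is the directed edge node[i] -> link[i].
node = [10, 10, 2, 3]
link = [2, 3, 3, 10]

# Both columns are meant to hold u32 values, so check the range explicitly.
assert all(0 <= v < 2**32 for v in node + link)

# Group the out-edges of each node.
out_edges = defaultdict(list)
for src, dst in zip(node, link):
    out_edges[src].append(dst)

out_degree = {src: len(dsts) for src, dsts in out_edges.items()}
print(dict(out_edges))  # {10: [2, 3], 2: [3], 3: [10]}
print(out_degree)       # {10: 2, 2: 1, 3: 1}
```

A node with several out-edges simply appears on several rows, as node 10 does here.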
What's Changed
* use uv in building env by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/87
* update readme by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/88
* Update polars by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/89
* added power transform without estimating lambda by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/91
* added tversky index by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/92
* Better ux - part 1 by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/95
* added skip_null in lstsq by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/96
* fix lstsq by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/98
* faster ols by abstractqqq in https://github.com/abstractqqq/polars_ds_extension/pull/100
Special Thanks
* Special thanks to wukan1986 for pushing me to improve lstsq.
**Full Changelog**: https://github.com/abstractqqq/polars_ds_extension/compare/v0.3.3...v0.3.4