Eda-report

Latest version: v2.8.2

Safety actively analyzes 701298 Python packages for vulnerabilities to keep your Python projects secure.

Page 2 of 5

2.7.0

What's New

- Add the `set_custom_palette`, `box_plot`, `kde_plot`, `probability_plot`, `bar_plot`, `regression_plot` and `plot_correlation` functions (See [Plotting Examples][plotting]).

- Rename `target_variable` to `groupby_data`:
- Select a more intuitive name. Target_variable is ambiguous.
- Add the groupby [-g, --groupby] cli arg.

- Update document layout:
- Center-align images and tables.
- Reduce unnecessary page-breaks.

- Replace correlation heatmap with a bar chart:

Show coloured & labeled bars of the top 20 correlated numeric variable pairs (by magnitude). Makes it much easier to notice highly correlated variables.

- Limit bivariate summaries & regression plots to 50.

Necessary since combinations blow up quickly. 50 numeric columns could easily result in a 500 page report, taking ages to prepare (`combination(50_numeric_cols, 2) == 1225` pairs, and 1 page == 2 pairs). Now only the top 50 pairs will be published (approx 25 pages).

- Configure color in each subprocess:

Update helper functions to accept color choice, and set custom palette. Spawned subprocesses (Windows & Mac currently) weren't getting the globally modified colors.

- Reduce graph image dpi from 250 to 150:

Results in smaller, but very decent images. Significantly reduces the size of report documents with many variables.

- Revise correlation interpretation.

Use R.H. Evans (1966) guide:

.00-.19 -> very weak
.20-.39 -> weak
.40-.59 -> moderate
.60-.79 -> strong
.80-1.0 -> very strong

- Fix handling of int values for `groupby` specifier:
- Int input from the cli and gui is parsed as a string, and failed the `isinstance(x, int)` test.
- The `str.isdecimal` test is more suitable here.

- Optimize tests:
- Add conftest.py.
- Add a session-level temp_data_dir fixture.

[plotting]: https://eda-report.readthedocs.io/en/latest/eda_report.plotting.html#plotting-examples

**Full Changelog**: https://github.com/Tim-Abwao/eda-report/compare/v2.6.0...v2.7.0

2.6.0

What's Changed

* Use `StringDtype` for text & mixed data. Fixes 16 (in 18)

`object` dtype causes errors when sorting mixed numeric+text data.
* Drop completely empty columns. Fixes 17

Update data validation to drop columns full of `NaN`.
Empty columns caused a fatal error by breaking analysis functions.
* Separate build & dev requirements

Add separate requirement files for docs & testing.
Update dependencies

**Full Changelog**: https://github.com/Tim-Abwao/eda-report/compare/v2.5.1...v2.6.0

2.5.1

Update the GUI

* Resize the gui to 560x320, and adjust components accordingly.
* Make the intro brief and to the point.
* Update canvas background image.

Update docs

* Add screencast of iris data report to docs.

Improve number formating.

* Round off numeric summary statistics to 4dp.
* Don't display insignificant zeros i.e show 1.0000 as 1, and 1.230 as 1.23.

Update error messages

* Display a helpful message if the tkinter module is missing (ModuleNotFoundError).
* Show expected input in file import error message.

Miscellaneous changes

* Add `[run] sigterm = true` to .coveragerc, new in coverage version 6.4.
* Rename scatter-plot to regression-plot.

*Full Changelog*: https://github.com/Tim-Abwao/eda-report/compare/v2.5.0...v2.5.1

2.5.0

What's Changed
* Run tasks in parallel using multiprocessing in https://github.com/Tim-Abwao/eda-report/pull/13.
* Rename `PlotVariable` to `UnivariatePlots`.
* Rename `PlotMultivariable` to `BivariatePlots`.
* Explicitly specify color palette in plotting functions.
* Add the content module.
* Add `AnalysisResult`.
* Move `ReportContent` from document.py to content.py.
* Customize progress bars: Describe what is happening, and make the bars line up.
* Add the `analyze_univariate` function: A convenient way to get summary statistics for one-dimensional input
* Rename [var_type]Variable to [var_type]Stats.
* Merge `BaseVariable` into `Variable`.
* Update dependencies in response to CVE-2022-24758.

**Full Changelog**: https://github.com/Tim-Abwao/eda-report/compare/v2.4.1...v2.5.0

2.4.1

Highlights

- Add the Shapiro-Wilk test for normality.

**Full Changelog**: https://github.com/Tim-Abwao/eda-report/compare/v2.4.0...v2.4.1

2.4.0

Highlights
- Add tests for normality.
- Miscellaneous improvements to report appearance.

**Full Changelog**: https://github.com/Tim-Abwao/eda-report/compare/v2.3.1...v2.4.0

Page 2 of 5

Releases

Has known vulnerabilities

Previous Next

Eda-report

Page 2 of 5

2.7.0

2.6.0

2.5.1

2.5.0

2.4.1

2.4.0

Page 2 of 5

Links

Releases