Dataprofiler

Latest version: v0.13.3

Safety actively analyzes 723177 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 10

0.8.3

Profiler
* Fix req missing for typing_extensions 698
* Add profiler option for column level invalid values 704
* Updated setup.cfg mypy flags and resolved related errors. 703

Documentation
* Add Makefile to auto setup repo for developers 699
* Add PSI documentation in README.md 709

Bugs
* Fix bug with null replication metrics 702
* PSI diff() 708
* PSI diff() bug 707
* Fix bug with null replication metrics when row is all null 706
* PSI no calculation on `TextProfile` 711
* PSI fixing unit tests 712


Other Changes
* Includes mypy in pre-commit and fixes last needed updates 696
* Quick Fix: Oxford Comma in README 697
* Pre-Commit: Default setup.cfg flags 701
* Updated setup.cfg with check-manifest 705
* Updating the version to v0.8.3 710
* PSI blurb in Docs for 0.8.3 714
* Generate Docs for v0.8.3 715


**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.2post1...0.8.3

What's Changed
* Fix req missing for typing_extensions by JGSweets in https://github.com/capitalone/DataProfiler/pull/698
* Pre-Commit: Default `setup.cfg` flags by taylorfturner in https://github.com/capitalone/DataProfiler/pull/701
* Add Makefile to auto setup repo for developers by tonywu315 in https://github.com/capitalone/DataProfiler/pull/699
* Quick Fix: Oxford Comma in README by taylorfturner in https://github.com/capitalone/DataProfiler/pull/697
* Adding `PSI` to `diff` report by taylorfturner in https://github.com/capitalone/DataProfiler/pull/688
* Updated setup.cfg with check-manifest by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/705
* Fix bug with null replication metrics by tonywu315 in https://github.com/capitalone/DataProfiler/pull/702
* Add profiler option for column level invalid values by tonywu315 in https://github.com/capitalone/DataProfiler/pull/704
* PSI `diff()` bug by taylorfturner in https://github.com/capitalone/DataProfiler/pull/707
* `PSI` `diff()` by taylorfturner in https://github.com/capitalone/DataProfiler/pull/708
* Fix bug with null replication metrics when row is all null by tonywu315 in https://github.com/capitalone/DataProfiler/pull/706
* Add PSI documentation in README.md by taylorfturner in https://github.com/capitalone/DataProfiler/pull/709
* Updated setup.cfg mypy flags and resolved related errors. by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/703
* Updating the version to v0.8.3 by taylorfturner in https://github.com/capitalone/DataProfiler/pull/710
* PSI no calculation on `TextProfile` by taylorfturner in https://github.com/capitalone/DataProfiler/pull/711
* PSI fixing unit tests by taylorfturner in https://github.com/capitalone/DataProfiler/pull/712


**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.2.post1...0.8.3

0.8.2.post1

Bugs
* Fix req missing for typing_extensions 698

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.2...0.8.2.post1

What's Changed
* Fix req missing for typing_extensions by JGSweets in https://github.com/capitalone/DataProfiler/pull/698


**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.2...0.8.2.post1

0.8.2

Profiler
* added static typing to data_utils.py 662
* Added static typing to *_data classes in data_readers 677
* Adding the types of parameters and returns of functions 681
* Added static typing to data.py and filepath_or_buffer.py 682
* Fix typing and missing types 684
* Fix typing errors and missing return types 692

Documentation
* Move contribute info to CONTRIBUTING.md 683
* Fix typos, remove (unintended?) indentation 690

Bugs
* Quick Fix 680
* Fix matplotlib version requirements param 686
* fix JSON bug with data reading 691
* Includes mypy in pre-commit and fixes last needed updates 696

Other Changes
* Update ghworkflow actions to use pre-commit 687
* Updating the version to v0.8.2 694
* Generate Docs for v0.8.2 695

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.1...0.8.2

What's Changed
* Add static typing to labeler models by tonywu315 in https://github.com/capitalone/DataProfiler/pull/672
* Quick Fix by taylorfturner in https://github.com/capitalone/DataProfiler/pull/680
* Add static typing to labelers/data_processing.py by tonywu315 in https://github.com/capitalone/DataProfiler/pull/673
* Added static typing to data_readers/base_data.py and data_readers/json_data.py by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/666
* Added static typing to data.py and filepath_or_buffer.py by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/682
* Move contribute info to CONTRIBUTING.md by tonywu315 in https://github.com/capitalone/DataProfiler/pull/683
* Fix typing and missing types by tonywu315 in https://github.com/capitalone/DataProfiler/pull/684
* Fix `matplotlib` version requirements param by taylorfturner in https://github.com/capitalone/DataProfiler/pull/686
* Fix typos, remove (unintended?) indentation by bencomp in https://github.com/capitalone/DataProfiler/pull/690
* fix JSON bug with data reading by JGSweets in https://github.com/capitalone/DataProfiler/pull/691
* Fix typing errors and missing return types by tonywu315 in https://github.com/capitalone/DataProfiler/pull/692
* Adding the types of parameters and returns of functions by stefanycoimbra in https://github.com/capitalone/DataProfiler/pull/681
* Update ghworkflow actions to use pre-commit by JGSweets in https://github.com/capitalone/DataProfiler/pull/687
* Added static typing to *_data classes in data_readers by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/677
* added static typing to data_utils.py by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/662
* Updating the version to v0.8.2 by taylorfturner in https://github.com/capitalone/DataProfiler/pull/694
* Includes mypy in pre-commit and fixes last needed updates by JGSweets in https://github.com/capitalone/DataProfiler/pull/696

New Contributors
* bencomp made their first contribution in https://github.com/capitalone/DataProfiler/pull/690

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.1...0.8.2

0.8.1

Profiler
* Added static typing to data_readers/avro_data.py 657
* Added static typing to data_readers/structured_mixins.py 659
* Static Typing profiler 660
* Static Typing profilers/column profile 661
* Static typing for profilers 663
* Add static typing to data labeler and abstract classes 664
* Add static typing to labeler utils 668
* Allow diff to set format options for prepare report 669
* Replace dict constructor with dict comprehension 676

Documentation
* fix: nb issues being valid 655
* Add logo readme 670

Bugs
* fix: nb issues being valid 655
* Fix bug with loading data labeler from disk 665
* Fixes bug with empty data for DataLabeler Col 667
* Fixes 0 variance in a dataset 671

Other Changes
* Updating the version to v0.8.1 675
* Generate Docs for v0.8.1 679

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.0...0.8.1

What's Changed
* Static Typing profilers/utils.py by tonywu315 in https://github.com/capitalone/DataProfiler/pull/630
* Static Typing for Base Column Primitive Type Profilers by tonywu315 in https://github.com/capitalone/DataProfiler/pull/645
* Added static typing to data_readers/structured_mixins.py by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/659
* Static Typing profilers/profile_builder.py by tonywu315 in https://github.com/capitalone/DataProfiler/pull/643
* Static Typing profilers/numerical_column_stats.py by tonywu315 in https://github.com/capitalone/DataProfiler/pull/648
* Static Typing profiler by tonywu315 in https://github.com/capitalone/DataProfiler/pull/660
* Static Typing profilers/column profile by tonywu315 in https://github.com/capitalone/DataProfiler/pull/661
* Added static typing to data_readers/avro_data.py by Sanketh7 in https://github.com/capitalone/DataProfiler/pull/657
* Static typing for profilers by tonywu315 in https://github.com/capitalone/DataProfiler/pull/663
* Add static typing to data labeler and abstract classes by tonywu315 in https://github.com/capitalone/DataProfiler/pull/664
* Fix bug with loading data labeler from disk by tonywu315 in https://github.com/capitalone/DataProfiler/pull/665
* Fixes bug with empty data for DataLabeler Col by JGSweets in https://github.com/capitalone/DataProfiler/pull/667
* Allow diff to set format options for prepare report by JGSweets in https://github.com/capitalone/DataProfiler/pull/669
* Add static typing to labeler utils by tonywu315 in https://github.com/capitalone/DataProfiler/pull/668
* Add logo readme by taylorfturner in https://github.com/capitalone/DataProfiler/pull/670
* Fixes 0 variance in a dataset by JGSweets in https://github.com/capitalone/DataProfiler/pull/671
* Updating the version to v0.8.1 by taylorfturner in https://github.com/capitalone/DataProfiler/pull/675
* Replace dict constructor with dict comprehension by boneyag in https://github.com/capitalone/DataProfiler/pull/676

New Contributors
* Sanketh7 made their first contribution in https://github.com/capitalone/DataProfiler/pull/659
* boneyag made their first contribution in https://github.com/capitalone/DataProfiler/pull/676

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.8.0...0.8.1

0.8.0

Profiler
* DataProfiler: hotfix for handling nan values in diff 647
* Static Typing profilers/profiler_options.py 644
* refactor: validate parameters and the returns of functions 640
* Preset option in ProfileOptions 638
* GraphProfiler: add() NotImplementedError 636
* ColumnNameLabeler Setup 635
* Fix for issue 605 634
* GraphProfiler: diff() functionality 631
* Post Processor for ColumnNameModel 629
* Graph Profiler: save() and load() Functionality 628
* Quick Add: require_module in ColumnNameModel test 627
* New Data Labeler: ColumnNameModel Build 626

Documentation
* fix: nb issues being valid 655
* fix: remove extra commas 654
* DataProfiler: structured_profilers example fix 653
* DataProfiler: graph_data_demo update 649
* ColumnNameLabeler Notebook Example 646
* Update documentation README for gh-pages branch 619
* README unti testing docs 632
* Notebook Examples for DP + GE: expect_profile_numeric_columns_percent 625
* Notebook Examples for DP + GE: expect_profile_numeric_columns_diff 624
* Notebook Examples for DP + GE: expect_column_values_vs_profile 623
* Notebook Examples for DP + GE: expect_column_value_confidence 622

Other Changes
* Updating the version to v0.8.0 652
* Generate Docs for v0.8.0 656

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.7.11...0.8.0

What's Changed
* Update TF / numpy reqs, drop py3.6 by JGSweets in https://github.com/capitalone/DataProfiler/pull/614
* Notebook Examples for DP + GE: expect_column_value_confidence by micdavis in https://github.com/capitalone/DataProfiler/pull/622
* Notebook Examples for DP + GE: expect_column_values_vs_profile by micdavis in https://github.com/capitalone/DataProfiler/pull/623
* Notebook Examples for DP + GE: expect_profile_numeric_columns_diff by micdavis in https://github.com/capitalone/DataProfiler/pull/624
* Notebook Examples for DP + GE: expect_profile_numeric_columns_percent by micdavis in https://github.com/capitalone/DataProfiler/pull/625
* New Data Labeler: ColumnNameModel Build by taylorfturner in https://github.com/capitalone/DataProfiler/pull/626
* Quick Add: `require_module` in `ColumnNameModel` test by taylorfturner in https://github.com/capitalone/DataProfiler/pull/627
* Graph Profiler: save() and load() Functionality by micdavis in https://github.com/capitalone/DataProfiler/pull/628
* README unti testing docs by taylorfturner in https://github.com/capitalone/DataProfiler/pull/632
* GraphProfiler: diff() functionality by micdavis in https://github.com/capitalone/DataProfiler/pull/631
* Fix for issue 605 by vindhyanairlj in https://github.com/capitalone/DataProfiler/pull/634
* Preset option in ProfileOptions by lovleen3112 in https://github.com/capitalone/DataProfiler/pull/638
* refactor: validate parameters and the returns of functions by stefanycoimbra in https://github.com/capitalone/DataProfiler/pull/640
* GraphProfiler: add() NotImplementedError by micdavis in https://github.com/capitalone/DataProfiler/pull/636
* Post Processor for ColumnNameModel by taylorfturner in https://github.com/capitalone/DataProfiler/pull/629
* ColumnNameLabeler Setup by taylorfturner in https://github.com/capitalone/DataProfiler/pull/635
* DataProfiler: hotfix for handling `nan` values in diff by micdavis in https://github.com/capitalone/DataProfiler/pull/647
* ColumnNameLabeler Notebook Example by taylorfturner in https://github.com/capitalone/DataProfiler/pull/646
* Static Typing profilers/profiler_options.py by tonywu315 in https://github.com/capitalone/DataProfiler/pull/644
* DataProfiler: `graph_data_demo` update by micdavis in https://github.com/capitalone/DataProfiler/pull/649
* Updating the version to v0.8.0 by micdavis in https://github.com/capitalone/DataProfiler/pull/652
* DataProfiler: `structured_profilers` example fix by micdavis in https://github.com/capitalone/DataProfiler/pull/653
* fix: remove extra commas by JGSweets in https://github.com/capitalone/DataProfiler/pull/654
* fix: nb issues being valid by JGSweets in https://github.com/capitalone/DataProfiler/pull/655

New Contributors
* vindhyanairlj made their first contribution in https://github.com/capitalone/DataProfiler/pull/634
* lovleen3112 made their first contribution in https://github.com/capitalone/DataProfiler/pull/638
* stefanycoimbra made their first contribution in https://github.com/capitalone/DataProfiler/pull/640
* tonywu315 made their first contribution in https://github.com/capitalone/DataProfiler/pull/644

**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.7.11...0.8.0

0.7.11

Profiler
* Fixes GraphProfiler loading directly with class 602
* Graph loading bug 603
* Hot Fix: Typo in Graph Docstring 611
* fixes profile_schema bug for unnamed columns 612
* Include compact in fix for profile schema serialization 613

Documentation
* Add example notebook for graph data input 594
* Fix version for 0.7.10 596
* Update ReadMe to include Graph Profiler 597
* lower --> upper 598
* Add Graph Github Pages 599
* Graph Pipeline Demo notebook header fix 600
* Add GraphData to data_reader notebook 601
* formatting issue for 0.7.11 docs 617


Other Changes
* Updating the version to v0.7.11 615
* Generate Docs for v0.7.11 616


**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.7.10...0.7.11

What's Changed
* Add example notebook for graph data input by MisterPNP in https://github.com/capitalone/DataProfiler/pull/594
* Update ReadMe to include Graph Profiler by MisterPNP in https://github.com/capitalone/DataProfiler/pull/597
* lower --> upper by taylorfturner in https://github.com/capitalone/DataProfiler/pull/598
* Graph Pipeline Demo notebook header fix by taylorfturner in https://github.com/capitalone/DataProfiler/pull/600
* Fixes GraphProfiler loading directly with class by JGSweets in https://github.com/capitalone/DataProfiler/pull/602
* Graph loading bug by JGSweets in https://github.com/capitalone/DataProfiler/pull/603
* Add GraphData to data_reader notebook by taylorfturner in https://github.com/capitalone/DataProfiler/pull/601
* Hot Fix: Typo in Graph Docstring by taylorfturner in https://github.com/capitalone/DataProfiler/pull/611
* fixes profile_schema bug for unnamed columns by JGSweets in https://github.com/capitalone/DataProfiler/pull/612
* Include compact in fix for profile schema serialization by JGSweets in https://github.com/capitalone/DataProfiler/pull/613
* Updating the version to v0.7.11 by taylorfturner in https://github.com/capitalone/DataProfiler/pull/615
* formatting issue for 0.7.11 docs by taylorfturner in https://github.com/capitalone/DataProfiler/pull/617


**Full Changelog**: https://github.com/capitalone/DataProfiler/compare/0.7.10...0.7.11

Page 5 of 10

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.