Updates/changes:
- Renamed modules for which their name no longer reflected their true use: normalize -> random_experiments, validate -> analysis
- Completely removed normalization functions from the first iterations of KSTAR that are no longer in use
- Added catch to the pruning procedure to ensure that the code is not stopped if a kinase does not have any remaining edges, and instead keeps the kinase with fewer edges and records the error in the log.
New features:
- New functions in pruning module intended to guide users to best parameter values to use for their purposes + whether their parameter values are actually feasible.
- In addition to binarizing experiments by a threshold, you can now instead provide the desired number of phosphorylation sites to use for each sample and KSTAR will grab that number of sites with the greatest abundance (or least if greater = False)
- New function in KinaseActivity class, called test_threshold, intended to make it easier to check how a threshold value impacts the number of sites used across all samples
- Can add the number of phosphorylation sites used for each sample to a dotplot using evidence_size() function in DotPlot class
- Added new submodule in analysis module, called coverage, which is for exploring the coverage (number of sites with connections in network) of the phosphoproteome and phosphoproteomic experiments by KSTAR networks (or other kinase-substrate networks)
- Added new submodule in analysis module, called interactions, which is intended to contain functions for determining what active kinases are interacting with in the sample. Currently, contains two functions for outputting the phosphorylation sites that contributed most to a kinases activity prediction, based on the number of different networks they are predicted to interact.