Smudgeplot

Latest version: v0.5.0.0a3

Safety actively analyzes 723144 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.4.0

- local aggregation for subtracting hetmers containing sequencing errors
- fishnet algorithm for 1n coverage fit


Known problems:
- quantification of higher ploidy smudges is not well reflecting higher coverage variance and therefore leads to "overspilling" of kmers pairs to other smudges;
- Sometimes for species with strong tetraploid signal and weak diploid signal, the fit is better for assigning the 4n smudge as 2n. This is a recognisable pattern and clear limitation of the optimisation criteria we use at the moment. We are working on fixing this.
- We removed explicit ploidy predictions, users need to make a join interpretation of smudgeplot, genomescope and prior knowledge about to species to determine the most sensible ploidy. An explicit ploidy level is our preferred solution to both problems listed above, so hopefully this functionality will return.

0.3.0

The big changes are

the search for the kmer pair will be within both canonical and non-canonical k-mer sets (Gene demonstrated it makes a difference)
the tool will be supporting FastK kmer counter only
the backend by Gene is paralelized and massively faster
the intermediate file will be a flat file with the 2d histogram with cov1, cov2, freq columns (as opposed to list of coverages of pairs cov1 cov2);
at least for now WE LOSE the ability to extract sequences of the kmers in the pair; this functionality will hopefully restore at some point together with functionality to assess the quality of assembly.
the smudge detection algorithm is under revision and a new version will be released on 18th of October 2024

0.2.5

- fixed issue with L and U being too close to each other. Smudgeplot simply creates a wild plot of the data that are fed to it regardless of being harder for interpretation (aligned with "honest data reporting" philosophy of smudgeplot, but might cause more confusion, perhpa we will add some more warnings in the future)

0.2.4

Adding a new feature `smudgeplot extract` for extracting kmer pairs from a rectangle of the smudgeplot.

Great thanks to zhenzhenyang-psu !

Documentation: https://github.com/KamilSJaron/smudgeplot/wiki/smudgeplot-extract
For usage see `smudgeplot.py extract -h`

0.2.3

This release is just to get it to Zenodo, otherwise identical to v0.2.2.

0.2.2

This version updates:
- the annotation algorithm for higher ploidy levels based on simulations.
- encourages using of our [KMC](https://github.com/tbenavi1/KMC) that speeds up the search for kmer pairs a lot
- adding a new warnings for mismatching estimates of 1n coverage by different approaches (was silent before)
- change in terminology, instead of "estimated ploidy" we say "proposed ploidy", as there is no model explicitly tested

Page 1 of 2

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.