Seismic-rna

Latest version: v0.24.2

Safety actively analyzes 723717 Python packages for vulnerabilities to keep your Python projects secure.

Page 2 of 13

0.22.2

Not secure

- The core algorithm of the relate step has been re-implemented in C, speeding it up by roughly 10x. The original Python version of the algorithm is still available as a fallback or for benchmarking or debugging, using the option `--relate-py`.
- The build backend has been changed from Hatch to Meson in order to build the new C extension module for relate.

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.22.1...v0.22.2

0.22.1

Not secure

Draw

- The `draw` module creates vectorized, publication-quality figures, directly from the output of SEISMIC-RNA's `fold` module.
- Initially piloted in 0.22.0, drawing can now be run automatically as part of `wf` by including the `--draw` flag.
- The `draw` command will automatically color bases according to the reactivities used by SEISMIC-RNA to constrain the structure prediction, although this behavior can be disabled with the `--no-color` flag.
- When multiple structures are predicted, only the one with the best AUROC will be drawn by default, although any structure(s) can be specified with the `--struct-num` flag.
- Many thanks to Fabrice Jossinet (fjossinet) for developing [RNArtistCore](https://github.com/fjossinet/RNArtistCore), the heart the `draw` module.
- To install RNArtistCore, first download the [latest release](https://github.com/fjossinet/RNArtistCore/releases) and save the `.jar` file in a safe place. Second, set the environmental variable `RNARTISTCORE` to the full path to the `.jar` file. It is recommended you do this by adding the line `export RNARTISTCORE="/full/path/to/your/rnartistcore-X.X.X-SNAPSHOT-jar-with-dependencies.jar"` to your .bashrc or .zshrc file.
- Installation instructions for RNArtistCore can also be found in a descriptive error message triggered by running the `draw` command when it is not installed.

Bug Fixes

- Fixed incorrect parsing of structure profiles generated from mask tables when using the draw module.
- Implemented safeguards against missing color and table files when drawing structures.

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.22.0...v0.22.1

0.22.0

Not secure

Migration

- To convert SEISMIC-RNA outputs from version 0.21 to version 0.22 (which are not compatible with each other), type `seismic migrate out`, where `out` is an output directory. You may give additional output directories as positional arguments.

Align

- The default alignment scores have been made more stringent, from `L,1,0.5` to `L,1,0.8` (local mode) and from `L,-1,-0.5` to `L,-1,-0.8` (end-to-end mode), to reduce the number of reads with huge numbers of mutations (especially indels) that could previously slip through.
- In the align report, references are now sorted from most to fewest reads.

Relate

- Insertions are now marked on either the base immediately 3' of the insertion (default) or immediately 5' (enabled with `--insert5`), rather than on both bases, to make it possible to count insertions using the existing code.
- The algorithm that finds ambiguous indels has been rewritten to simplify the code and handle more edge cases. It is consequently slower, but its simpler structure now allows the algorithm to be reimplemented in C (to make it much faster than even the original), which will be done in a subsequent version.

Mask

- The term "section" is renamed to "region" to match the terminology used by most of the field to refer to parts of sequences/genes.
- The mask step runs multiple iterations of filtering reads, then filtering positions, and repeating, to make sure the reads that are ultimately kept meet all the criteria for the positions that are ultimately kept.
- The masking process is repeated until an iteration removes no more reads/positions, no reads/positions remain at all, or an maximum number of iterations is reached (no limit by default, but can be set with `--max-mask-iter`).
- Arbitrary reads can be masked out by naming them individually with `--mask-read`, providing a text file of read names to mask out with `--mask-read-file`, or both.

Table

- Table files are now written into the relate, mask, and cluster directories instead of into a separate table directory.
- Table files are now created during the relate, mask, and cluster steps; the table command still exists but is needed only for regenerating the table files if they are ever deleted.
- When calculating the data in a table, batches are now processed in parallel, which speeds up tabulation when you have a small number of large samples.
- Specific tables can now be disabled with options, e.g. `--no-mask-pos-table`.
- Table files now have more descriptive names, e.g. `mask-position-table` instead of `mask-per-pos`.
- The "Unambiguous" column has been renamed to "Informative".
- A bug causing some errors when counting duplicate reads has been fixed.

Graph

- Graphs of data from the mask step have been renamed from `masked` to `filtered` to make it clear that the data are from the reads/positions that remained after filtering, not the ones that were masked out.

Draw

- A new command, draw, has been added to draw RNA structures using RNArtist.

Simulation

- Simulating end coordinates now uses two parameters for variance, `--center-fvar` and `--length-fvar`, so that they can be decoupled (e.g. to generate uniform-length reads of variable position).

Workflow

- Commands that are not part of the main workflow all began with a `+` to help distinguish them as auxiliary; with improved documentation, it should now be more clear to distinguish them, so the `+` has been removed to make auxiliary commands easier to type.
- To reduce the risk of rerunning an earlier step (e.g. mask) and forgetting to rerun a later step (e.g. cluster) based on the new results, an error is raised if the "Time Ended" field of the later step's report file occurs before that of the earlier step. If needed, this check can be disabled with `--no-verify-times`.

Python API

- The logger is now initialized by default when using `import seismicrna`.
- Brickle files can be loaded without a checksum by passing `checksum=""` to `BrickleIO.load()`.
- The `Dataset.load()` class method has been removed; `Dataset.__init__()` now accepts a report file and should be used instead.
- Class `Qnames` has been renamed to `ReadNames`.
- Arbitrary reads can be masked when tabulating mask and cluster datasets.
- Table classes have been renamed with full names, e.g. from `ClustPosTable` to `ClusterPositionTable`.
- Default fields for Table paths now include `cmd`, `table`, and `ext`.

What's Changed
* 0.22.0 by matthewfallan in https://github.com/rouskinlab/seismic-rna/pull/19

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.21.1...v0.22.0

0.21.1

Not secure

Bug Fixes

- Changed obsolete "debug" message to "detail" to prevent crash when using `--keep-discontig`
- Updated the help text for `--verbose` and `--quiet`

What's Changed
* 0.21.1 by matthewfallan in https://github.com/rouskinlab/seismic-rna/pull/18

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.21.0...v0.21.1

0.21.0

Not secure

New Features

- Replace Cutadapt and FastQC with fastp for one-step trimming and QC, faster processing, and a single report file.
- Replace the logging framework (using Python's standard library utilities) with a custom logging module that has seven levels (rather than 5) for greater control over verbosity.

Bug Fixes

- Fixed a bug in the jackpotting calculation that caused a systematic bias of roughly 1%.

Documentation

- Added an illustration of what happens during each main step of the workflow.
- Added a note about installing with Mamba.

Miscellaneous

- Changed the graph color palettes for bases (A/C/G/T) and relationships.
- Changed the colors of logging messages to make them easier to read on dark and light backgrounds.

What's Changed
* 0.21.0 by matthewfallan in https://github.com/rouskinlab/seismic-rna/pull/17

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.20.1...v0.21.0

0.20.1

Not secure

New Features

- The clustering step now calculates a "jackpotting quotient" to measure how much bias there is in the dataset. The calculation is expensive and can be toggled with ``--jackpot/--no-jackpot``.
- The clustering step now outputs the per-run statistics as a CSV file and a graph for every statistic, rather than writing them into the cluster report.
- It also ouputs scatter plots comparing the observed and expected counts for each number of clusters.
- The new command ``seismic +datapath`` locates and prints the location of the data path for RNAstructure (assuming it has been installed properly), to which the ``DATAPATH`` environment variable should be set.
- The ``raise_on_error`` parameter can now be used to make SEISMIC-RNA raise an Exception instead of logging an error message any time ``logger.error()`` or ``logger.critical()`` is called. This is useful for testing the whole workflow, when errors should cause the test to fail. Only available through the Python API.
- SEISMIC-RNA now has a new logo!

Bug Fixes

- Fixed a bug where clustering would run by default in ``seismic wf``. Now, clustering is enabled with the flag ``--cluster``.
- Fixed a bug causing ``seismic align`` and ``seismic wf`` to delete BAM files when aligning demultiplexed FASTQ files.
- Assume that the folding energy for RNA structures missing an explicit energy is 0 rather than NaN.
- Added ``kaleido`` to list of dependencies to fix a bug about a missing optional dependency for Plotly.
- Fixed a bug where ``seismic +sim ends`` could produce out-of-bounds 5' or 3' ends.

**Full Changelog**: https://github.com/rouskinlab/seismic-rna/compare/v0.20.0...v0.20.1

Page 2 of 13

Releases

Has known vulnerabilities

Previous Next

Seismic-rna

Page 2 of 13

0.22.2

0.22.1

0.22.0

0.21.1

0.21.0

0.20.1

Page 2 of 13

Links

Releases