Learnmsa

Latest version: v2.0.9

Safety actively analyzes 682361 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 4

2.0.1

Added language model support `(--use_language_model)` for significantly improved accuracy.

With this option, learnMSA aligns about 6% points more columns correct on average on HomFam benchmark than state-of-the-art tools including learnMSA without LM support.

1.3.4

Added sequence cropping to accomodate very long outlies.
Previously, learnMSA would be very slow, if the input includes sequences many times longer than the average. This release provides a fix by implementing the command line option `--crop x` to crop inputs longer than x (defaulting to 3 times the average). Cropping can be disabled (the original behavior) with `--crop disable`.
Cropping does only affect model training, not the decoded MSA which will always respect the full input.

1.3.2

Fixed missing package data for the `--use_language_model` option.

1.3.1

- improved numerical stability across TF versions
- minor bugfixing

1.3.0

- Use `pyfamsa` to align insertions, also made aligning insertions the default behavior (also added `--unaligned_insertions` flag).
- Use `biopython` for data parsing. Many more input file formats are now available as well as the experimental `indexed_data` flag for large datasets that allows constant memory model training.
- Multi GPU training works now.
- Added the highly experimental `--use_language_model` flag that uses a large, pretrained protein language model to guide the MSA and improve alignment accuracy.
- added sequence weighting

1.2.4

Another hotfix for a rare bug when aligning insertions

Page 2 of 4

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.