Biocantor

Latest version: v0.4.5

Safety actively analyzes 624915 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.4.5

Changed
- Remove contributor license agreement, which is superseded by the MIT license.

0.4.4

Fixed
- Handle genbank files with broken intervals gracefully.
- Fix interval parsing for negative strand features.

Changed
- The tag `Name` can now be used to identify a feature interval in a GFF3/GenBank file.

0.4.3

Fixed
- `AnnotationCollection._subset_parent()` now uses `seq_chunk_to_parent` and pulls out the chromosome ID from the chromosome record.
- `CDSInterval.from_dict()` now passes along the parent provided.

Added
- `strict_parent_compare` parameter for binary set theory operations.
- `AnnotationCollection.query_by_position()` has a new boolean flag `expand_location_to_children` that defaults to False, but if set to True
will expand the interval to contain the transcripts. When False, it may be the case that transcripts will have their underlying location objects
sliced down from their original coordinates. The original coordinates are still retained as integer members.
If the query position is entirely intronic for an isoform, this isoform will have a `EmptyLocation` chunk relative location,
but will still retain a `chromosome_location`.

Changed
- Added a parent-level sequence identifier to the output of `biocantor.io.parser.seq_chunk_to_parent()`.
- Added a `strand` argument to `biocantor.io.parser.seq_chunk_to_parent()` that allows for the sequence chunk to be strand-referenced.
- `Location.parent_to_relative_location` and `Location.location_relative_to` now has a `optimize_blocks` flag that defaults to True.
If this flag is False, then these operations will not collapse adjacent or overlapping blocks.

0.4.2

Added
- ``Location.union_preserve_overlaps()`` function added. This function produces the union of intervals, while preserving all overlaps.

0.4.1

Added
- Improved docstrings on interval objects.
- Location objects now have a `full_span` optional flag on all `intersction`, `overlaps` and `contains` functions. This flag has compound intervals be treated as their full span, i.e. from start to end, regardless of compound structure. This flag defaults to `False` in all cases. When two `CompoundInterval` are compared, they are both always compared in their full spans when this flag is `True`.
- `Interval` and `IntervalCollection` objects now are capable of being lifted to arbitrary coordinate systems, returning a new copy. These operations rely on first lifting to a shared chromosomal coordinate system.

Changed
- New `SequenceType` enum stores whether interval sequences are `chromosome` or `chunk_relative`.
- All objects that accept `SequenceType` information accepts either the `SequenceType` enum OR raw strings.
- `AnnotationCollection` will look at the provided `parent_or_seq_chunk_parent` to see if the bounds of the object can be inferred from the parent object. This is only performed if no `start`/`end` are explicitly provided. If neither are provided, the bounds of the collection are the bounds of its children.
- Refactored `CDSInterval` to be based on `AbstractFeatureInterval`. Moved `CDSPhase` and `CDSFrame` to accomodate the circular import this introduced.
- All `Interval` objects are allowed to have *chromosome* parents without sequence information.
- Removed versioneer in favor of hard coded versions.


Fixed
- Some functions on Interval objects were not operating in chromosome coordinates
- `AnnotationCollection.query_by_position()` was not returning valid results if the parent was a sequence chunk.
- GFF3 parser was not inferring transcripts for a gene feature with no children.
- Fixed a bug with missing gene biotypes in GFF3 parsing.

0.4.0

Added
- All Interval objects now have the ability to be built from subsets of genome sequence (called `sequence_chunk`).
- Querying `AnnotationCollection` objects by coordinates produce new objects with sliced sequences with chunk-relative coordinates.
- Interval objects built from sequence subsets can be exported in chunk-relative coordinates to GFF3/GenBank.
- Interval objects have new coordinate translation methods that operate in chunk-relative space. Coordinate methods that operate in genomic coordinate space were retained.
- Non-transcribed feature identifier parsing looks in the `note` special field for identifiers.

Changed
- All Interval objects now must be built directly from coordinates, and do not accept Location objects.
- All Interval objects now hide their Location member. This is to avoid confusion about what coordinate system the Location may be on.
- All interval collections have `__iter__` functions that call `__iter_children()` functions.
- All Interval objects have their core `._location` object hidden, and offer two accessors -- `.chromosome_location` and `.chunk_relative_location`. Note that `.chromosome_location` will not have sequence information attached to it if a sequence chunk was used. Generally, it is advised to not access `.location` objects directly.

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.