Added
- Include `seq_tensor_one_hot` in the PyTorch Parser.
- Added a `GCGCRecord.encoded_seq` property.
- New `gcgc.random` module to start holding sequence data.
- New `gcgc.rollout` module to handle working through chunks of sequences.
- `rollout_kmers` will roll out [kmers][1].
- `rollout_seq_features` will roll out the `SeqFeatures` from a `SeqRecord`.
- `EncodingAlphabet` now can optionally take a `gap_characters` set of characters to add to the
alphabet letters. It also takes `add_lower_case_for_inserts` which will duplicate the alphabet,
but convert the letters to lowercase.
Changed
Fixed
- Fixed bug in `GenomicDataset.from_path` where it still referred to `init_from_path_generator`.