This release includes a number of new features, optimizations and performance improvements:
- Default calibration files for all released modified base models, including newly released Rerio "research" models (CpG 5mC for Minion/GridION and PromethION; CpG 5mC and 5hmC model for MinION/GridION). Sequence variant calibration files are also provided for all currently released Flip-flop models ( fixes 22 )
- New output type, `mod_mappings`, to visualize per-read modified base calls in a genome browser. This output annotates per-read reference sequence with modified basecalls including confidence scores. This output can be visualized via a genome browser using existing bisulfite setting [see example here](https://nanoporetech.github.io/megalodon/file_formats.html#modified-base-mapping).
- The modified base processing steps have now been optimized to output modified basecalls in all context more efficiently. Internal tests show that all-context modified base output can now keep up with basecalling using the Guppy backend on 2 V100 GPUs.
- Updates to make model training data preparation easier. Training a new basecalling model can be performed in two command line steps (once software is successfully installed). [See documentation for this process here](https://nanoporetech.github.io/megalodon/model_training.html). New methods have been added to prepare training datasets for modified base basecalling models. These are models that specifically detect modified bases along with canonical bases. See documentation for [these new commands here)[https://nanoporetech.github.io/megalodon/modbase_training.html].
- Support for analysis of direct RNA sequencing data including the output of RNA training datasets. This does not include the release of any RNA modified base models or default calibration files.
- Megalodon helper scripts are now accessible via the command line `megalodon_extras` command. These commands have been refactored to be more user-friendly and be accessible from a standard pip or Conda installation.
- The standard sequencing summary file output by Guppy is now output from Megalodon, when the `basecalls` output is selected ( fixes 24 ).
- This release also includes various bug fixes and other optimizations ( fixes 28 ; fixes 33 )