Added
- Automatically download and use any standard index by name. Example: `hostile clean --index human-t2t-hla-argos985 --fastq1 reads.fq.gz`. (28)
- Two new standard indexes:
- `human-t2t-hla.rs-viral-202401_ml-phage-202401` (RefSeq viral & Millard Lab phage)
- `human-t2t-hla.argos-bacteria-985_rs-viral-202401_ml-phage-202401` (ARGOS bacteria, RefSeq viral, & Millard Lab phage)
- Improved masking protocol for custom index creation (`hostile mask`):
- Uses `minimap2` with adjusted secondary alignment limits and score thresholds.
- K-merises target genomes into 150mers (10bp step) before alignment. Adds `dnaio` dependency.
- New options for `hostile clean`: `--aligner-args`, `--invert`, and `--offline`.
- Verification of downloaded index checksums in `manifest.json`. (20)
- Adds `version`, `aligner`, and `options` fields to log output.
Changed
- Prevents corruption of empty `fastq.gz` files when no reads remain after decontamination. (24)