Cutadapt

Latest version: v5.0

Safety actively analyzes 723177 Python packages for vulnerabilities to keep your Python projects secure.

Page 9 of 11

1.4

-----------------

* This release of cutadapt reduces the overhead of reading and writing files.
On my test data set, a typical run of cutadapt (with a single adapter) takes
40% less time due to the following two changes.
* Reading and writing of FASTQ files is faster (thanks to Cython).
* Reading and writing of gzipped files is faster (up to 2x) on systems
where the ``gzip`` program is available.
* The quality trimming function is four times faster (also due to Cython).
* Fix the statistics output for 3' colorspace adapters: The reported lengths were one
too short. Thanks to Frank Wessely for reporting this.
* Support the ``--no-indels`` option. This disallows insertions and deletions while
aligning the adapter. Currently, the option is only available for anchored 5' adapters.
This fixes issue 69.
* As a sideeffect of implementing the --no-indels option: For colorspace, the
length of a read (for ``--minimum-length`` and ``--maximum-length``) is now computed after
primer base removal (when ``--trim-primer`` is specified).
* Added one column to the info file that contains the name of the found adapter.
* Add an explanation about colorspace ambiguity to the README

1.3

-----------------

* Preliminary paired-end support with the ``--paired-output`` option (contributed by
James Casbon). See the README section on how to use it.
* Improved statistics.
* Fix incorrectly reported amount of quality-trimmed Mbp (issue 57, fix by Chris Penkett)
* Add the ``--too-long-output`` option.
* Add the ``--no-trim`` option, contributed by Dave Lawrence.
* Port handwritten C alignment module to Cython.
* Fix the ``--rest-file`` option (issue 56)
* Slightly speed up alignment of 5' adapters.
* Support bzip2-compressed files.

1.2

-----------------

* At least 25% faster processing of .csfasta/.qual files due to faster parser.
* Between 10% and 30% faster writing of gzip-compressed output files.
* Support 5' adapters in colorspace, even when no primer trimming is requested.
* Add the ``--info-file`` option, which has a line for each found adapter.
* Named adapters are possible. Usage: ``-a My_Adapter=ACCGTA`` assigns the name "My_adapter".
* Improve alignment algorithm for better poly-A trimming when there are sequencing errors.
Previously, not the longest possible poly-A tail would be trimmed.
* James Casbon contributed the ``--discard-untrimmed`` option.

1.1

-----------------

* Allow to "anchor" 5' adapters (``-g``), forcing them to be a prefix of the read.
To use this, add the special character ``^`` to the beginning of the adapter sequence.
* Add the "-N" option, which allows 'N' characters within adapters to match literally.
* Speedup of approx. 25% when reading from .gz files and using Python 2.7.
* Allow to only trim qualities when no adapter is given on the command-line.
* Add a patch by James Casbon: include read names (ids) in rest file
* Use nosetest for testing. To run, install nose and run "nosetests".
* When using cutadapt without installing it, you now need to run ``bin/cutadapt`` due to
a new directory layout.
* Allow to give a colorspace adapter in basespace (gets automatically converted).
* Allow to search for 5' adapters (those specified with ``-g``) in colorspace.
* Speed up the alignment by a factor of at least 3 by using Ukkonen's algorithm.
The total runtime decreases by about 30% in the tested cases.
* allow to deal with colorspace FASTQ files from the SRA that contain a fake
additional quality in the beginning (use ``--format sra-fastq``)

1.0

-----------------

* ASCII-encoded quality values were assumed to be encoded as ascii(quality+33).
With the new parameter ``--quality-base``, this can be changed to ascii(quality+64),
as used in some versions of the Illumina pipeline. (Fixes issue 7.)
* Allow to specify that adapters were ligated to the 5' end of reads. This change
is based on a patch contributed by James Casbon.
* Due to cutadapt being published in EMBnet.journal, I found it appropriate
to call this release version 1.0. Please see
http://journal.embnet.org/index.php/embnetjournal/article/view/200 for the
article and I would be glad if you cite it.
* Add Galaxy support, contributed by Lance Parsons.
* Patch by James Casbon: Allow N wildcards in read or adapter or both.
Wildcard matching of 'N's in the adapter is always done. If 'N's within reads
should also match without counting as error, this needs to be explicitly
requested via ``--match-read-wildcards``.

0.9.5

-------------------

* Fix issue 20: Make the report go to standard output when ``-o``/``--output`` is
specified.
* Recognize `.fq` as an extension for FASTQ files
* many more unit tests
* The alignment algorithm has changed. It will now find some adapters that
previously were missed. Note that this will produce different output than
older cutadapt versions!

Before this change, finding an adapter would work as follows:

- Find an alignment between adapter and read -- longer alignments are
better.
- If the number of errors in the alignment (divided by length) is above the
maximum error rate, report the adapter as not being found.

Sometimes, the long alignment that is found had too many errors, but a
shorter alignment would not. The adapter was then incorrectly seen as "not
found". The new alignment algorithm checks the error rate while aligning and only
reports alignments that do not have too many errors.

Page 9 of 11

Releases

Has known vulnerabilities

Previous Next

Cutadapt

Page 9 of 11

1.4

1.3

1.2

1.1

1.0

0.9.5

Page 9 of 11

Links

Releases