- new switch "iter_mode" for explicitly choosing iteration mode (PMID vs. directory vs. collection), rather than a flag for overriding an implicit default
- new iter_mode "collection", which produces a single output file for a collection of articles
- faster term look-up (using two dicts now)
- term matching is based on a special (non-NLTK), customisable tokenisation
- additional termlist fields are exported to TSV and XML as well
- separate flags for controlling output details (include\_headers in TSV, sentence\_level annotation in BioC), instead of the overloaded pretty\_print flag
- encoding: all non-XML input and output documents are now always read and written as UTF-8, rather than in whatever encoding the locale specifies
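
To illustrate the encoding change in the last item: in Python, a text file opened without an explicit `encoding` argument falls back to a locale-dependent default, so the same script can behave differently across machines. The sketch below shows the general pattern of pinning the encoding to UTF-8. The helper names `write_text` and `read_text` are hypothetical and chosen for this example only; they are not taken from the project's code.

```python
def write_text(path, text):
    """Write text as UTF-8, regardless of the system locale (hypothetical helper)."""
    # Passing encoding explicitly bypasses the locale-dependent default.
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)


def read_text(path):
    """Read a UTF-8 text file, regardless of the system locale (hypothetical helper)."""
    with open(path, encoding="utf-8") as f:
        return f.read()
```

With helpers like these, a term such as `β-catenin` round-trips identically whether the locale is set to `C`, Latin-1, or UTF-8.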