* Moved useful script functions to tacl.command.utils, and added
this to the API documentation.
* Added colour to logging output from the command line scripts.
* Added validate-catalogue as a tacl-helper subcommand, to validate
a catalogue file with respect to a corpus.
* Added label-count as a tacl-helper subcommand, to add a column to
results with a count within a label for each n-gram.
* Added -c/--catalogue option to tacl ngrams, to limit the texts
added to a database to those labelled in a catalogue.
* Added an experimental command, tacl-jitc, to generate a report on
overlap between pairs of texts in a sub-corpus, against the
background of a second sub-corpus.
* Modified tacl highlight command to allow for either a heatmap
display (based on a results file) or a simple highlight (based on
a file of n-grams).
* Added support for preparing and stripping CBETA TEI files from
their GitHub repository. This also changes the XML expected by the
markup stripping operation.
* Renamed "tacl report" to "tacl results", and tacl.Report to
tacl.Results. (46)
* Added --ngrams option to tacl results, to exclude results whose
n-gram occurs in the supplied list of n-grams.
* Added --min-count-text and --max-count-text options to tacl
results, to exclude results for n-grams that do not have at least
one text in which the n-gram's count falls within the specified
range.
* Added bifurcated-extend options to "tacl results", to
generate results containing n-grams extended from the provided
results, but including only those that occur at sizes that mark a
bifurcation (change in label count between an n-gram and its
containing (n+1)-grams).
* Clarified the language used throughout the code, comments, and
docs, to distinguish clearly between a "work" (abstract 'text', such
as T0220, distinct from any particular expression in a witness), a
"witness" (particular expression of a work), and "text" (the
actual words etc.). This has changed the names of some command line
options to tacl results, and the API (Corpus.get_text is now
Corpus.get_witness, for example). The database schema has also
changed, meaning that databases must be recreated from
scratch. Any existing results will need to have the "text name"
header in the first line changed to "work". (50)
* Modified tacl align to display aligned sequences in the text order
of one of the texts. (32)
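
The header change noted above for existing results files (the "text name" header becoming "work") can be applied with a short script. The following is a minimal sketch, not part of tacl itself: it assumes results files are comma-separated with the column names on the first line, and the function name rename_header and the sample column names in the usage comment are hypothetical.

```python
import csv
import io

# Header names taken from the changelog entry; everything else in this
# script is an illustrative assumption, not tacl's own code.
OLD_HEADER = "text name"
NEW_HEADER = "work"


def rename_header(in_stream, out_stream):
    """Copy CSV rows from in_stream to out_stream, renaming the
    "text name" column in the first line to "work".

    All other header fields and every data row are copied unchanged.
    """
    reader = csv.reader(in_stream)
    writer = csv.writer(out_stream, lineterminator="\n")
    try:
        header = next(reader)
    except StopIteration:
        # Empty input: nothing to convert.
        return
    writer.writerow(
        [NEW_HEADER if name == OLD_HEADER else name for name in header]
    )
    writer.writerows(reader)


# Usage on files (paths are placeholders):
#
#     with open("old-results.csv", newline="") as src, \
#             open("new-results.csv", "w", newline="") as dst:
#         rename_header(src, dst)
```

Writing to a new file rather than editing in place keeps the original results available in case the converted file needs to be checked against them.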