Tacl

Latest version: v5.3.0

Safety actively analyzes 682244 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 3 of 5

4.0.3

* Removed obsolete reference to tacl-helper command in setup.py.

4.0.2

* Corrected release date for version 4.0.0.

4.0.1

* UNRELEASED

* Updated references to documentation, pointing now to
tacl.readthedocs.io.

4.0.0

* Refactored search to output results with the same columns as for
diff/intersect, and added grouping options to tacl results.

* Added add-label-work-count option to tacl results to add a column
to results giving the count of works per label.

* Moved label-count from tacl-helper to tacl results as the option
--add-label-count.

* Added tacl excise command (and excise method on Text) to remove
n-grams from a text.

* Added --excise option to tacl results, to remove any results whose
n-gram contains the supplied n-gram.

* Removed tacl-helper command. Its remaining functionality could be
duplicated through simple scripts (mostly concatenation of tacl
commands).

* Added reference to tacl-catalogue-manager project in
documentation.

* Removed support for CBETA 2011 files.

* Changed witness handling so that all witnesses are explicit.

* Added "latin" tokenizer for whitespace delimited sequences of
non-punctuation characters.

* Added check for each Result method that the supplied results have
the required columns.

* Improved error handling when tacl results is supplied either
results without the necessary columns (eg, after
collapse-witnesses), or an empty set of results.

3.0.0

* Moved useful script functions to tacl.command.utils, and added
this to the API documentation.

* Added colour to logging output from the command line scripts.

* Added validate-catalogue as a tacl-helper subcommand, to validate
a catalogue file with respect to a corpus.

* Added label-count as a tacl-helper subcommand, to add a column to
results with a count within a label for each n-gram.

* Added -c/--catalogue option to tacl ngrams, to limit the texts
added to a database to those labelled in a catalogue.

* Added experimental command, tacl-jitc, to generate a report on
overlap between pairs of texts in a sub-corpus, against the
background of a second sub-corpus.

* Modified tacl highlight command to allow for either a heatmap
display (based on a results file) or a simple highlight (based on
a file of n-grams).

* Added support for preparing and stripping CBETA TEI files from
their GitHub repository. This also changes the XML expected by the
markup stripping operation.

* Renamed "tacl report" to "tacl results", and tacl.Report to
tacl.Results. (46)

* Added --ngrams option to tacl results, to exclude results whose
n-gram occurs in the supplied list of n-grams.

* Added --min-count-text and --max-count-text options to tacl
results, to filter results that do not have at least one text
carrying an n-gram with a count within the specified range.

* Added bifurcated-extend options to "tacl results", to
generate results containing n-grams extended from the provided
results, but including only those that occur at sizes that mark a
bifurcation (change in label count between an n-gram and its
containing (n+1)-grams).

* Clarified the language used through code, comments, and docs, to
distinguish clearly between a "work" (abstract 'text', such as
T0220, distinct from any particular expression in a witness), a
"witness" (particular expression of a work), and "text" (the
actual words etc). This has changed the names of some command line
options to tacl results, and the API (Corpus.get_text is now
Corpus.get_witness, for example). The database schema has also
changed, meaning that databases must be recreated from
scratch. Any existing results will need to have the "text name"
header in the first line changed to "work". (50)

* Added display of aligned sequences in text order of one of the
texts to tacl align. (32)

2.3.2

* Added autogenerated API documentation.

* Added convenience method to Text to get the tokenized text as a
string.

* Expanded test of extend to cover more cases.

Page 3 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.