Folia-tools

Latest version: v2.5.6


2.3.0

* The **tei2folia** converter has been extended to support more of TEI
* Implements conversion of tokens, sentences and simple linguistic annotation (pos, lemma, join, msd) (12 13)
* Better document ID detection: prefer DOI, then ISSN, then ISBN, then DTADirName (specific to Deutsches Text Archiv), and fall back to an untyped identifier, checking that the result is sane (12)
* Implemented conversion of the norm attribute (not sure whether this is entirely according to the TEI P5 spec, but Deutsches Text Archiv uses it)
* Benefit from some of the newly allowed structural nestings in folia v2.3
* Implemented handling for tei:trailer and some other elements
* Ignore styling that is wrapped around structural elements (for now)
* Added extra sanity checks
* foliavalidator can now output **explicit form** (proycon/folia84). Explicit form is a more verbose XML serialisation that spells out assumptions that are usually implicit in FoLiA, such as defaults and element categories. This makes the job easier for parsers that do not implement the full FoLiA logic. It is meant as an alternative serialisation, to be used only where it makes sense (to support such third-party parsers).
* Various fixes for ``foliatextcontent``
* Implemented a first version of a FoLiA to Salt converter (proycon/folia85). This is still in an experimental stage. Salt is a graph-based model that acts as the intermediate model in the conversion tool Pepper. In theory, this folia2salt converter combined with Pepper allows users to convert FoLiA to formats such as TCF, Paula XML, ANNIS and many others.
* Updated documentation with some more in-depth sections on foliavalidator, tei2folia and folia2salt
* various foliaspec updates
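
The document ID preference order described for tei2folia in the notes above (DOI, then ISSN, then ISBN, then DTADirName, then an untyped fallback with a sanity check) can be sketched roughly as follows. This is an illustration only, not the converter's actual code: the function names, the dictionary keys and the sanitising rules are all assumptions made for the example.

```python
import re
from typing import Mapping, Optional

# Preference order taken from the 2.3.0 notes; the key names are hypothetical.
ID_PRIORITY = ("DOI", "ISSN", "ISBN", "DTADirName")


def sanitize(value: str) -> Optional[str]:
    """Turn a raw identifier into an ID-like string, or None if nothing is left.

    The rules here are an assumption for illustration, not tei2folia's own.
    """
    # Replace runs of characters outside a conservative safe set with "_".
    candidate = re.sub(r"[^A-Za-z0-9_.-]+", "_", value.strip()).strip("_")
    if not candidate:
        return None
    # Prefix anything that does not start with a letter (digit, dot, hyphen).
    if not candidate[0].isalpha():
        candidate = "id_" + candidate
    return candidate


def pick_document_id(
    typed: Mapping[str, str], untyped: Optional[str] = None
) -> Optional[str]:
    """Return the first usable typed identifier, else the untyped fallback."""
    for key in ID_PRIORITY:
        if key in typed:
            result = sanitize(typed[key])
            if result:
                return result
    return sanitize(untyped) if untyped else None


if __name__ == "__main__":
    print(pick_document_id({"ISBN": "978-3-16-148410-0", "DOI": "10.1234/abc"}))
    print(pick_document_id({}, "  my doc "))
```

A typed identifier that sanitises to nothing is skipped, so the lookup moves on to the next type instead of producing an empty ID.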

2.2.7

* Fixed excessively slow foliaupgrade 7

2.2.6

Significant fixes and improvements for tei2folia (previously tested only on the DBNL collection, now on several others as well).

2.2.5

* Added txt2folia tool
* [conllu2folia] added --outputfile parameter

2.2.4

* [alpino2folia] Fixed conversion of dependencies and the syntax layer, which had been missing for a long time due to an unforeseen change in Alpino output (proycon/folia49)
* [foliaspec] Added code generation for new Rust library
* [foliabench] Added a new benchmark tool for the foliapy library, implementing four benchmarks.

2.2.3

Documentation update: added docstrings to all the tools (automatically harvestable for metadata).
