Ua-gec

Latest version: v2.1.3

2.1.0

Added
- m2 files
- API access to sentence-split and tokenized version of source and target texts

Fixed
- Many punctuation whitespace fixes -- contributed by danmysak
- Mismatch in length of sentence-split source and targets in some cases.
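
For readers unfamiliar with the format, here is a minimal sketch of reading M2 data in Python. It assumes the files follow the widely used M2 layout: an `S` line with the tokenized source, followed by `A` lines of the form `start end|||type|||correction|||required|||comment|||annotator`. The exact layout of the files shipped with this release is not confirmed here. `Edit`, `Sentence`, and `parse_m2` are hypothetical helper names, not part of the ua-gec API, and the sample sentence and its error label are purely illustrative.

```python
from dataclasses import dataclass, field
from typing import Iterable, List


@dataclass
class Edit:
    start: int          # index of the first source token covered by the edit
    end: int            # index one past the last covered token
    error_type: str     # error category label as written in the file
    correction: str     # replacement text for the covered tokens
    annotator_id: int   # last field of the "A" line


@dataclass
class Sentence:
    tokens: List[str]
    edits: List[Edit] = field(default_factory=list)


def parse_m2(lines: Iterable[str]) -> List[Sentence]:
    """Parse lines in the conventional M2 layout (assumed, not confirmed)."""
    sentences: List[Sentence] = []
    current = None
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("S "):
            # A new source sentence: whitespace-separated tokens.
            current = Sentence(tokens=line[2:].split())
            sentences.append(current)
        elif line.startswith("A ") and current is not None:
            # An edit attached to the most recent source sentence.
            fields = line[2:].split("|||")
            start, end = map(int, fields[0].split())
            current.edits.append(
                Edit(start, end, fields[1], fields[2], int(fields[-1]))
            )
        # Blank lines only separate sentences, so they are skipped.
    return sentences


# Illustrative input, not taken from the corpus.
sample = [
    "S I likes it .",
    "A 1 2|||Grammar|||like|||REQUIRED|||-NONE-|||0",
    "",
]
parsed = parse_m2(sample)
```

On real data, `parse_m2` could be given an open file object, since it accepts any iterable of lines.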

2.0.0

Added
- 861 new documents (13,020 new sentences)!
- Detailed annotations (22 error categories vs. 4 categories in v1)
- GEC-only annotations
- Multiple annotators per document (as indicated by `doc.meta.annotator_id`)
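
Because one document can now appear with several annotators, it can be handy to see which annotators worked on which document. The sketch below only relies on the `doc.meta.annotator_id` attribute mentioned above. The `doc_id` attribute name and the `annotators_by_doc` helper are assumptions made for illustration, and the stand-in objects replace real corpus documents so the example runs without the package installed.

```python
from collections import defaultdict
from types import SimpleNamespace


def annotators_by_doc(docs):
    """Map each document id to the sorted list of its annotator ids."""
    grouped = defaultdict(set)
    for doc in docs:
        # `doc_id` is an assumed attribute name; `meta.annotator_id`
        # is the field named in the release notes.
        grouped[doc.doc_id].add(doc.meta.annotator_id)
    return {doc_id: sorted(ids) for doc_id, ids in grouped.items()}


# Stand-in objects with made-up ids, used in place of real documents.
docs = [
    SimpleNamespace(doc_id="0001", meta=SimpleNamespace(annotator_id=1)),
    SimpleNamespace(doc_id="0001", meta=SimpleNamespace(annotator_id=2)),
    SimpleNamespace(doc_id="0002", meta=SimpleNamespace(annotator_id=1)),
]
mapping = annotators_by_doc(docs)
```

With the stand-in data, `mapping["0001"]` holds two annotator ids and `mapping["0002"]` holds one.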

1.3.0.dev0

Added
- Annotations may indicate newline insertion/deletion by using the "\n" token

Changed
- Fixed annotations in ~10 docs (lists, tables, newlines)
- Sentence-split source and target files are now guaranteed to have the same number of lines
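
The equal-line-count guarantee means sentence-split source and target files can be read in parallel, line by line. A minimal sketch of doing that with a defensive check is shown here. The function name `read_parallel` is hypothetical, and the file paths passed to it are whatever pair of sentence-split files you are working with; no specific file names from the corpus are assumed.

```python
from typing import List, Tuple


def read_lines(path: str) -> List[str]:
    """Read a UTF-8 text file into a list of lines without trailing newlines."""
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f]


def read_parallel(src_path: str, tgt_path: str) -> List[Tuple[str, str]]:
    """Pair up source and target sentences, failing loudly on a mismatch."""
    src = read_lines(src_path)
    tgt = read_lines(tgt_path)
    if len(src) != len(tgt):
        raise ValueError(
            f"line count mismatch: {len(src)} source vs {len(tgt)} target"
        )
    return list(zip(src, tgt))
```

Raising an error instead of silently truncating with `zip` makes a misaligned pair of files visible immediately.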

1.2.1

Changed
- Fixed bug with `is_sensitive` metadata

1.2.0

Added
- Sentence-level aligned data
- Tokenized doc-level and sentence-level data

1.1.1

Added
- `Corpus.get_doc()` method to find a document by id.
