Added - m2 files - API access to sentence-split and tokenized version of source and target texts
Fixed - Many punctuation whitespace fixes -- contributed by danmysak - Mismatch in length of sentence-split source and targets in some cases.
2.0.0
Added - 861 new documents (13,020 new sentences)! - Detailed annotations (22 error categories vs. 4 categories in v1) - GEC-only annotations - Multiple annotators per document (as indicated by `doc.meta.annotator_id`)
1.3.0.dev0
Added - Annotations may indicate newline insertion/deletion by using the "\n" token
Changed - Fix annotations in ~10 docs (lists, tables, newlines) - Sentence-split source and target files are now guaranteed to have the same number of lines
1.2.1
Changed - Fixed bug with `is_sensitive` metadata
1.2.0
Added - Sentence-level aligned data - Tokenized doc-level and sentence-level data
1.1.1
Added - `Corpus.get_doc()` method to find a document by id.