Added - Lexicon-based target vocabulary restriction for faster decoding. New CLI for top-k lexicon creation, sockeye.lexicon. New translate CLI argument `--restrict-lexicon`.
Changed - Bleu computation based on Sacrebleu.
1.10.5
Not secure
Fixed - Fixed yet another bug with the data iterator.
1.10.4
Not secure
Fixed - Fixed a bug with the revised data iterator not correctly appending EOS symbols for variable-length batches. This reverts part of the commit added in 1.10.1 but is now correct again.
1.10.3
Not secure
Changed - Fixed a bug with max_observed_{source,target}_len being computed on the complete data set, not only on the sentences actually added to the buckets based on `--max_seq_len`.
1.10.2
Added - `--max-num-epochs` flag to train for a maximum number of passes through the training data.
1.10.1
Not secure
Changed - Reduced memory footprint when creating data iterators: integer sequences are streamed from disk when being assigned to buckets.