Added - Added `fertility` as a further type of attention coverage. - Added an option for training to keep the initializations of the model via `--keep-initializations`. When set, the trainer will avoid deleting the params file for the first checkpoint, no matter what `--keep-last-params` is set to.
1.18.66
Fixed - Fix to argument names that are allowed to differ for resuming training.
1.18.65
Changed - More informative error message about inconsistent --shared-vocab setting.
1.18.64
Added - Adding translation sampling via `--sample [N]`. This causes the decoder to sample each next step from the target distribution probabilities at each timestep. An optional value of `N` causes the decoder to sample only from the top `N` vocabulary items for each hypothesis at each timestep (the default is 0, meaning to sample from the entire vocabulary).
1.18.63
Changed - The checkpoint decoder and nvidia-smi subprocess are now launched from a forkserver, allowing for a better separation between processes.
1.18.62
Added - Add option to make `TranslatorInputs` directly from a dict.