Changed
- Downscaled fixed positional embeddings for CNN models.
- Renamed `--monitor-bleu` flag to `--decode-and-evaluate` to reflect that it computes
other metrics in addition to BLEU.
Added
- `--decode-and-evaluate-use-cpu` flag to use CPU for decoding validation data.
- `--decode-and-evaluate-device-id` flag to use a separate GPU device for validation decoding. If not specified,
the default behavior is unchanged: validation decoding uses the last GPU acquired for training.
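
The device choice described by the two new flags can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: the helper `pick_decode_device`, its parameters, and the returned device strings are hypothetical, and the precedence of the CPU flag over an explicit device id is an assumption.

```python
from typing import List, Optional


def pick_decode_device(use_cpu: bool,
                       device_id: Optional[int],
                       training_gpus: List[int]) -> str:
    """Hypothetical helper: choose the device for validation decoding.

    use_cpu       -- mirrors --decode-and-evaluate-use-cpu
    device_id     -- mirrors --decode-and-evaluate-device-id (None if not given)
    training_gpus -- GPU ids acquired for training, in acquisition order
    """
    # Assumption: the CPU flag wins if both flags are given.
    if use_cpu:
        return "cpu"
    # An explicit device id selects a separate GPU for decoding.
    if device_id is not None:
        return f"gpu({device_id})"
    # Default: fall back to the last GPU acquired for training.
    if not training_gpus:
        raise ValueError("no training GPU available for validation decoding")
    return f"gpu({training_gpus[-1]})"
```

With training on GPUs 0 and 1 and neither flag set, this sketch picks GPU 1, which matches the default described above.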