- Many thanks to Martin Popel for all the changes below!
- Added evaluation on concatenated test sets (e.g., `-t wmt17,wmt18`).
Works as long as they all have the same language pair.
- Added `sacrebleu --origlang` (both for evaluation on a subset and for `--echo`).
Note that while echoing prints just the subset, evaluation expects the complete
test set (and just skips the irrelevant parts).
- Added `sacrebleu --detail` for breakdown by domain-specific subsets of the test sets.
(Available for WMT19).
- Minor changes
- Improved display of `sacrebleu -h`
- Added `sacrebleu --list`
- Code refactoring
- Documentation and tests updates
- Fixed a race condition bug (`os.makedirs(outdir, exist_ok=True)` instead of `if os.path.exists`)