What's Changed
* Debug `BLEU` by ryokan0123 in https://github.com/sbintuitions/flexeval/pull/18
* Update tests for metrics by ryokan0123 in https://github.com/sbintuitions/flexeval/pull/19
* Deal with a case when stop_seq is tokenized into an empty string in `HuggingFaceLM` by ryokan0123 in https://github.com/sbintuitions/flexeval/pull/20
**Full Changelog**: https://github.com/sbintuitions/flexeval/compare/v0.4.0...v0.4.1