What's Changed
* Fix: bug in actor logit's numerical precision when bf16 is on. by Illyasville in https://github.com/OpenRLHF/OpenRLHF/pull/634
* docs: add Japanese README file by eltociear in https://github.com/OpenRLHF/OpenRLHF/pull/636
* Fix loss_mean log by xiaoxigua999 Freder-chen in https://github.com/OpenRLHF/OpenRLHF/pull/650
* Fix: In prm training, placeholder_token should be truncated if input is truncated by LinXueyuanStdio in https://github.com/OpenRLHF/OpenRLHF/pull/652
* [Fix adapter_model.safetensors for LoRA + ZeRO3](https://github.com/OpenRLHF/OpenRLHF/commit/5a06164334f23c467cbe3dcfa7ef918c9931c567) xiaoxigua999
* [add lora_combiner.py](https://github.com/OpenRLHF/OpenRLHF/commit/18010bca5ff6cfe0cde4cf654efa180e1d28afe9) UbeCc xiaoxigua999
New Contributors
* Illyasville made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/634
* LinXueyuanStdio made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/652
**Full Changelog**: https://github.com/OpenRLHF/OpenRLHF/compare/v0.5.5...v0.5.6