What's Changed
* Upgrade vLLM to 0.8.2 (V1 engine) and DeepSpeed to 0.16.5 xiaoxigua999
* fix generation attn_mask in ppo_train by gzpan in https://github.com/OpenRLHF/OpenRLHF/pull/913
* Add progress bar during forward batch when making experience by HollowMan6 in https://github.com/OpenRLHF/OpenRLHF/pull/925
* Replace deprecated vLLM generate API by HollowMan6 in https://github.com/OpenRLHF/OpenRLHF/pull/926
* Update for recent HIP_VISIBLE_DEVICES changes in ray by HollowMan6 in https://github.com/OpenRLHF/OpenRLHF/pull/933
* Fix full determinism mode when using vLLM V1 by HollowMan6 in https://github.com/OpenRLHF/OpenRLHF/pull/932
**Full Changelog**: https://github.com/OpenRLHF/OpenRLHF/compare/v0.6.3...v0.6.4