What's Changed
* [Feature] Support DPO, ORPO and Reward Model by RangiLyu in https://github.com/InternLM/xtuner/pull/743
* [Bugs] Fix dispatch bugs by HIT-cwh in https://github.com/InternLM/xtuner/pull/775
* [Bugs] Fix HFCheckpointHook bugs when training deepseekv2 and mixtral withou… by HIT-cwh in https://github.com/InternLM/xtuner/pull/774
* [Feature] Support the scenario where sp size is not divisible by attn head num by HIT-cwh in https://github.com/InternLM/xtuner/pull/769
* bump version to 0.1.21 by HIT-cwh in https://github.com/InternLM/xtuner/pull/776

**Full Changelog**: https://github.com/InternLM/xtuner/compare/v0.1.20...v0.1.21