What's Changed * Fixed deep copy, shallow copy error and label mask error. by Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/29
What's Changed * Solves the problem that some variables are not declared by Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/28
What's Changed * Solves the problem that some variables are not declared by Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/27
What's Changed * Fix TypeError for is_valid_reward in SelfRewardDPOConfig by ViswanathaReddyGajjala in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/19
New Contributors * ViswanathaReddyGajjala made their first contribution in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/19