API Change
1. middleware: `CkptSaver(cfg, policy, train_freq=100)` -> `CkptSaver(policy, cfg.exp_name, train_freq=100)`
Env
1. add metadrive env and related ppo config (574)
2. add acrobot env and related dqn config (577)
3. add carracing in box2d (575)
4. add new gym hybrid viz (563)
5. update cartpole IL config (578)
Algorithm
1. add BDQ algorithm (558)
2. add procedure cloning model (573)
Enhancement
1. add simplified PPOF (PPO × Family) interface (567) (568) (581) (582)
Fix
1. fix to_device and prev_state bug when using ttorch (571)
2. fix py38 and numpy unittest bugs (565)
3. fix typo in contrastive_loss.py (572)
4. fix dizoo envs pkg installation bugs
5. fix multi_trainer middleware unittest bug
Style
1. add evogym docker (580)
2. fix metaworld docker bug
3. fix setuptools high version incompatibility bug
4. extend treetensor lowest version
New Paper
1. [GoBigger](https://openreview.net/forum?id=NnOZT_CR26Z): [ICLR 2023] A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation
**Contributors: PaParaZz1 puyuan1996 timothijoe Cloud-Pku ruoyuGao Super1ce karroyan kxzxvbk eltociear**