5. add new task pipeline demo (DDPG/TD3/D4PG/C51/QRDQN/IQN?SQIL/TREX/PDQN) (374, 380, 384, 407)
Env (dizoo)
1. add gym anytrading env (424)
2. add board games env (tictactoe, gomuku, chess) (356)
3. add sokoban env (397) (429)
4. add BC and DQN demo for gfootball (418) (423)
6. add discrete pendulum env (395)
Algorithm
1. add STEVE model-based algorithm (363)
2. add PLR algorithm (408)
3. plugin ST-DIM into PPO (379)
Enhancement
1. add final result saving in training pipeline
Fix
1. random policy randomness bug
2. action_space seed compalbility bug
3. discard message sent by self in redis mq (354)
4. remove pace controller (400)
5. import error in serial_pipeline_trex (410)
7. unittest hang and fail bug (413)
8. DREX collect data bug
9. remove unused import cv2
10. ding CLI env/policy option bug
Style
1. add buffer api description (371)
2. polish VAE comments (404)
3. unittest for FQF (412)
4. add metaworld dockerfile (432)
5. remove opencv requirement in default setting
6. update long description in setup.py
New Repo
1. [InterFuser](https://github.com/opendilab/InterFuser): Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
2. [awesome-decision-transformer](https://github.com/opendilab/awesome-decision-transformer): A curated list of Decision Transformer resources
3. [awesome-exploration-RL](https://github.com/opendilab/awesome-exploration-rl): A curated list of awesome exploration RL resources
**Contributors: PaParaZz1 zjowowen sailxjx puyuan1996 ZHZisZZ lixl-st Cloud-Pku Weiyuhong-1998 karroyan kxzxvbk song2181 nighood zhangpaipai Hcnaeg**