- env: MiniGrid env (110)
- env: Bsuite env (110)
- env: GoBigger env (39)
- algo: RND+MuZero (110)
- algo: Sampled AlphaZero (141)
- algo: Multi-Agent MuZero/EfficientZero (39)
- feature: add ctree version of MCTS in AlphaZero (142)
- feature: replace the gym dependency with gymnasium (150)
- feature: add agent class to support LightZero's HuggingFace Model Zoo (163)
- feature: add recent MCTS-related papers to README (159)
- feature: add MuZero config for Connect4 (107)
- feature: add CONTRIBUTING.md (119)
- feature: add .gitpod.yml and .gitpod.Dockerfile (123)
- feature: add contributors subsection to README (132)
- feature: add CODE_OF_CONDUCT.md (127)
- polish: refine comments and render_eval configs for various common envs (154) (161)
- polish: polish action_type and env_type, fix test.yml, fix unittest (160)
- polish: update env and algo tutorial doc (106)
- polish: polish gomoku env (141)
- polish: add random_policy support for continuous env (118)
- polish: polish simulation method of ptree_az (120)
- polish: polish comments of game_segment_to_array
- fix: fix render method for various common envs (154) (161)
- fix: fix Gumbel MuZero collector bug, fix Gumbel typo (144)
- fix: fix assert bug in game_segment.py (138)
- fix: fix visit_count_distributions name in muzero_evaluator
- fix: fix mcts and alphabeta bot unittest (120)
- fix: fix typos in ptree_mz.py (113)
- fix: fix root_sampled_actions_tmp shape bug in sez ptree
- fix: fix policy utils unittest
- fix: fix typo in README and add a 'back to top' button to README (104) (109) (111)
- style: add NeurIPS 2023 paper link