- added TrafficJunction env (thanks to rafaelmp2)
- added clock version of switch and checkers
- eased the reward for the switch env: no step cost once the agent reaches the cell
- added one-hot representation for checkers observation
0.0.6
- code refactor
- small fix in switch environment (disabled agents are removed)
- no change in performance of agents
0.0.5
- corrected packaging of sub-packages for pip install
0.0.4
0.0.1
Initial collection of multi-agent environments for ease of experimentation.