Algorithms implemented:
* Dreamer-V1 (https://arxiv.org/abs/1912.01603)
* Dreamer-V2 (https://arxiv.org/abs/2010.02193)
* Plan2Explore, based on Dreamer-V1 (https://arxiv.org/abs/2005.05960)
* Plan2Explore, based on Dreamer-V2 (https://arxiv.org/abs/2005.05960)
* DroQ (https://arxiv.org/abs/2110.02034)
* PPO (https://arxiv.org/abs/1707.06347)
* PPO Recurrent (https://arxiv.org/abs/2205.11104)
* SAC (https://arxiv.org/abs/1812.05905)
* SAC-AE (https://arxiv.org/abs/1910.01741)