- (Jul 2017) **Hindsight Experience Replay**
Marcin Andrychowicz and Filip Wolski and Alex Ray and Jonas Schneider and Rachel Fong and
Peter Welinder and Bob McGrew and Josh Tobin and Pieter Abbeel and Wojciech Zaremba
http://arxiv.org/abs/1707.01495
- (Sep 2018) **Multi-task Deep Reinforcement Learning with PopArt**
Hessel, Matteo and Soyer, Hubert and Espeholt, Lasse and Czarnecki, Wojciech and Schmitt, Simon and van Hasselt, Hado
https://arxiv.org/abs/1809.04474