-------------------------------------------------------------------------
- Record agent initial probabilities into JSON file. Used for visualizations.
- New *losses*, *rewards* and *action_preferences* visualizations.
- Added learning experiment.
- Refactored environment, now main class is *Environment*.
- User-defined action subsets and observation types.