Major release: refactoring of models, trainer and pipelines, and more! 🚀
- Refactored the old `task` class into a base class (`RL4COLitModule`) from which RL algorithms (such as REINFORCE and PPO) inherit, following the discussion in #67
- New base classes for construction methods: encoders, decoders, policies, and models can now inherit from common parent classes, which makes implementations much more modular
- Added native loading from checkpoints (checkpoint loading used to be buggy)
- Nice new logo (we like it, but we are obviously biased, so feel free to give us your opinion ;) )
- Added the mPDP environment (plus some WIP for EquityTransformer)
- New `RL4COTrainer` that automatically includes training tricks for RL
- Added Codecov coverage
- Better testing: now we thoroughly test most of the library, including training (the Hydra part as well!)
- Documentation overhaul: added Sphinx plugins for modularized, automatic docs
- ... and more!
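
To give a rough feel for the modular design described in the list above (one shared base module, with RL algorithms implemented as subclasses), here is a minimal pure-Python sketch. Every name in it (`BaseRLModule`, `ToyREINFORCE`, `toy_policy`, `calculate_loss`) is hypothetical and chosen for illustration only; it is not the actual `RL4COLitModule` API and does not depend on the library.

```python
from abc import ABC, abstractmethod


class BaseRLModule(ABC):
    """Hypothetical stand-in for a shared base module (not the real RL4COLitModule)."""

    def __init__(self, policy):
        self.policy = policy

    def training_step(self, batch):
        # Shared logic lives in the base class: run the policy, then defer
        # to the algorithm-specific loss defined by the subclass.
        rewards, log_probs = self.policy(batch)
        return self.calculate_loss(rewards, log_probs)

    @abstractmethod
    def calculate_loss(self, rewards, log_probs):
        ...


class ToyREINFORCE(BaseRLModule):
    """Toy REINFORCE with a constant baseline (assumption: illustration only)."""

    def __init__(self, policy, baseline=0.0):
        super().__init__(policy)
        self.baseline = baseline

    def calculate_loss(self, rewards, log_probs):
        # Negative batch mean of (reward - baseline) * log_prob.
        terms = [(r - self.baseline) * lp for r, lp in zip(rewards, log_probs)]
        return -sum(terms) / len(terms)


def toy_policy(batch):
    # Hypothetical policy: reward is the negated cost, log-probability is fixed.
    rewards = [-float(cost) for cost in batch]
    log_probs = [-1.0 for _ in batch]
    return rewards, log_probs


module = ToyREINFORCE(toy_policy, baseline=-2.0)
loss = module.training_step([1, 2])
```

In this sketch, adding another algorithm only requires a new subclass that overrides `calculate_loss`; the shared training step stays in the base class.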