Added the ability to checkpoint the pruner object without reinitializing random topologies.
0.3
Added the ability to accumulate gradients to simulate larger batch sizes for RigL steps. Larger batch sizes tend to reduce noise & the aim is to emulate a batch size of 4096 using a true batch size of 1024.
0.2
0.1
First release that implements RigL to spec with distributed training available.