👍 Improvements
* Calculate model perplexity per training epoch (a metric for synthetic dataset quality)
* Added progress bar for SentencePiece tokenizer (can take a while on large datasets)
* Cleaned up logging
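
For readers unfamiliar with the perplexity metric noted above, here is a minimal sketch of how per-epoch perplexity is commonly derived from training loss. It assumes the loss is the mean per-token cross-entropy measured in nats; the function names are hypothetical and are not part of this library's API.

```python
import math
from typing import List


def perplexity_from_loss(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    # Perplexity is the exponential of the average negative log-likelihood.
    return math.exp(mean_cross_entropy)


def perplexity_per_epoch(epoch_losses: List[float]) -> List[float]:
    """Map a list of per-epoch losses to per-epoch perplexities."""
    return [perplexity_from_loss(loss) for loss in epoch_losses]


if __name__ == "__main__":
    # Illustrative loss values only, not results from a real training run.
    for epoch, ppl in enumerate(perplexity_per_epoch([2.0, 1.5, 1.2]), start=1):
        print(f"epoch {epoch}: perplexity {ppl:.2f}")
```

Lower perplexity means the model assigns higher probability to the data it is evaluated on, so a falling value across epochs suggests the model is fitting the data better.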
📚 Tutorial and doc improvements
* Automatically save model parameters and training history to model directory
* Use the `save_all_checkpoints` config option to choose between saving only the best checkpoint or all checkpoints (saving only the best reduces disk usage)