Features:
* Flash checkpoint provides FlashCkptTrainer to support HuggingFace transforemers.Trainer.
* Flash checkpoint supports loading the checkpint of Megatron-LM from the memory.
Flash Checkpoint supports saving and loading FSDP checkpoint with full state dict.
* Job master can sort the node ranks by the access switches of the node.
BugFix:
* Fix the segment fault when restarting the training process.