Features:
Flash checkpoint provides `FlashCkptTrainer` to support HuggingFace `transformers.Trainer`.
Flash checkpoint supports loading Megatron-LM checkpoints from memory.
Flash checkpoint supports saving and loading FSDP checkpoints with a full state dict.
The job master can sort node ranks by the access switch each node is connected to.
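
As an illustration of switch-aware rank sorting, here is a minimal sketch. It is not the job master's actual implementation: the function name `assign_ranks_by_switch`, the input mapping, and the switch names are all hypothetical. The idea is only that nodes under the same access switch receive consecutive ranks, so neighboring ranks tend to communicate within one switch.

```python
def assign_ranks_by_switch(node_switches):
    """Assign consecutive ranks to nodes that share an access switch.

    node_switches: dict mapping a node id to the name of its access switch
    (hypothetical input format, assumed for this sketch).
    Returns a dict mapping node id -> rank.
    """
    # Sort by switch name first, then by node id to keep the order deterministic.
    ordered = sorted(node_switches, key=lambda node: (node_switches[node], node))
    return {node: rank for rank, node in enumerate(ordered)}


# Four nodes spread across two switches (made-up topology).
nodes = {0: "switch-b", 1: "switch-a", 2: "switch-b", 3: "switch-a"}
ranks = assign_ranks_by_switch(nodes)
print(ranks)
```

In this made-up topology, nodes 1 and 3 (both on `switch-a`) get ranks 0 and 1, and nodes 0 and 2 (both on `switch-b`) get ranks 2 and 3.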
BugFix:
Fix the segmentation fault that occurred when restarting the training process.