- Fixed 472, in which `enqueue_idx` for CrossBatchMemory could not be passed into DistributedLossWrapper - Converted CrossBatchMemory's `embedding_memory` and `label_memory` to buffers, so they can be saved and loaded as state dicts and transferred to devices use `.to(device)`.