Latest version: v0.0.1
The information on this page was curated by experts in our Cybersecurity Intelligence Team.
AdEMAMix is a PyTorch optimizer that combines two EMAs to better utilize past gradients, offering improved convergence and model retention over AdamW.
No known vulnerabilities found