Fixed
- Expose bias flag for feedforwards, same default as Timm [facebookresearch/xformers220]
- Update eps value for layernorm, same default as torch [facebookresearch/xformers221]
- PreNorm bugfix, only one input was normalized [facebookresearch/xformers233]
- Fix bug where embedding dimensions that did not match model dim would lead to a crash [facebookresearch/xformers244]
Added
- Add DeepNet (DeepNorm) residual path and init [facebookresearch/xformers227]