**2. New CI/CD**
MegaBlocks has new Github Actions for better CI/CD! Now on every PR, MegaBlocks will automatically perform code linting and formatting (131) and run tests on a GPU (127).
**3. Remove Weight Parallelism (137)**
Weight parallelism was not in use and so we removed it.
**4. Shared Experts (109)**
Implement shared experts, based on the DeepSeekMoE [paper](https://arxiv.org/abs/2401.06066).
Bug Fixes
1. Better handle incompatible ffn sizes (108)
2. Fix AMP for memory optimized options (111)
3. Don't save moe lb-loss tensors (119)
What's Changed
* Remove turbo by dblalock in https://github.com/databricks/megablocks/pull/96
* Update README.md by dakinggg in https://github.com/databricks/megablocks/pull/98
* Fix for `ffn_hidden_size` of 128, and better error message for incompatible ffn sizes. by snarayan21 in https://github.com/databricks/megablocks/pull/108
* Add Shared Expert by vchiley in https://github.com/databricks/megablocks/pull/109
* Fix AMP for memory optimized options by mvpatel2000 in https://github.com/databricks/megablocks/pull/111
* bump and pin versions by vchiley in https://github.com/databricks/megablocks/pull/112
* dont save moe lb-loss tensors if args.moe_loss_weight=0 by michael-go in https://github.com/databricks/megablocks/pull/119
* bump by vchiley in https://github.com/databricks/megablocks/pull/116
* Minor changes to batched_load_balancing_loss function by ShashankMosaicML in https://github.com/databricks/megablocks/pull/121
* Migrate tests to pytest + add GA by eitanturok in https://github.com/databricks/megablocks/pull/127
* Change Runner in GA by eitanturok in https://github.com/databricks/megablocks/pull/129
* Clean up setup.py by eitanturok in https://github.com/databricks/megablocks/pull/128
* only run GA if repo owner is Databricks by eitanturok in https://github.com/databricks/megablocks/pull/135
* GA to Lint + Format MegaBlocks by eitanturok in https://github.com/databricks/megablocks/pull/131
* bump ci-testing to v0.1.2 by eitanturok in https://github.com/databricks/megablocks/pull/138
* remove weight parallelism by eitanturok in https://github.com/databricks/megablocks/pull/137
* refactor testing by eitanturok in https://github.com/databricks/megablocks/pull/140
* Type Checking by eitanturok in https://github.com/databricks/megablocks/pull/141
* Bump torch to <2.4.1 by eitanturok in https://github.com/databricks/megablocks/pull/145
New Contributors
* dakinggg made their first contribution in https://github.com/databricks/megablocks/pull/98
* michael-go made their first contribution in https://github.com/databricks/megablocks/pull/119
* ShashankMosaicML made their first contribution in https://github.com/databricks/megablocks/pull/121
**Full Changelog**: https://github.com/databricks/megablocks/compare/v0.5.1...v0.6.0