What's Changed
There are too many commit messages to list individually, so here is a summary.
General Features:
- Pipeline Parallelism
- The cache no longer moves tensors across devices unless explicitly told to
New Models:
- Redwood 2L
- New Pythia Models
- LLaMA
Analysis Features:
- Add apply_ln to stack_head_results and stack_neuron_results
- Context Manager for Hooks
- Attention Head Detectors
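
To illustrate the idea behind a context manager for hooks, here is a minimal sketch in plain Python. It is not the TransformerLens implementation: `TinyHookedModel`, its `run` method, and the hook point names `double` and `inc` are hypothetical stand-ins invented for this example. The point it shows is that hooks are attached when the `with` block is entered and removed when it exits, even if an error is raised inside the block.

```python
from contextlib import contextmanager


class TinyHookedModel:
    """Hypothetical stand-in for a hooked model (not the TransformerLens API)."""

    def __init__(self):
        # Maps a hook point name to the list of hooks attached there.
        self.fwd_hooks = {}

    def run(self, x):
        # Two toy "layers", each followed by a named hook point.
        layers = (("double", lambda v: v * 2), ("inc", lambda v: v + 1))
        for name, layer in layers:
            x = layer(x)
            # Each hook receives the activation and returns its replacement.
            for hook in self.fwd_hooks.get(name, []):
                x = hook(x)
        return x

    @contextmanager
    def hooks(self, fwd_hooks):
        """Temporarily attach (name, hook) pairs; always detach on exit."""
        added = []
        try:
            for name, hook in fwd_hooks:
                self.fwd_hooks.setdefault(name, []).append(hook)
                added.append((name, hook))
            yield self
        finally:
            # Runs on normal exit and on exceptions, so no hook leaks out.
            for name, hook in added:
                self.fwd_hooks[name].remove(hook)


model = TinyHookedModel()
baseline = model.run(3)            # no hooks attached

with model.hooks([("double", lambda v: 0)]):
    patched = model.run(3)         # output of the "double" layer is zeroed

restored = model.run(3)            # hooks were removed on exit
print(baseline, patched, restored)
```

Scoping hooks to a `with` block avoids the common mistake of leaving a hook attached after an experiment and silently changing later forward passes.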
Thanks to all the Contributors!
Many thanks to: rusheb, ckkissane, slavachalnev, JayBaileyCS, zshn-gvg, jbloomAus, adzcai, adamyedidia, ArthurConmy, bryce13950, daspartho, haileyschoelkopf, 0amp
**Full Changelog**: https://github.com/neelnanda-io/TransformerLens/compare/v1.2.1...v1.2.2