Added
- Added support for Qwen models. ([15](https://github.com/tomaarsen/attention_sinks/pull/15))
- Added support for StableLM_Epoch models. ([20](https://github.com/tomaarsen/attention_sinks/pull/20))
Changed
- Changed how Attention Sinks are injected into models, allows `attention_sinks` to be integrated with architectures that aren't in `transformers` ([16](https://github.com/tomaarsen/attention_sinks/pull/16))