What's Changed
* Add support for Grouped Query Attention (GQA) in Huggingface transformers.
* Include [minimal examples](https://github.com/VainF/Torch-Pruning/tree/master/examples/LLMs) for Large Language Models (LLaMA-2 & LLaMA-3).
**Full Changelog**: https://github.com/VainF/Torch-Pruning/compare/v1.3.7...v1.4.0