Gptq-triton

Latest version: v0.0.3

Safety actively analyzes 623616 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.0.3

- Added support for groupsize.
- Note: fuse_mlp is not recommended for groupsize != -1. It is now disabled automatically during loading if the model has grouping, unless fuse_mlp is explictly set to True. This is a result of the current kernel implementation being slower than the naive implementation for groupsize != -1.
- Added a warning if `act_order` and `groupsize` are used together. They are not compatible.

Links

Releases

Has known vulnerabilities

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.