AutoAWQ

Latest version: v0.2.7.post2
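
The version strings on this page come in two shapes: `v0.2.7.post2` carries a `v` prefix and a `.post` suffix, while the release headings use bare numbers such as `0.1.6`. To check an environment against one of the releases listed here, a small helper along these lines may be useful. It is a sketch that uses only the Python standard library; the helper names are hypothetical, and the distribution name `autoawq` is an assumption about how the package is installed.

```python
import re
from importlib import metadata
from typing import Optional, Tuple


def release_tuple(version: str) -> Tuple[int, ...]:
    """Extract the leading numeric release, e.g. 'v0.2.7.post2' -> (0, 2, 7).

    Suffixes such as '.post2' or '+cu118' are ignored, and the result is
    padded to at least three components so '0.1' compares equal to '0.1.0'.
    """
    match = re.match(r"v?(\d+(?:\.\d+)*)", version.strip())
    if match is None:
        raise ValueError(f"no numeric release found in {version!r}")
    parts = [int(p) for p in match.group(1).split(".")]
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def installed_release(dist_name: str = "autoawq") -> Optional[Tuple[int, ...]]:
    """Return the installed release tuple, or None if the package is absent.

    The default distribution name is an assumption, not taken from this page.
    """
    try:
        return release_tuple(metadata.version(dist_name))
    except metadata.PackageNotFoundError:
        return None


def at_least(installed: Optional[Tuple[int, ...]], required: str) -> bool:
    """True if an installed release is present and not older than `required`."""
    return installed is not None and installed >= release_tuple(required)
```

For example, `at_least(installed_release(), "0.1.6")` reports whether the environment has at least the release whose notes list "Default to safetensors for quantized models".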



0.1.6

What's Changed
* Pseudo dequantize function by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/127
* CUDA 11.8.0 and 12.1.1 build by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/128
* AwqConfig class by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/132
* Fix init quant by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/136
* Update readme by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/137
* Benchmark info by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/138
* Bump to v0.1.6 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/139
* CUDA 12 release by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/140
* Revert to previous version by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/141
* Fix performance regression by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/148
* [`core` / `attention`] Fix fused attention generation with newest transformers version by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/146
* Fix condition when rolling cache by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/150
* Default to safetensors for quantized models by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/151
* Create fused LlamaLikeModel by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/152


**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.5...v0.1.6
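
The 0.1.6 list mentions a pseudo dequantize function without describing it. As general background only: "pseudo" (simulated) quantization usually means rounding weights to low-bit integer codes and immediately mapping them back to floats, so the rounding error can be inspected without packed kernels. The sketch below is a minimal pure-Python illustration of that idea using group-wise min-max 4-bit codes. The function names, the default group size, and the min-max scheme are assumptions chosen for illustration; they are not taken from AutoAWQ's code or from pull request 127.

```python
from typing import List, Sequence, Tuple


def quantize_group(weights: Sequence[float], n_bits: int = 4) -> Tuple[List[int], float, float]:
    """Map one group of floats to integer codes in [0, 2**n_bits - 1].

    Returns (codes, scale, offset), where offset is the group's minimum value.
    """
    q_max = 2 ** n_bits - 1
    w_min, w_max = min(weights), max(weights)
    # Guard against a zero range when every weight in the group is identical.
    scale = max(w_max - w_min, 1e-5) / q_max
    codes = [min(max(round((w - w_min) / scale), 0), q_max) for w in weights]
    return codes, scale, w_min


def dequantize_group(codes: Sequence[int], scale: float, offset: float) -> List[float]:
    """Map integer codes back to approximate float weights."""
    return [offset + c * scale for c in codes]


def pseudo_quantize(weights: Sequence[float], group_size: int = 128, n_bits: int = 4) -> List[float]:
    """Quantize then immediately dequantize, group by group.

    The output has the same length as the input and stays in floating point,
    but each value has been snapped to one of 2**n_bits levels in its group.
    """
    approx: List[float] = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        codes, scale, offset = quantize_group(group, n_bits)
        approx.extend(dequantize_group(codes, scale, offset))
    return approx
```

With this scheme the per-weight error is at most about half of that group's `scale`, which is why smaller groups (a narrower min-max range per group) tend to reduce the error at the cost of storing more scales and offsets.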

0.1.5

What's Changed
* Only apply attention mask if seqlen is greater than 1 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/96
* add gpt_neox support by twaka in https://github.com/casper-hansen/AutoAWQ/pull/113
* [`core`] Support fp32 / bf16 inference by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/121
* Fix potential overflow by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/102
* Fixing starcoder based models with 15B by SebastianBodza in https://github.com/casper-hansen/AutoAWQ/pull/118
* Support Aquila models. by ftgreat in https://github.com/casper-hansen/AutoAWQ/pull/123
* Add benchmark of Aquila2 34B AWQ in README.md. by ftgreat in https://github.com/casper-hansen/AutoAWQ/pull/126

New Contributors
* twaka made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/113
* younesbelkada made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/121
* SebastianBodza made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/118
* ftgreat made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/123

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.4...v0.1.5

0.1.4

What's Changed
* Refactor cache and embedding modules by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/95
* Fix `TypeError: 'NoneType' object is not subscriptable`


**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.3...v0.1.4

0.1.3

What's Changed
* Turing inference support (Colab+Kaggle working) by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/92
* Fix memory bug (save 2GB VRAM)

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.2...v0.1.3

0.1.2

What's Changed
* Fix unexpected keyword by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/88
* Fix Falcon n_kv_heads parameter by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/89
* Mistral fused modules by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/90


**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.1...v0.1.2

0.1.1

What's Changed
* Add GPT BigCode support (StarCoder) by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/61
* Use typing classes over base types by VikParuchuri in https://github.com/casper-hansen/AutoAWQ/pull/69
* Fix KV cache shapes error by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/75
* Mistral support by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/79
* Add low_cpu_mem_usage=True in example by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/80
* Offloading to cpu and disk by s4rduk4r in https://github.com/casper-hansen/AutoAWQ/pull/77
* Faster build, fix "no space left". by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/84

New Contributors
* VikParuchuri made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/69
* s4rduk4r made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/77

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.0...v0.1.1

