AutoAWQ

Latest version: v0.2.5

0.2.5

What's Changed
* Fix fused models for transformers >= 4.39 by TechxGenus in https://github.com/casper-hansen/AutoAWQ/pull/418
* FIX: Add safe guards for static cache + llama on transformers latest by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/401
* Pin: lm_eval==0.4.1 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/426
* Implement `apply_clip` argument to `quantize()` by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/427
* Workaround: illegal memory access by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/421
* Add download_kwargs for load model (302) by Roshiago in https://github.com/casper-hansen/AutoAWQ/pull/399
* add starcoder2 support by shaonianyr in https://github.com/casper-hansen/AutoAWQ/pull/406
* Add StableLM support by Isotr0py in https://github.com/casper-hansen/AutoAWQ/pull/410
* Fix starcoder2 fused norm by TechxGenus in https://github.com/casper-hansen/AutoAWQ/pull/442
* Update generate example to llama 3 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/448
* [BUG] Fix github action documentation build by suparious in https://github.com/casper-hansen/AutoAWQ/pull/449
* Fix path by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/451
* FIX: 'awq_ext' is not defined error by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/465
* FIX: Fix multiple generations for new HF cache format by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/444
* support max_memory to specify mem usage for each GPU by laoda513 in https://github.com/casper-hansen/AutoAWQ/pull/460
* Bump to 0.2.5 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/468

New Contributors
* Roshiago made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/399
* shaonianyr made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/406
* Isotr0py made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/410
* suparious made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/449
* laoda513 made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/460

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.2.4...v0.2.5
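
For readers who want to try the `apply_clip` argument to `quantize()` listed in this release, the following is a minimal sketch rather than an official recipe. The helper names `build_quant_config` and `quantize_model` and any paths passed to them are hypothetical. The `quant_config` values are illustrative assumptions, not taken from these notes. The `awq` and `transformers` imports are deferred so the config helper can be used without the heavy dependencies installed.

```python
def build_quant_config(w_bit: int = 4, group_size: int = 128) -> dict:
    """Build a quantization config dict (values are illustrative assumptions)."""
    return {
        "zero_point": True,
        "q_group_size": group_size,
        "w_bit": w_bit,
        "version": "GEMM",
    }


def quantize_model(model_path: str, quant_path: str, apply_clip: bool = True) -> None:
    """Quantize a model and save it; assumes autoawq >= 0.2.5 and suitable hardware."""
    # Imported lazily: these packages are heavy and not needed to build the config.
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    # apply_clip is the argument added in this release; pass False to skip
    # the clipping step during quantization.
    model.quantize(tokenizer, quant_config=build_quant_config(), apply_clip=apply_clip)

    # Persist the quantized weights alongside the tokenizer files.
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
```

A call such as `quantize_model("path/to/model", "path/to/output", apply_clip=False)` would run the same pipeline with clipping disabled; whether that helps or hurts accuracy depends on the model and should be checked with an evaluation.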

0.2.4

What's Changed
* Add Gemma Support by TechxGenus in https://github.com/casper-hansen/AutoAWQ/pull/393
* Pin transformers>=4.35.0,<=4.38.2 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/408
* Bump to v0.2.4 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/409

New Contributors
* TechxGenus made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/393

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.2.3...v0.2.4
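
The transformers pin in this release (`>=4.35.0,<=4.38.2`) can be checked against an installed version with a few lines of standard-library Python. This is a simplified sketch under stated assumptions: the helper names `parse_release` and `satisfies_pin` are hypothetical, only the leading numeric part of the version is compared, and pre-release or development suffixes are ignored, so it is not a full PEP 440 comparison.

```python
import re

# Bounds copied from the 0.2.4 pin: transformers>=4.35.0,<=4.38.2
PIN_MIN = (4, 35, 0)
PIN_MAX = (4, 38, 2)


def parse_release(version: str) -> tuple:
    """Return the leading numeric release as a 3-tuple of ints, zero-padded."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"not a version string: {version!r}")
    parts = [int(p) for p in match.group(0).split(".")][:3]
    return tuple(parts + [0] * (3 - len(parts)))


def satisfies_pin(version: str) -> bool:
    """True if the version lies inside the inclusive pinned range."""
    return PIN_MIN <= parse_release(version) <= PIN_MAX
```

To test the installed package, the version string can be obtained with `importlib.metadata.version("transformers")` and passed to `satisfies_pin`.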

0.2.3

What's Changed
* New optimized kernels by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/365
* Fix double bias by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/383
* x_max -> x_mean and w_max -> w_mean name changes and some comments by OscarSavolainenDR in https://github.com/casper-hansen/AutoAWQ/pull/378

New Contributors
* OscarSavolainenDR made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/378

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.2.2...v0.2.3

0.2.2

What's Changed
* Support Fused Mixtral on multi-GPU by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/352
* Add multi-GPU benchmark of Mixtral by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/353
* Remove MoE Triton kernels by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/355
* Bump to 0.2.2 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/356


**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.2.1...v0.2.2

0.2.1

What's Changed
* Avoid downloading ROCm by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/347
* ENH / FIX: Few enhancements and fix for mixed-precision training by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/348
* Fix triton dependency by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/350
* Bump to 0.2.1 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/351


**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.2.0...v0.2.1

0.2.0

What's Changed
* AWQ: Separate the AWQ kernels to separate repository by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/279
* Add CPU-loaded multi-GPU quantization by xNul in https://github.com/casper-hansen/AutoAWQ/pull/289
* GGUF compatible quantization (2, 3, 4 bit / any bit) by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/285
* Exllama kernels support by IlyasMoutawwakil in https://github.com/casper-hansen/AutoAWQ/pull/313
* Cleanup requirements by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/295
* Torch only inference + any-device quantization by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/319
* Up to 60% faster context processing by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/316
* Evaluation: Add more evals by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/283
* Fixes a breaking change in autoawq by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/325
* AMD ROCM Support by IlyasMoutawwakil in https://github.com/casper-hansen/AutoAWQ/pull/315
* Marlin symmetric quantization and inference by IlyasMoutawwakil in https://github.com/casper-hansen/AutoAWQ/pull/320
* Add qwen2 by JustinLin610 in https://github.com/casper-hansen/AutoAWQ/pull/321
* Fix n_samples by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/326
* PEFT compatible GEMM by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/324
* [`PEFT`] Fix PEFT batch size > 1 by younesbelkada in https://github.com/casper-hansen/AutoAWQ/pull/338
* v0.2.0 by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/330
* Fix ROCm build by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/342
* Fix dependency by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/343
* Fix importlib by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/344
* Fix workflow by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/345
* Fix typo in setup.py by casper-hansen in https://github.com/casper-hansen/AutoAWQ/pull/346

New Contributors
* xNul made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/289
* IlyasMoutawwakil made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/313
* JustinLin610 made their first contribution in https://github.com/casper-hansen/AutoAWQ/pull/321

**Full Changelog**: https://github.com/casper-hansen/AutoAWQ/compare/v0.1.8...v0.2.0
