What's Changed
* works on Windows; setting dtype is important by wejoncy in https://github.com/wejoncy/QLLM/pull/54
* default to use_heuristic=false for models with hard-to-predict unquantized layers, such as mixtral-8x7b by wejoncy in https://github.com/wejoncy/QLLM/pull/55
* add mixtral in readme example by wejoncy in https://github.com/wejoncy/QLLM/pull/56
* fix bug when exporting 16-bit model by wejoncy in https://github.com/wejoncy/QLLM/pull/57
* Fix build error: uint32_t is not defined (include <stdint.h>) by wejoncy in https://github.com/wejoncy/QLLM/pull/58
* dq kernel supports g_idx by wejoncy in https://github.com/wejoncy/QLLM/pull/59
* [important] improve packing, faster by wejoncy in https://github.com/wejoncy/QLLM/pull/60
* [improve packing] fix for awq unpack by wejoncy in https://github.com/wejoncy/QLLM/pull/61
* 3bit support with g_idx in dq_kernel by wejoncy in https://github.com/wejoncy/QLLM/pull/63
* 3bit fix by wejoncy in https://github.com/wejoncy/QLLM/pull/64
* 0.1.5.dev by wejoncy in https://github.com/wejoncy/QLLM/pull/65
* onnx: support act_order, plus some onnx fixes by wejoncy in https://github.com/wejoncy/QLLM/pull/66
* Support gemv with g_idx and some fixes in exporter/dataloader by wejoncy in https://github.com/wejoncy/QLLM/pull/67
* support mixtral in gptq/awq by wejoncy in https://github.com/wejoncy/QLLM/pull/68
* minor fix for act_order detection by wejoncy in https://github.com/wejoncy/QLLM/pull/70
* Bump version to 0.1.5 by wejoncy in https://github.com/wejoncy/QLLM/pull/69
**Full Changelog**: https://github.com/wejoncy/QLLM/compare/v0.1....v0.1.5