What's Changed
* suport Phi, detect multi blocks by wejoncy in https://github.com/wejoncy/QLLM/pull/43
* quick fix by wejoncy in https://github.com/wejoncy/QLLM/pull/44
* add colab example && turing support for awq && remove dependency of xbitops by wejoncy in https://github.com/wejoncy/QLLM/pull/46
* quick fix for meta device by wejoncy in https://github.com/wejoncy/QLLM/pull/47
* add trust code by wejoncy in https://github.com/wejoncy/QLLM/pull/48
* fix trust_code by wejoncy in https://github.com/wejoncy/QLLM/pull/49
* quick fix for turing awq 75 by wejoncy in https://github.com/wejoncy/QLLM/pull/50
* fix low_cpu_mem_usage by wejoncy in https://github.com/wejoncy/QLLM/pull/51
* fix model dtype ,default half by wejoncy in https://github.com/wejoncy/QLLM/pull/52
**Full Changelog**: https://github.com/wejoncy/QLLM/compare/v0.1.3...v0.1.4