What's Changed
* Add and update template READMEs by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/405
* Improve Rust crates docs by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/406
* Expose phi3v loader and remove unused deps by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/408
* Support GGUF Mixtral format where experts are in one tensor by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/355
* Refactor with normal loading metadata for vision models by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/409
* Phi 3 vision ISQ support by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/410
* Remove causal masks cache by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/412
* Fix: use new slice_assign by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/415
* Fix Phi-3 GGUF by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/414
* Implement gpt2 (BPE) GGUF tokenizer conversion by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/397
* Support chat template from GGUF by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/416
* Expose API to specify dtype during loading by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/417
* Lock candle version to commit by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/419
* Bump version to 0.1.17 by EricLBuehler in https://github.com/EricLBuehler/mistral.rs/pull/420
**Full Changelog**: https://github.com/EricLBuehler/mistral.rs/compare/v0.1.16...v0.1.17