What's Changed 27
* add vllm>=0.4.1 by liyucheng09 in https://github.com/microsoft/MInference/pull/19, https://github.com/microsoft/MInference/pull/44
* Feature(MInference): update HF demo information, thanks AK's sponsoring by iofu728 in https://github.com/microsoft/MInference/pull/22
* Feature(MInference): add unittest by iofu728 in https://github.com/microsoft/MInference/pull/31, https://github.com/microsoft/MInference/pull/32
* Feature(MInference): add triton-based decoding in case flash_attn is not available by liyucheng09 in https://github.com/microsoft/MInference/pull/35
* Feature(MInference): add e2e benchmark using vllm by iofu728 in https://github.com/microsoft/MInference/pull/49
* Feature(MInference): support llama 3.1 by iofu728 in https://github.com/microsoft/MInference/pull/54
* Hotfix(MInference): fix the import warnings, fix the apply_rotary_pos… by iofu728 in https://github.com/microsoft/MInference/pull/30
New Contributors
* liyucheng09 made their first contribution in https://github.com/microsoft/MInference/pull/19
**Full Changelog**: https://github.com/microsoft/MInference/compare/v0.1.4...v0.1.5