What's Changed
* Set CPU Affinity: Electric Boogaloo V2 by KaraKaraWitch in https://github.com/PygmalionAI/aphrodite-engine/pull/187
* chore: backlog 1 by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/191
* feat: support GPTQ 2, 3, and 8bit quants by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/181
* feat: FP8 KV Cache (ENG-4) by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/185
* feat: tokenizer endpoint for OpenAI API by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/195
* feat: rejection sampler by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/197
* feat: better mixtral parallelism by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/193
* fix: triton compile error by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/200
* feat: reduce sampler overhead by making it less blocking by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/198
* fix: test units by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/201
* merge branch 'dev' into 'main' by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/203
* feat: bump cuda to 12.1 by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/205
* bump version to 0.4.6 by AlpinDale in https://github.com/PygmalionAI/aphrodite-engine/pull/204

New Contributors
* KaraKaraWitch made their first contribution in https://github.com/PygmalionAI/aphrodite-engine/pull/187

**Full Changelog**: https://github.com/PygmalionAI/aphrodite-engine/compare/v0.4.5...v0.4.6