What's Changed
* Add Qwen1.5-72B-Chat to AlpacaEval by Lukeming-tsinghua in https://github.com/tatsu-lab/alpaca_eval/pull/226
* Add claude-instant-1.2, deepseek-llm-67b-chat, wizardlm-70b, Qwen-14B-Chat (config + outputs without annotations) by gblazex in https://github.com/tatsu-lab/alpaca_eval/pull/228
* [DATA] Adding annotations for the arena models by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/229
* Update README.md - Add missing "Y" to "ou" by yoderj in https://github.com/tatsu-lab/alpaca_eval/pull/230
* [DEV] Analyzing length-controlled metrics. by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/231
* [DOC] add annotation interpretation by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/232
* [DATA] add results from the Arena openai models by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/234
* update ELO for llama-2-13b-chat-hf by gblazex in https://github.com/tatsu-lab/alpaca_eval/pull/235
* [NOTEBOOK] add length-corrected GLM by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/237
* [ENH] add inverse mapper to make sure in and out types are the same by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/240
* [ENH] update to allow AF to use AE by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/241
New Contributors
* Lukeming-tsinghua made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/226
* yoderj made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/230
**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.5.3...v0.5.4