Alpaca-eval

Latest version: v0.6.6

Safety actively analyzes 693883 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 6

0.2.4

What's Changed
* Add Baichuan-13B-Chat Results by inferLLM in https://github.com/tatsu-lab/alpaca_eval/pull/85
* Add ChatGLM2-6B Results by inferLLM in https://github.com/tatsu-lab/alpaca_eval/pull/86
* [ENH] add chat llama2 by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/87
* [ENH] automatically add minimal/verified by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/88
* [ENH] add replicate + llama 70B by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/90
* [ENH] add llama 70B outputs by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/91
* [ENH] optionally return raw completions by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/92
* [ENH] eval_parser by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/93
* [ENH] json parser by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/94

New Contributors
* inferLLM made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/85

**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.2.3...v0.2.4

0.2.3

What's Changed
* [ENH] make completion_parser easier to inherit by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/81
* [ENH] Add length by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/79
* [ENH] add format_sample_sheets.py to CI by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/82
* [ENH] adding samples to leadeboard by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/83


**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.2.2...v0.2.3

0.2.2

What's Changed
* [ENH] add base annotator by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/76
* [ENH] add claude v2 by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/78


**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.2.1...v0.2.2

0.2.1

What's Changed
* Update WizardLM 13B V1.1 results by victorsungo in https://github.com/tatsu-lab/alpaca_eval/pull/66
* [ENH] make. it easier to cache to a DB by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/73
* add vicuna v1.3 results by rtaori in https://github.com/tatsu-lab/alpaca_eval/pull/74
* gpt4 annotations for vicuna v1.3 by rtaori in https://github.com/tatsu-lab/alpaca_eval/pull/75

New Contributors
* victorsungo made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/66

**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.2.0...v0.2.1

0.2.0

What's Changed
* [CI] auto release by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/72


**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.1.9...v0.2.0

0.1.7.1

What's Changed
* Add Custom OpenAI API Endpoint Support and OpenChat Results by imoneoi in https://github.com/tatsu-lab/alpaca_eval/pull/42
* get falcon models running decoding by rtaori in https://github.com/tatsu-lab/alpaca_eval/pull/47
* [TEST] test by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/50
* [ENH] upgrade anthropic 0.3 by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/54
* [CLEAN] black by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/55
* [TEST] setting up test CI by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/56
* Add Baize v2 13B by JetRunner in https://github.com/tatsu-lab/alpaca_eval/pull/49
* [CI] leaderboard formatting by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/58
* format leaderboard for baize by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/59
* [ENH] remove inputs from example by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/60
* [CLEAN] setting up precommit by YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/61

New Contributors
* imoneoi made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/42
* JetRunner made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/49

**Full Changelog**: https://github.com/tatsu-lab/alpaca_eval/compare/v0.1.6...v0.1.7.1

Page 5 of 6

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.