๐ Highlights
- ๐ **New API Integrations:** A leap forward with the addition of multiple new APIs, including Baidu, Moonshot, Sensetime, and more, broadening the scope and capabilities of OpenCompass.
- ๐ต **Circular Evaluation Feature:** Introducing Circular Eval, an enhancement for comprehensive and dynamic evaluations within the platform.
- ๐ค Turbomind Inference Integration: Integration of Turbomind inference through its RPC API, enhancing the platform's inferencing capabilities.
๐ New Features & Enhancements
- **Model & API Development:** Explore new capabilities with DataCanvas Alaya LM, Lightllm API, 360API, and enhanced Turbomind Python API integration ([612](https://github.com/open-compass/opencompass/pull/612), [#613](https://github.com/open-compass/opencompass/pull/613), [#601](https://github.com/open-compass/opencompass/pull/601), [#484](https://github.com/open-compass/opencompass/pull/484)).
- **Circular Evaluation Implementation:** Elevate your evaluation methods with the newly added Circular Eval feature, offering a more nuanced and detailed analysis capability ([610](https://github.com/open-compass/opencompass/pull/610)).
- **Rich Dataset Additions:** Enrich your research with new datasets - FinanceIQ, SVAMP, GSM_Hard, and updated Mathbench for diverse applications ([596](https://github.com/open-compass/opencompass/pull/596), [#604](https://github.com/open-compass/opencompass/pull/604), [#619](https://github.com/open-compass/opencompass/pull/619), [#580](https://github.com/open-compass/opencompass/pull/580), [#607](https://github.com/open-compass/opencompass/pull/607)).
๐ Improvements & Fixes
- **Subjective Evaluation Bug Fixes:** Improved accuracy in subjective evaluations ([589](https://github.com/open-compass/opencompass/pull/589)).
- **Dataset and Feature Fixes:** Resolving issues in CMB dataset, various feature enhancements, and fixes ([587](https://github.com/open-compass/opencompass/pull/587), [#592](https://github.com/open-compass/opencompass/pull/592), [#615](https://github.com/open-compass/opencompass/pull/615), [#632](https://github.com/open-compass/opencompass/pull/632)).
๐ Documentation Updates
- **README & FAQ Enhancements:** Updated for better clarity and assistance ([582](https://github.com/open-compass/opencompass/pull/582), [#622](https://github.com/open-compass/opencompass/pull/622), [#628](https://github.com/open-compass/opencompass/pull/628), [#629](https://github.com/open-compass/opencompass/pull/629)).
- **Typo and Spelling Corrections:** Ensuring accuracy and professionalism in documentation ([594](https://github.com/open-compass/opencompass/pull/594), [#637](https://github.com/open-compass/opencompass/pull/637)).
๐ New Contributors
Welcoming new contributors to the OpenCompass family!
- **rahidzeynal**, **Sniper970119**, **ZhangRaymond**, **HunterKruger**, **helloyongyang**, and **Yggdrasill7D6**. Your contributions are greatly appreciated!
What's Changed
* Add author as: author='OpenCompass Contributors' by rahidzeynal in https://github.com/open-compass/opencompass/pull/578
* [Doc] Update README by tonysy in https://github.com/open-compass/opencompass/pull/582
* [Feature] Update mathbench by tonysy in https://github.com/open-compass/opencompass/pull/580
* Fix bugs in subjective evaluation by frankweijue in https://github.com/open-compass/opencompass/pull/589
* [Fix] fix cmb dataset by Leymore in https://github.com/open-compass/opencompass/pull/587
* [Fix] change save_every defaults to 1 by yingfhu in https://github.com/open-compass/opencompass/pull/592
* update word spell by Sniper970119 in https://github.com/open-compass/opencompass/pull/594
* Add FinanceIQ dataset by ZhangRaymond in https://github.com/open-compass/opencompass/pull/596
* [Feat] support humaneval and mbpp passk by yingfhu in https://github.com/open-compass/opencompass/pull/598
* [Feature] Add multi-prompt generation demo by jingmingzhuo in https://github.com/open-compass/opencompass/pull/568
* Mathbench update postprocess by liushz in https://github.com/open-compass/opencompass/pull/600
* [Feature] Add arithmetic to mathbench by liushz in https://github.com/open-compass/opencompass/pull/607
* Add support for DataCanvas Alaya LM by HunterKruger in https://github.com/open-compass/opencompass/pull/612
* [Feature] Support Lightllm api by helloyongyang in https://github.com/open-compass/opencompass/pull/613
* [Feature] Support 360API and FixKRetriever for CSQA dataset by tonysy in https://github.com/open-compass/opencompass/pull/601
* Integrate turbomind python api by lvhan028 in https://github.com/open-compass/opencompass/pull/484
* [Bug] Update api with generation_kargs by tonysy in https://github.com/open-compass/opencompass/pull/614
* [Fix] Fix gen inferencer by Leymore in https://github.com/open-compass/opencompass/pull/615
* [Docs] update ds1000 code eval docs by yingfhu in https://github.com/open-compass/opencompass/pull/618
* [Feature] Add SVAMP dataset by liushz in https://github.com/open-compass/opencompass/pull/604
* [Feature] support download from modelscope by KevinNuNu in https://github.com/open-compass/opencompass/pull/534
* [Doc] Update README and requirements. by tonysy in https://github.com/open-compass/opencompass/pull/622
* [Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes by Leymore in https://github.com/open-compass/opencompass/pull/625
* [API] Update API by tonysy in https://github.com/open-compass/opencompass/pull/624
* [Feature] Add circular eval by Leymore in https://github.com/open-compass/opencompass/pull/610
* [Doc] Update FAQ by Leymore in https://github.com/open-compass/opencompass/pull/628
* [Doc] Update README by tonysy in https://github.com/open-compass/opencompass/pull/629
* [Bug] fix icl eval with nested list by yingfhu in https://github.com/open-compass/opencompass/pull/632
* Fix LightllmAPI list bug by helloyongyang in https://github.com/open-compass/opencompass/pull/635
* fix typo in README by Yggdrasill7D6 in https://github.com/open-compass/opencompass/pull/637
* [Sync] update codes by Leymore in https://github.com/open-compass/opencompass/pull/641
* [Feature] Add GSM_Hard dataset by liushz in https://github.com/open-compass/opencompass/pull/619
* [Feat] support zhipu post process by yingfhu in https://github.com/open-compass/opencompass/pull/642
* [Sync] Bump version to 0.1.9 by Leymore in https://github.com/open-compass/opencompass/pull/644
Explore the detailed changes in the [full changelog](https://github.com/open-compass/opencompass/compare/0.1.8...0.1.9).
Thank you to all the contributors for this release. Your dedication and hard work continue to enhance OpenCompass, making it an ever-evolving and dynamic tool for the community. Let's dive into the new possibilities with OpenCompass v0.1.9! ๐๐งฎ๐ป