Eval-mm

Latest version: v0.4.0

Safety actively analyzes 722631 Python packages for vulnerabilities to keep your Python projects secure.

Page 1 of 2

0.4.0

What's Changed
* Refactoring Task and Scorer class by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/150

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.3.0...v0.4.0

0.3.0

What's Changed
* Generalize aggregate() output type and Remove unnecessary methods by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/131
* Improve JDocQA's preparation time and Fix JMMMU scoring and Add phi4 and Refactoring by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/141
* Add visualization script by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/143
* Fix Heron-bench scoring and Add Asagi model by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/146

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.2.2...v0.3.0

0.2.2

What's Changed
* [WIP] Add gemma3 and Qwen2.5 VL and sarashina and Refactoring by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/123

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.2.1...v0.2.2

0.2.1

What's Changed
* Fix vilaja group dependency by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/107
* JIC-VQA評価データセットの追加 by PeifeiZhu in https://github.com/llm-jp/llm-jp-eval-mm/pull/116
* add mecha-ja by Silviase in https://github.com/llm-jp/llm-jp-eval-mm/pull/122

New Contributors
* PeifeiZhu made their first contribution in https://github.com/llm-jp/llm-jp-eval-mm/pull/116

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.2.0...v0.2.1

0.2.0

What's Changed
* Refactoring and Fix OpenAI API bug by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/89
* Use uv by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/96
* Add mmmu and llava-itw tasks by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/97
* Add GitHub pages by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/100
* 不要なファイルの消去 by Silviase in https://github.com/llm-jp/llm-jp-eval-mm/pull/102
* Add acknowledge section by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/103

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.1.2...v0.2.0

0.1.2

What's Changed
* Fix the way to import model by speed1313 in https://github.com/llm-jp/llm-jp-eval-mm/pull/83

**Full Changelog**: https://github.com/llm-jp/llm-jp-eval-mm/compare/v0.1.1...v0.1.2

Page 1 of 2

Releases

Has known vulnerabilities

Eval-mm

Page 1 of 2

0.4.0

0.3.0

0.2.2

0.2.1

0.2.0

0.1.2

Page 1 of 2

Links

Releases