What's Changed * fixed double counting corner case for precision / average precision in https://github.com/relari-ai/continuous-eval/pull/55 * Fix required keyword for code string to ground_truth_answers in https://github.com/relari-ai/continuous-eval/pull/56
New Contributors * stantonius made their first contribution in https://github.com/relari-ai/continuous-eval/pull/56
- Metrics batch execution now use threads by default - Bug fixing
0.3.1
Key points:
- Added `from_data` class method to `Dataset` class - Fixed `is_empty` method in EvaluationResults, MetricsResults, and TestResults - Added error handling in LLM-based metrics
0.2.7
What's Changed * Added Code Evaluation Metrics in https://github.com/relari-ai/continuous-eval/pull/29