What's Changed
* [Feat] Add Exception mapping for Azure ContentPolicyViolationError by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3449
* Add return_exceptions to litellm.batch_completion by ffreemt in https://github.com/BerriAI/litellm/pull/3397
* fix(caching.py): fix redis caching ping check by krrishdholakia in https://github.com/BerriAI/litellm/pull/3447
* change max_tokens type to int by TanaroSch in https://github.com/BerriAI/litellm/pull/1530
* Revert "Add return_exceptions to litellm.batch_completion" by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3450
* fix(factory.py): support 'function' openai message role for anthropic by krrishdholakia in https://github.com/BerriAI/litellm/pull/3448
* [Feat] Return model, api_base and first 100 chars of messages in Azure Exceptions by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3451
* [FEAT] router set custom num retries for ContentPolicyViolationErrorRetries, RateLimitErrorRetries, BadRequestErrorRetries etc by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3456
* [Feat] return num_retries in litellm.Router exceptions by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3457
* [Feat] Set a Retry Policy per model group by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3460
* Fix OpenMeter sync logger by tothandras in https://github.com/BerriAI/litellm/pull/3452
* feat(openai.py): add support for openai assistants by krrishdholakia in https://github.com/BerriAI/litellm/pull/3455
* gunicorn version bump by RoniGurvichCycode in https://github.com/BerriAI/litellm/pull/3463
* Fix Ollama streamed tool calls. Set finish_reason to tool_calls for all tool_calls responses by jackmpcollins in https://github.com/BerriAI/litellm/pull/3469
* Allowing extra headers for bedrock by themrzmaster in https://github.com/BerriAI/litellm/pull/3299
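The router changes above (per-exception retry counts, `num_retries` on raised exceptions, and a retry policy per model group) can be pictured with a small standalone sketch. It does not import litellm and is not its implementation: `RetryPolicySketch`, `call_with_retries`, `POLICY_BY_GROUP`, the stand-in exception classes, and the group names are all hypothetical. Only the `...Retries` field names are taken from the PR title.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, TypeVar

T = TypeVar("T")


# Hypothetical stand-ins for the exception types named in the notes above.
class RateLimitError(Exception):
    pass


class ContentPolicyViolationError(Exception):
    pass


class BadRequestError(Exception):
    pass


@dataclass
class RetryPolicySketch:
    """Per-exception retry counts; None means 'fall back to the default'."""

    RateLimitErrorRetries: Optional[int] = None
    ContentPolicyViolationErrorRetries: Optional[int] = None
    BadRequestErrorRetries: Optional[int] = None

    def retries_for(self, exc: Exception, default: int) -> int:
        # Look up "<ExceptionName>Retries" on the policy, if such a field exists.
        value = getattr(self, type(exc).__name__ + "Retries", None)
        return default if value is None else value


def call_with_retries(
    fn: Callable[[], T], policy: RetryPolicySketch, default_retries: int = 2
) -> T:
    retries_done = 0
    while True:
        try:
            return fn()
        except Exception as exc:
            if retries_done >= policy.retries_for(exc, default_retries):
                # Attach the retry count to the exception before re-raising.
                exc.num_retries = retries_done
                raise
            retries_done += 1


# One policy per model group (group names are made up for illustration).
POLICY_BY_GROUP: Dict[str, RetryPolicySketch] = {
    "azure-gpt": RetryPolicySketch(
        ContentPolicyViolationErrorRetries=0, RateLimitErrorRetries=4
    ),
    "default": RetryPolicySketch(),
}
```

In this sketch a content-policy violation in the `azure-gpt` group fails immediately, while a rate-limit error is retried up to four times; see the linked PRs for the actual router parameters.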
New Contributors
* ffreemt made their first contribution in https://github.com/BerriAI/litellm/pull/3397
* TanaroSch made their first contribution in https://github.com/BerriAI/litellm/pull/1530
* tothandras made their first contribution in https://github.com/BerriAI/litellm/pull/3452
* RoniGurvichCycode made their first contribution in https://github.com/BerriAI/litellm/pull/3463
**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.35.38-stable...v1.36.0
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 81 | 88.20979605555442 | 1.623255922227879 | 0.0 | 486 | 0 | 75.38953300002049 | 1264.5359969999959 |
| /health/liveliness | Passed ✅ | 65 | 68.12845653229724 | 15.253929623075564 | 0.0 | 4567 | 0 | 63.39287800000193 | 1385.0202130000184 |
| /health/readiness | Passed ✅ | 65 | 68.59345058785526 | 15.511112145733067 | 0.0033400327617857596 | 4644 | 1 | 63.46367399999053 | 1491.452105999997 |
| Aggregated | Passed ✅ | 65 | 69.35759579210092 | 32.38829769103651 | 0.0033400327617857596 | 9697 | 1 | 63.39287800000193 | 1491.452105999997 |
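The Aggregated row appears to be derived from the per-endpoint rows: counts and rates are summed, the average is weighted by request count, and min/max are taken across rows. The load-testing tool's exact method is not stated here, so this is an assumption that happens to be consistent with the figures; `EndpointStats` and `aggregate` are hypothetical names. The median is omitted because it cannot be recovered from per-endpoint medians.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EndpointStats:
    name: str
    avg_ms: float
    requests_per_s: float
    request_count: int
    failure_count: int
    min_ms: float
    max_ms: float


# Per-endpoint rows copied from the table above.
rows = [
    EndpointStats("/chat/completions", 88.20979605555442, 1.623255922227879,
                  486, 0, 75.38953300002049, 1264.5359969999959),
    EndpointStats("/health/liveliness", 68.12845653229724, 15.253929623075564,
                  4567, 0, 63.39287800000193, 1385.0202130000184),
    EndpointStats("/health/readiness", 68.59345058785526, 15.511112145733067,
                  4644, 1, 63.46367399999053, 1491.452105999997),
]


def aggregate(rows: List[EndpointStats]) -> EndpointStats:
    total = sum(r.request_count for r in rows)
    return EndpointStats(
        name="Aggregated",
        # Average response time weighted by each endpoint's request count.
        avg_ms=sum(r.avg_ms * r.request_count for r in rows) / total,
        requests_per_s=sum(r.requests_per_s for r in rows),
        request_count=total,
        failure_count=sum(r.failure_count for r in rows),
        min_ms=min(r.min_ms for r in rows),
        max_ms=max(r.max_ms for r in rows),
    )


agg = aggregate(rows)
print(agg)
```

The single failure out of 9697 requests comes from `/health/readiness`; `/chat/completions` recorded none.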
v1.35.38-stable
What's Changed
* UI select start/ end time for viewing model metrics by ishaan-jaff in https://github.com/BerriAI/litellm/pull/3441
* Improve the document of Traceloop by paneru-rajan in https://github.com/BerriAI/litellm/pull/3445
* Improve mocking in `test_proxy_exception_mapping.py` by msabramo in https://github.com/BerriAI/litellm/pull/3408
New Contributors
* paneru-rajan made their first contribution in https://github.com/BerriAI/litellm/pull/3445
**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.35.38...v1.35.38-stable
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 55 | 58.640919096329625 | 1.4563005047336437 | 0.0 | 436 | 0 | 50.270481999973526 | 588.9671030000159 |
| /health/liveliness | Passed ✅ | 40 | 43.084301563953716 | 15.5116044586767 | 0.0 | 4644 | 0 | 37.74072900000647 | 1439.319705999992 |
| /health/readiness | Passed ✅ | 40 | 43.8615925310188 | 15.398039740417655 | 0.006680277544649742 | 4610 | 2 | 37.88508399998136 | 1424.1174170000193 |
| Aggregated | Passed ✅ | 40 | 44.15406385521118 | 32.365944703828 | 0.006680277544649742 | 9690 | 2 | 37.74072900000647 | 1439.319705999992 |