What's Changed
* fix(types/utils.py): Support deepseek prompt caching by krrishdholakia in https://github.com/BerriAI/litellm/pull/5019
* build(ui): allow admin_viewer to view teams tab by krrishdholakia in https://github.com/BerriAI/litellm/pull/5027
* [Fix] Whisper Caching - Use correct cache keys for checking request in cache by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5026
* fix(utils.py): Fix adding azure models on ui by krrishdholakia in https://github.com/BerriAI/litellm/pull/5029
* Allow Bedrock to set custom STS endpoint for OIDC flow by Manouchehri in https://github.com/BerriAI/litellm/pull/4982
* [Feat] Add support for Vertex AI fine tuning endpoints by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5028
* [Feat] Add support for Vertex AI Fine tuning on LiteLLM Proxy by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5030
* [Feat] Vertex AI fine tuning - support translating hyperparameters by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5034
* Docs - Add example of Vertex AI fine tuning API by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5035
* [Feat] support all native vertex ai endpoints - Gemini API, Embeddings API, Imagen API, Batch prediction API, Tuning API, CountTokens API by ishaan-jaff in https://github.com/BerriAI/litellm/pull/5037
**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.42.11...v1.42.12
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 120.0 | 147.37461002147643 | 6.379597709990025 | 0.0033418531744316532 | 1909 | 1 | 27.773200999973824 | 2754.675483000028 |
| Aggregated | Passed ✅ | 120.0 | 147.37461002147643 | 6.379597709990025 | 0.0033418531744316532 | 1909 | 1 | 27.773200999973824 | 2754.675483000028 |