What's Changed
* (UI) Fix viewing home page keys on a new DB by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7042
* build(model_prices_and_context_window.json): add bedrock region model⦠by krrishdholakia in https://github.com/BerriAI/litellm/pull/7044
* Update SearchBar by yujonglee in https://github.com/BerriAI/litellm/pull/6982
* (fix) litellm router.aspeech by ishaan-jaff in https://github.com/BerriAI/litellm/pull/6962
* (UI) perf improvement - cache internal user tab results by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7054
* (fix) adding public routes when using custom header by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7045
* LiteLLM Minor Fixes & Improvements (12/05/2024) by krrishdholakia in https://github.com/BerriAI/litellm/pull/7051
* feat: Add gemini-exp-1206 model configuration with 2M input tokens by paulmaunders in https://github.com/BerriAI/litellm/pull/7064
* Correct Vertex Embedding Model Data/Prices by emerzon in https://github.com/BerriAI/litellm/pull/7069
* litellm not honoring OPENAI_ORGANIZATION env var by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7066
* Provider Budget Routing - Get Budget, Spend Details by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7063
* Feat - add groq/llama3.3 models by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7076
* (feat) Allow enabling logging message / response for specific virtual keys by ishaan-jaff in https://github.com/BerriAI/litellm/pull/7071
New Contributors
* paulmaunders made their first contribution in https://github.com/BerriAI/litellm/pull/7064
**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.53.7...v1.53.8
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.53.8
Don't want to maintain your internal proxy? get in touch π
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed β
| 220.0 | 252.68788116416482 | 6.227999496173144 | 0.0 | 1864 | 0 | 198.31458400000201 | 2829.406032999941 |
| Aggregated | Passed β
| 220.0 | 252.68788116416482 | 6.227999496173144 | 0.0 | 1864 | 0 | 198.31458400000201 | 2829.406032999941 |