LiteLLM


1.22.10

What's Changed
* fix(proxy_server.py): do a health check on db before returning if proxy ready (if db connected) by krrishdholakia in https://github.com/BerriAI/litellm/pull/1856
* fix(utils.py): return finish reason for last vertex ai chunk by krrishdholakia in https://github.com/BerriAI/litellm/pull/1847
* fix(proxy/utils.py): if langfuse trace id passed in, include in slack alert by krrishdholakia in https://github.com/BerriAI/litellm/pull/1839
* [Feat] Budgets for 'user' param passed to /chat/completions, /embeddings etc by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1859

Semantic Caching Support - Add Semantic Caching to litellm💰 by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1829
- Use with LiteLLM Proxy https://docs.litellm.ai/docs/proxy/caching
- Use with litellm.completion https://docs.litellm.ai/docs/caching/redis_cache
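
Conceptually, a semantic cache keys on prompt meaning rather than exact text: the incoming prompt is embedded, compared against the embeddings of previously cached prompts, and counted as a hit only when similarity clears the configured threshold. A minimal sketch of that decision, assuming a generic `embed()` helper and an in-memory store (both hypothetical stand-ins, not LiteLLM internals):

```python
# Illustrative sketch of a semantic-cache lookup. embed() and the store
# layout are hypothetical stand-ins, not LiteLLM's actual implementation.
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

def semantic_lookup(prompt: str, store: list, embed, threshold: float = 0.8):
    """Return a cached response if any stored prompt is semantically close."""
    query_vec = embed(prompt)
    for cached_vec, cached_response in store:
        if cosine_similarity(query_vec, cached_vec) >= threshold:
            return cached_response  # hit: a similar-enough prompt was cached
    return None  # miss: caller runs the LLM, then appends (query_vec, response)
```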

Usage with Proxy


Step 1: Add `cache` to the config.yaml
```yaml
model_list:
  - model_name: gpt-3.5-turbo
    litellm_params:
      model: gpt-3.5-turbo
  - model_name: azure-embedding-model
    litellm_params:
      model: azure/azure-embedding-model
      api_base: os.environ/AZURE_API_BASE
      api_key: os.environ/AZURE_API_KEY
      api_version: "2023-07-01-preview"

litellm_settings:
  set_verbose: True
  cache: True          # set cache responses to True, litellm defaults to using a redis cache
  cache_params:
    type: "redis-semantic"
    similarity_threshold: 0.8   # similarity threshold for semantic cache
    redis_semantic_cache_embedding_model: azure-embedding-model # set this to a model_name set in model_list
```


Step 2: Add Redis Credentials to .env
Set either `REDIS_URL` or `REDIS_HOST` in your OS environment to enable caching.

```shell
REDIS_URL = ""       # REDIS_URL='redis://username:password@hostname:port/database'
# OR
REDIS_HOST = ""      # REDIS_HOST='redis-18841.c274.us-east-1-3.ec2.cloud.redislabs.com'
REDIS_PORT = ""      # REDIS_PORT='18841'
REDIS_PASSWORD = ""  # REDIS_PASSWORD='liteLlmIsAmazing'
```


**Additional kwargs**
You can pass in any additional `redis.Redis` argument by storing the variable and value in your OS environment, like this:
```shell
REDIS_<redis-kwarg-name> = ""
```
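As a rough illustration of that convention, such variables could be collected into keyword arguments for `redis.Redis` roughly like this (the prefix-stripping and casting below are assumptions for illustration, not LiteLLM's actual parsing code):

```python
import os

import redis  # assumes the redis-py package is installed

# Assumed convention: strip the REDIS_ prefix and lowercase the remainder,
# so REDIS_HOST -> host, REDIS_SOCKET_TIMEOUT -> socket_timeout, etc.
redis_kwargs = {
    key[len("REDIS_"):].lower(): value
    for key, value in os.environ.items()
    if key.startswith("REDIS_") and key != "REDIS_URL"
}

# Env values arrive as strings; numeric kwargs like port may need casting.
if "port" in redis_kwargs:
    redis_kwargs["port"] = int(redis_kwargs["port"])

client = redis.Redis(**redis_kwargs)
```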


Step 3: Run proxy with config
```shell
$ litellm --config /path/to/config.yaml
```


That's it!

(You'll see semantic-similarity on Langfuse if you set `langfuse` as a success_callback)
(FYI the api key here is deleted 🔑)

<img width="915" alt="Screenshot 2024-02-06 at 11 15 01 AM" src="https://github.com/BerriAI/litellm/assets/29436595/4bcd757a-f099-4221-a9e6-52d4401db4e8">
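
To exercise the cache end to end, you can point the OpenAI Python client (v1+) at the proxy and send two semantically similar prompts; with `redis-semantic` caching enabled, the second should be served from cache. The key and prompts below are placeholders:

```python
from openai import OpenAI

# Point the OpenAI client at the LiteLLM proxy; "sk-1234" is a placeholder key.
client = OpenAI(api_key="sk-1234", base_url="http://0.0.0.0:4000")

for prompt in ["who is Alan Turing?", "tell me about Alan Turing"]:
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=20,
    )
    # Cached responses come back with the same response id.
    print(response.id, response.choices[0].message.content)
```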


Usage with `litellm.completion`
```python
import os
import random

import litellm
from litellm import completion
from litellm.caching import Cache

litellm.cache = Cache(
    type="redis-semantic",
    host=os.environ["REDIS_HOST"],
    port=os.environ["REDIS_PORT"],
    password=os.environ["REDIS_PASSWORD"],
    similarity_threshold=0.8,
    redis_semantic_cache_embedding_model="text-embedding-ada-002",
)

# A random number keeps each run's prompt unique, so any cache hit
# comes from this run rather than an earlier one.
random_number = random.randint(1, 100000)

response1 = completion(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": f"write a one sentence poem about: {random_number}",
        }
    ],
    max_tokens=20,
)
print(f"response1: {response1}")

random_number = random.randint(1, 100000)

response2 = completion(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": f"write a one sentence poem about: {random_number}",
        }
    ],
    max_tokens=20,
)
print(f"response2: {response2}")

# The two prompts differ only in the number, so the semantic cache should
# serve response2 from cache; cached responses share the same id.
assert response1.id == response2.id
```


Budgets for 'user' param passed to /chat/completions, /embeddings etc
Set a budget for the `user` param passed to /chat/completions, without needing to create a key for every user.
docs: https://docs.litellm.ai/docs/proxy/users

How to Use
1. Define `litellm_settings.max_user_budget` in your config
```yaml
litellm_settings:
  max_budget: 10          # global budget for proxy
  max_user_budget: 0.0001 # budget for 'user' passed to /chat/completions
```


2. Make a /chat/completions call, passing 'user' - the first call works
```shell
curl --location 'http://0.0.0.0:4000/chat/completions' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer sk-zi5onDRdHGD24v0Zdn7VBA' \
--data '{
  "model": "azure-gpt-3.5",
  "user": "ishaan3",
  "messages": [
    {
      "role": "user",
      "content": "what time is it"
    }
  ]
}'
```


3. Make another /chat/completions call with the same 'user' - this call fails, since 'ishaan3' is over budget
```shell
curl --location 'http://0.0.0.0:4000/chat/completions' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer sk-zi5onDRdHGD24v0Zdn7VBA' \
--data '{
  "model": "azure-gpt-3.5",
  "user": "ishaan3",
  "messages": [
    {
      "role": "user",
      "content": "what time is it"
    }
  ]
}'
```


Error
```shell
{"error":{"message":"Authentication Error, ExceededBudget: User ishaan3 has exceeded their budget. Current spend: 0.0008869999999999999; Max Budget: 0.0001","type":"auth_error","param":"None","code":401}}
```
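Under the hood, the check amounts to comparing the user's tracked spend against `max_user_budget` before serving the request. A simplified sketch of that logic (illustrative only, not the proxy's actual code; the real proxy tracks spend in its database):

```python
# Hypothetical in-memory stand-in for the proxy's per-user spend tracking.
user_spend = {"ishaan3": 0.000887}  # cost of prior calls, in USD
MAX_USER_BUDGET = 0.0001            # litellm_settings.max_user_budget

def check_user_budget(user: str) -> None:
    spend = user_spend.get(user, 0.0)
    if spend > MAX_USER_BUDGET:
        raise PermissionError(
            f"ExceededBudget: User {user} has exceeded their budget. "
            f"Current spend: {spend}; Max Budget: {MAX_USER_BUDGET}"
        )

check_user_budget("ishaan3")  # raises, mirroring the 401 error above
```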






**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.22.9...v1.22.10

1.22.9

What's Changed
* [FEAT] show langfuse logging / cache tags better through proxy by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1857
* [Feat] Add Semantic Caching to litellm💰 by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1829


**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.22.8...v1.22.9

1.22.8

What's Changed
* [Fix] UI - Security - Litellm UI Keys meant for litellm-dashboard shouldn't be allowed to make non-management related requests by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1836
* Fix admin UI title and description by ushuz in https://github.com/BerriAI/litellm/pull/1842
* fix(langfuse.py): support logging failed llm api calls to langfuse by krrishdholakia in https://github.com/BerriAI/litellm/pull/1837
* [Feat] Proxy set upperbound params for key/generate by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1844
* build(requirements.txt): update the proxy requirements.txt by krrishdholakia in https://github.com/BerriAI/litellm/pull/1846


**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.22.5...v1.22.8

1.22.5

What's Changed
* Re-raise exception in async ollama streaming by vanpelt in https://github.com/BerriAI/litellm/pull/1750
* Add a Helm chart for deploying LiteLLM Proxy by ShaunMaher in https://github.com/BerriAI/litellm/pull/1602
* Update Perplexity models in model_prices_and_context_window.json by toniengelhardt in https://github.com/BerriAI/litellm/pull/1826
* (feat) Add sessionId for Langfuse. by Manouchehri in https://github.com/BerriAI/litellm/pull/1828
* [Feat] Sync model_prices_and_context_window.json and litellm/model_prices_and_context_window_backup.json by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1834

New Contributors
* vanpelt made their first contribution in https://github.com/BerriAI/litellm/pull/1750

**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.22.3...v1.22.5

1.22.3

What's Changed
* feat(utils.py): support cost tracking for openai/azure image gen models by krrishdholakia in https://github.com/BerriAI/litellm/pull/1805


**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.22.2...v1.22.3

1.22.2

Admin UI 🤠
- view spend, budget for signed in user
- view daily spend, top users for a key
![ui_3](https://github.com/BerriAI/litellm/assets/29436595/e379f469-7948-475d-a7b4-4c6c1ee1b392)
What's Changed
* Litellm vertex ai gecko support by krrishdholakia in https://github.com/BerriAI/litellm/pull/1794
* [Feat] Allow setting user roles for UserTable by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1795
* fix(vertex_ai.py): add async embedding support for vertex ai by krrishdholakia in https://github.com/BerriAI/litellm/pull/1797
* [UI] Show UserID, user_role on UI by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1800
* feat(bedrock.py): add stable diffusion image generation support by krrishdholakia in https://github.com/BerriAI/litellm/pull/1799
* [UI] view role on ui by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1801
* [Feat] UI view spend / budget per user by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1803
* fix(vertex_ai.py): treat vertex ai high-traffic error as a rate limit error - allows user-controlled backoff logic to work here by krrishdholakia in https://github.com/BerriAI/litellm/pull/1802
* [UI] View Key Spend Reports 🤠 by ishaan-jaff in https://github.com/BerriAI/litellm/pull/1807
* Support caching individual items in embedding list (Async embedding only) by krrishdholakia in https://github.com/BerriAI/litellm/pull/1809


**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.21.7...v1.22.2

