OpenLLM

Latest version: v0.6.20

0.4.38

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.38 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



What's Changed
* fix(mixtral): correct chat templates to remove additional spacing by aarnphm in https://github.com/bentoml/OpenLLM/pull/774
* fix(cli): correct set arguments for `openllm import` and `openllm build` by aarnphm in https://github.com/bentoml/OpenLLM/pull/775
* fix(mixtral): setup hack atm to load weights from pt specifically instead of safetensors by aarnphm in https://github.com/bentoml/OpenLLM/pull/776


**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.37...v0.4.38

0.4.37

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.37 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



What's Changed
* feat(mixtral): correct support for mixtral by aarnphm in https://github.com/bentoml/OpenLLM/pull/772
* chore: running all script when installation by aarnphm in https://github.com/bentoml/OpenLLM/pull/773


**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.36...v0.4.37

0.4.36

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.36 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



What's Changed
* feat(openai): supports echo by aarnphm in https://github.com/bentoml/OpenLLM/pull/760
* fix(openai): logprobs when echo is enabled by aarnphm in https://github.com/bentoml/OpenLLM/pull/761
* ci: pre-commit autoupdate [pre-commit.ci] by pre-commit-ci in https://github.com/bentoml/OpenLLM/pull/767
* chore(deps): bump docker/metadata-action from 5.2.0 to 5.3.0 by dependabot in https://github.com/bentoml/OpenLLM/pull/766
* chore(deps): bump actions/setup-python from 4.7.1 to 5.0.0 by dependabot in https://github.com/bentoml/OpenLLM/pull/765
* chore(deps): bump taiki-e/install-action from 2.21.26 to 2.22.0 by dependabot in https://github.com/bentoml/OpenLLM/pull/764
* chore(deps): bump aquasecurity/trivy-action from 0.14.0 to 0.16.0 by dependabot in https://github.com/bentoml/OpenLLM/pull/763
* chore(deps): bump github/codeql-action from 2.22.8 to 2.22.9 by dependabot in https://github.com/bentoml/OpenLLM/pull/762
* feat: mixtral support by aarnphm in https://github.com/bentoml/OpenLLM/pull/770
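
The `echo` and `logprobs` entries above refer to fields of an OpenAI-style completions request. The following is a minimal sketch of how such a request body might be built and sent, not OpenLLM's own client code. The base URL `http://localhost:3000/v1`, the `/completions` route, and the helper names `build_completion_request` and `send_completion` are assumptions made for illustration; check the address and routes your server actually reports on startup.

```python
import json
import urllib.request

# Assumption: a server is listening locally and exposes an OpenAI-compatible
# completions route under this base URL. Adjust to match your deployment.
BASE_URL = "http://localhost:3000/v1"


def build_completion_request(prompt, model, echo=True, logprobs=1, max_tokens=16):
    """Build the JSON body for an OpenAI-style completions request (hypothetical helper)."""
    return {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "echo": echo,          # ask for the prompt to be returned along with the completion
        "logprobs": logprobs,  # number of top log probabilities to return per token
    }


def send_completion(payload, base_url=BASE_URL):
    """POST the payload to the assumed completions route and decode the JSON reply."""
    req = urllib.request.Request(
        f"{base_url}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


if __name__ == "__main__":
    body = build_completion_request(
        "The capital of France is", "HuggingFaceH4/zephyr-7b-beta"
    )
    print(json.dumps(body, indent=2))
    # Uncomment once a server is running at BASE_URL:
    # print(send_completion(body))
```

Building the payload separately from sending it keeps the request shape inspectable without a running server.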


**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.35...v0.4.36

0.4.35

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.35 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



What's Changed
* chore(deps): bump pypa/gh-action-pypi-publish from 1.8.10 to 1.8.11 by dependabot in https://github.com/bentoml/OpenLLM/pull/749
* chore(deps): bump docker/metadata-action from 5.0.0 to 5.2.0 by dependabot in https://github.com/bentoml/OpenLLM/pull/751
* chore(deps): bump taiki-e/install-action from 2.21.19 to 2.21.26 by dependabot in https://github.com/bentoml/OpenLLM/pull/750
* ci: pre-commit autoupdate [pre-commit.ci] by pre-commit-ci in https://github.com/bentoml/OpenLLM/pull/753
* fix(logprobs): explicitly set logprobs=None by aarnphm in https://github.com/bentoml/OpenLLM/pull/757


**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.34...v0.4.35

0.4.34

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.34 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



What's Changed
* feat(models): Support qwen by yansheng105 in https://github.com/bentoml/OpenLLM/pull/742

New Contributors
* yansheng105 made their first contribution in https://github.com/bentoml/OpenLLM/pull/742

**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.33...v0.4.34

0.4.33

Usage

List all available models: `openllm models`

To start an LLM: `python -m openllm start HuggingFaceH4/zephyr-7b-beta`

To run OpenLLM within a container environment (requires GPUs): `docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.33 start HuggingFaceH4/zephyr-7b-beta`

Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)



**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.32...v0.4.33
