Usage
All available models: openllm models
To start a LLM: python -m openllm start opt
To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P ghcr.io/bentoml/openllm:0.4.12 start opt
To run OpenLLM Clojure UI (community-maintained): docker run -p 8420:80 ghcr.io/bentoml/openllm-ui-clojure:0.4.12
Find more information about this release in the [CHANGELOG.md](https://github.com/bentoml/OpenLLM/blob/main/CHANGELOG.md)
What's Changed
* fix(envvar): explicitly set NVIDIA_DRIVER_CAPABILITIES by aarnphm in https://github.com/bentoml/OpenLLM/pull/681
* fix(torch_dtype): correctly infer based on options by aarnphm in https://github.com/bentoml/OpenLLM/pull/682
**Full Changelog**: https://github.com/bentoml/OpenLLM/compare/v0.4.11...v0.4.12