- **Import Hugging Face Transformers Model:** the `bentoml.transformers.import_model` API imports pretrained transformers models directly from HuggingFace. Using this API allows importing Transformers models into the BentoML model store without loading the model into memory. The `bentoml.transformers.import_model` API takes the first argument to be the model name in BentoML store, and the second argument to be the `model_id` on HuggingFace Hub.
python
import bentoml
bentomodel = bentoml.transformers.import_model("zephyr-7b-beta", "HuggingFaceH4/zephyr-7b-beta")
- **Standardize with `nvidia-ml-py`:** BentoML now uses the official `nvidia-ml-py` package instead of `pynvml` to avoid conflict with other packages.
- **Define Environment Variable in Configuration:** Within `bentoml_configuration.yaml`, values in the form of `${ENV_VAR}` will be expanded at runtime to the value of the corresponding environment variable, but please note that this only supports string types.
What's Changed
* docs: Update the deployment docs by Sherlock113 in https://github.com/bentoml/BentoML/pull/4260
* ci: pre-commit autoupdate [skip ci] by pre-commit-ci in https://github.com/bentoml/BentoML/pull/4264
* feat: import model for transformers framework by MingLiangDai in https://github.com/bentoml/BentoML/pull/4247
* build: Use official nvidia-ml-py package instead of fork by ecederstrand in https://github.com/bentoml/BentoML/pull/4208
New Contributors
* MingLiangDai made their first contribution in https://github.com/bentoml/BentoML/pull/4247
* ecederstrand made their first contribution in https://github.com/bentoml/BentoML/pull/4208
**Full Changelog**: https://github.com/bentoml/BentoML/compare/v1.1.7...v1.1.9