What's Changed
* docs: Add GPU inference doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4654
* chore: update quickstart by ssheng in https://github.com/bentoml/BentoML/pull/4655
* docs: Add JSON output for bentovllm by Sherlock113 in https://github.com/bentoml/BentoML/pull/4657
* chore: cleanup quickstart by ssheng in https://github.com/bentoml/BentoML/pull/4658
* docs: Update help info by Sherlock113 in https://github.com/bentoml/BentoML/pull/4664
* fix: remove the uvicorn server header by frostming in https://github.com/bentoml/BentoML/pull/4665
* docs: Fix format by Sherlock113 in https://github.com/bentoml/BentoML/pull/4666
* docs: Add model composition doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4668
* docs: Update example project list by Sherlock113 in https://github.com/bentoml/BentoML/pull/4673
* docs: Add the monitoring and data collection doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4662
* docs: Add add_asgi_middleware doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4672
* fix: delete useless enum and fix enum value by FogDong in https://github.com/bentoml/BentoML/pull/4674
* docs: Add RAG tutorial by Sherlock113 in https://github.com/bentoml/BentoML/pull/4675
* docs: Update the clients doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4676
* docs: Add some explanations for bentoml.models.get by Sherlock113 in https://github.com/bentoml/BentoML/pull/4660
* docs: Add e2e test doc by Sherlock113 in https://github.com/bentoml/BentoML/pull/4679
* fix(cloud client): various type error by bojiang in https://github.com/bentoml/BentoML/pull/4680
* fix(cli): bentoml cli verbosity not passed to the subprocess correctly by frostming in https://github.com/bentoml/BentoML/pull/4661
**Full Changelog**: https://github.com/bentoml/BentoML/compare/v1.2.11...v1.2.12