Added:
- Build `vulkan_1_x` for general GPU.
- Build `cuda 12.4.1` as default.
Changed:
- Renamed examples for TinyLlama (chat, tool calling) and OpenAI.
- Updated demo models definitions.
- Updated examples (chat, tool calling).
- `get_special_tokens` not supports parameter `force_standard_special_tokens: bool=False` which bypasses tokenizer's special tokens with standard/common ones.
- Build `cuda 12.5.1` as additional build target but packaged on PyPI.
- Build `cuda 12.6` as additional build target but packaged on PyPI.
- Build `openblas` as additional build target but packaged on PyPI.
Fixed:
- Handle `Options.no_display_prompt` on Python side.