AutoModel` compatible models will now use the official `tokenizers` library, which improves the decoding accuracy, especially for all non llama based models.
If you want to specify a tokenizer manually, it can be set via the `tokenizer_path_or_repo_id` parameter. If you want to use the default GGML tokenizer the huggingface support can be disabled via `use_hf_tokenizer`.