- Fixed import namespace issue in `readme.md` examples
- Fixed `TODO` in `readme.md` function call examples
- Added a manual reset API to the integrations' `logits_processor` so users can reset formatter state when generation does not complete normally
- Fixed a bug in the `exllamav2` integration so that `Exllamav2Filter` can be reused across generations
- Significantly improved the speed of `formatter.mask_logits` on torch tensors
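
The manual reset entry above can be illustrated with a minimal sketch. It does not use the library itself: `ToyFormatterProcessor`, its `reset()` method, and `generate` are hypothetical stand-ins written for this example, and the integrations' actual class and method names may differ. The sketch only shows why a stateful logits processor needs an explicit reset when a generation is interrupted partway through.

```python
from typing import List


class ToyFormatterProcessor:
    """Hypothetical stand-in for a stateful logits processor (not the library's class)."""

    def __init__(self, allowed_sequence: List[int]) -> None:
        self._allowed = allowed_sequence
        self._position = 0  # formatter state: number of tokens accepted so far

    def __call__(self, scores: List[float]) -> List[float]:
        # Once the format is complete, leave the scores unchanged.
        if self._position >= len(self._allowed):
            return list(scores)
        # Otherwise mask every token except the one the format allows next.
        allowed = self._allowed[self._position]
        masked = [s if i == allowed else float("-inf") for i, s in enumerate(scores)]
        self._position += 1
        return masked

    def reset(self) -> None:
        # Discard partially consumed state so the next generation starts fresh.
        self._position = 0


def generate(processor: ToyFormatterProcessor, steps: int, fail_at: int = -1) -> List[int]:
    """Greedy toy loop; raises at step `fail_at` to mimic an interrupted generation."""
    tokens = []
    for step in range(steps):
        if step == fail_at:
            raise RuntimeError("generation interrupted")
        scores = processor([0.0, 0.0, 0.0, 0.0])
        tokens.append(max(range(len(scores)), key=scores.__getitem__))
    return tokens


processor = ToyFormatterProcessor(allowed_sequence=[2, 0, 3])
try:
    generate(processor, steps=3, fail_at=1)  # stops after consuming one token
except RuntimeError:
    pass
finally:
    processor.reset()  # manual reset after the abnormal exit

result = generate(processor, steps=3)
print(result)
```

Without the `reset()` call in the `finally` block, the second run would resume from the stale position left by the interrupted run and produce tokens that no longer follow the intended format.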