* `Transformer.predict`: do not broadcast to listeners, requires `spacy>=3.5.0` (345)
* Correct and clarify the handling of empty/zero-length `Doc`s during training and inference (365)
* Remove superfluous datatype and device conversions, requires `torch>=1.8.0` (369)
* Fix memory leak in offsets mapping alignment for fast tokenizers (373)