Faster-whisper

Latest version: v1.1.1

Safety actively analyzes 723947 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 6 of 6

0.4.0

Integration of Silero VAD

The [Silero VAD](https://github.com/snakers4/silero-vad) model is integrated to ignore parts of the audio without speech:

python
model.transcribe(..., vad_filter=True)


The default behavior is conservative and only removes silence longer than 2 seconds. See the README to find how to customize the VAD parameters.

**Note:** the Silero model is executed with `onnxruntime` which is currently not released for Python 3.11. The dependency is excluded for this Python version and so the VAD features cannot be used.

Speaker diarization using stereo channels

The function `decode_audio` has a new argument `split_stereo` to split stereo audio into seperate left and right channels:

python
left, right = decode_audio(audio_file, split_stereo=True)

model.transcribe(left)
model.transcribe(right)


Other changes

* Add `Segment` attributes `avg_log_prob` and `no_speech_prob` (same definition as openai/whisper)
* Ignore audio frames raising an `av.error.InvalidDataError` exception during decoding
* Fix option `prefix` to be passed only to the first 30-second window
* Extend `suppress_tokens` with some special tokens that should always be suppressed (unless `suppress_tokens is None`)
* Raise a more helpful error message when the selected model size is invalid
* Disable the progress bar when the model to download is already in the cache

0.3.0

* Converted models are now available on the [Hugging Face Hub](https://huggingface.co/guillaumekln) and are automatically downloaded when creating a `WhisperModel` instance. The conversion step is no longer required for the original Whisper models.

python
Automatically download https://huggingface.co/guillaumekln/faster-whisper-large-v2
model = WhisperModel("large-v2")


* Run the encoder only once for each 30-second window. Before this change the same window could be encoded multiple times, for example in the temperature fallback or when word-level timestamps is enabled.

0.2.0

Initial publication of the library on PyPI: https://pypi.org/project/faster-whisper/

Page 6 of 6

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.