Faster-whisper

Latest version: v1.1.0

Safety actively analyzes 681935 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 6

1.0.0

* Support distil-whisper model (https://github.com/SYSTRAN/faster-whisper/pull/557)
Robust knowledge distillation of the Whisper model via large-scale pseudo-labelling.
For more detail: https://github.com/huggingface/distil-whisper

* Upgrade ctranslate2 version to 4.0 to support CUDA 12 (https://github.com/SYSTRAN/faster-whisper/pull/694)

* Upgrade PyAV version to 11.* to support Python3.12.x (https://github.com/SYSTRAN/faster-whisper/pull/679)

* Small bug fixes
* Illogical "Avoid computing higher temperatures on no_speech" (https://github.com/SYSTRAN/faster-whisper/pull/652)
* broken prompt_reset_on_temperature (https://github.com/SYSTRAN/faster-whisper/pull/604)
* Word timing tweaks (https://github.com/SYSTRAN/faster-whisper/pull/616)

* New improvements from original OpenAI Whisper project
* Skip silence around hallucinations (https://github.com/SYSTRAN/faster-whisper/pull/646)
* Prevent infinite loop for out-of-bound timestamps in clip_timestamps (https://github.com/SYSTRAN/faster-whisper/pull/697)

0.10.1

Fix the broken tag [v0.10.0](https://github.com/SYSTRAN/faster-whisper/releases/tag/v0.10.0)

0.10.0

* Support "large-v3" model with
* The ability to load `feature_size/num_mels` and other from `preprocessor_config.json`
* A new language token for Cantonese (`yue`)
* Update `CTranslate2` requirement to include the latest version 3.22.0
* Update `tokenizers` requirement to include the latest version 0.15
* Change the hub to fetch models from [Systran organization](https://huggingface.co/Systran)

0.9.0

* Add function `faster_whisper.available_models()` to list the available model sizes
* Add model property `supported_languages` to list the languages accepted by the model
* Improve error message for invalid `task` and `language` parameters
* Update `tokenizers` requirement to include the latest version 0.14

0.8.0

Expose new transcription options

Some generation parameters that were available in the CTranslate2 API but not exposed in faster-whisper:

* `repetition_penalty` to penalize the score of previously generated tokens (set > 1 to penalize)
* `no_repeat_ngram_size` to prevent repetitions of ngrams with this size

Some values that were previously hardcoded in the transcription method:

* `prompt_reset_on_temperature` to configure after which temperature fallback step the prompt with the previous text should be reset (default value is 0.5)

Other changes

* Fix a possible memory leak when decoding audio with PyAV by forcing the garbage collector to run
* Add property `duration_after_vad` in the returned `TranscriptionInfo` object
* Add "large" alias for the "large-v2" model
* Log a warning when the model is English-only but the `language` parameter is set to something else

0.7.1

* Fix a bug related to `no_speech_threshold`: when the threshold was met for a segment, the next 30-second window reused the same encoder output and was also considered as non speech
* Improve selection of the final result when all temperature fallbacks failed by returning the result with the best log probability

Page 4 of 6

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.