* Fix a bug related to `no_speech_threshold`: when the threshold was met for a segment, the next 30-second window reused the same encoder output and was also considered as non speech
* Improve selection of the final result when all temperature fallbacks failed by returning the result with the best log probability