RealtimeSTT

Latest version: v0.3.100


0.3.100

New VAD callbacks `on_vad_start` and `on_vad_stop`
- Triggered when voice activity starts and stops.
- Reverted `on_vad_detect_start` and `on_vad_detect_stop` to their earlier behavior: they fire when the system starts or stops listening for voice activity.
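To illustrate the difference between the two callback pairs, here is a minimal sketch. It assumes the four callbacks are passed as constructor keyword arguments to `AudioToTextRecorder` and are called without arguments; the `events` list, the `recorder_kwargs` dict, and the `run()` helper are hypothetical names used only for illustration.

```python
from typing import Callable, Dict, List

# Collected event names, so the firing order can be inspected afterwards.
events: List[str] = []


def on_vad_start() -> None:
    # Assumed to fire when voice activity is actually present.
    events.append("vad_start")


def on_vad_stop() -> None:
    # Assumed to fire when voice activity ends.
    events.append("vad_stop")


def on_vad_detect_start() -> None:
    # Fires when the system starts listening for voice activity.
    events.append("vad_detect_start")


def on_vad_detect_stop() -> None:
    # Fires when the system stops listening for voice activity.
    events.append("vad_detect_stop")


# Keyword arguments for the recorder (assumed constructor parameter names).
recorder_kwargs: Dict[str, Callable[[], None]] = {
    "on_vad_start": on_vad_start,
    "on_vad_stop": on_vad_stop,
    "on_vad_detect_start": on_vad_detect_start,
    "on_vad_detect_stop": on_vad_detect_stop,
}


def run() -> None:
    # Imported lazily so this sketch loads without the package or a microphone.
    from RealtimeSTT import AudioToTextRecorder

    with AudioToTextRecorder(**recorder_kwargs) as recorder:
        print(recorder.text())
        print(events)
```

Call `run()` from under an `if __name__ == "__main__":` guard in a real script; it needs the package installed and a working audio input device.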

0.3.99

1. **Enhanced Logging Configuration**
- Introduced a dedicated named logger `realtimestt` instead of using the root logger.
- Added structured logging with handlers for both console (level set by user) and file (always DEBUG).
- Logging no longer propagates to the root logger by default (`logger.propagate = False`).

2. **Option to Disable the Faster-Whisper VAD Filter**
- Added `faster_whisper_vad_filter` parameter (default: `True`) that toggles the voice activity detection (VAD) filter from the `faster_whisper` library; set it to `False` to disable the filter.
- When enabled, the filter improves robustness against background noise at the cost of additional GPU resources.
- Integrated into both real-time and main transcription workflows.

3. **Audio Worker Improvements**
- Added more detailed debug logging for audio device initialization, sample rate handling, and resampling.

4. **VAD Callback Adjustments**
- Fixes #215.
- Moved `on_vad_detect_start` and `on_vad_detect_stop` callbacks to trigger directly during voice activity checks instead of state transitions.
- Ensures callbacks align more accurately with actual speech/silence events.
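The logging setup described in item 1 can be approximated with the standard library alone. This is a sketch of the pattern, not the library's actual code: the `build_logger` helper, the log file path, and the format string are assumptions; only the logger name `realtimestt`, the two handler levels, and `propagate = False` come from the notes above.

```python
import logging


def build_logger(console_level: int, log_path: str) -> logging.Logger:
    """Configure a named logger with a console handler and a DEBUG file handler."""
    logger = logging.getLogger("realtimestt")
    # Let everything through at the logger level; the handlers do the filtering.
    logger.setLevel(logging.DEBUG)
    # Keep records away from the root logger's handlers.
    logger.propagate = False
    # Drop handlers from earlier calls so messages are not duplicated.
    logger.handlers.clear()

    formatter = logging.Formatter("%(asctime)s %(name)s %(levelname)s %(message)s")

    # Console output at the level chosen by the user.
    console = logging.StreamHandler()
    console.setLevel(console_level)
    console.setFormatter(formatter)
    logger.addHandler(console)

    # File output always records DEBUG and above.
    file_handler = logging.FileHandler(log_path, encoding="utf-8")
    file_handler.setLevel(logging.DEBUG)
    file_handler.setFormatter(formatter)
    logger.addHandler(file_handler)

    return logger
```

Because propagation is off, an application that wants these records in its own handlers can either attach a handler to `logging.getLogger("realtimestt")` or set `propagate = True` on it.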

0.3.98

- minor fix for the PyPI wheel

0.3.97

- fix for #210

0.3.95

- better warmup (using an audio file)
- merged #200

0.3.94

- **New parameters for the `stop()` method of `AudioToTextRecorder`:**
  - `backdate_stop_seconds` (float, default=0.0):
    - **Description:** Specifies the number of seconds to backdate the stop time when ending a recording.
    - **Usage:** When invoking `stop()` due to a wake word detection or a speaker diarization change event, this parameter compensates for any latency, ensuring that only relevant audio is included in the recording and transcription.
  - `backdate_resume_seconds` (float, default=0.0):
    - **Description:** Specifies the number of seconds to backdate the resume time when restarting listening after a recording has stopped.
    - **Usage:** Typically set to the same value as `backdate_stop_seconds`, this parameter allows for fine-tuning.
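As a rough illustration of what backdating means, the sketch below trims the tail of a sample buffer by the backdated duration and shows how the two parameters might be passed to `stop()`. The `trim_backdated` and `stop_after_wake_word` helpers and the 0.3 second latency are hypothetical; the trimming logic is an assumption about the effect, not the library's implementation.

```python
from typing import List, Sequence


def trim_backdated(samples: Sequence[int], sample_rate: int,
                   backdate_stop_seconds: float) -> List[int]:
    """Drop the last `backdate_stop_seconds` worth of samples from a buffer."""
    drop = int(round(backdate_stop_seconds * sample_rate))
    if drop <= 0:
        return list(samples)
    keep = max(0, len(samples) - drop)
    return list(samples[:keep])


def stop_after_wake_word(recorder, latency_seconds: float = 0.3) -> None:
    # `recorder` is expected to be an AudioToTextRecorder instance.
    # The same value is used for both parameters, as the notes above suggest.
    recorder.stop(
        backdate_stop_seconds=latency_seconds,
        backdate_resume_seconds=latency_seconds,
    )
```

For example, at a 16000 Hz sample rate, backdating by 0.3 seconds would correspond to the last 4800 samples of the recording.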

