Improvements for Inference Pipeline for Fully Finetuned Models
- Model Optimization: Uses [CTranslate2](https://github.com/OpenNMT/CTranslate2) for optimized model performance.
- Backend: Leverages the [faster-whisper](https://github.com/SYSTRAN/faster-whisper) backend (previously based on the Transformers Pipeline) for faster processing.
- Alignment: Uses Wav2vec models to ensure precise audio alignment.
- Multi-Speaker Diarization: Integrates [pyannote-audio](https://github.com/pyannote/pyannote-audio) for speaker identification and labeling.
- Voice Activity Detection (VAD): Employs VAD preprocessing to reduce hallucination and improve batching without compromising Word Error Rate (WER).
Fixes
- Fixes 109
What's Changed
* Chore: Update deployment documentation, Fix: remove uid by KevKibe in https://github.com/KevKibe/African-Whisper/pull/93
* Chore: Add unittests for `training.whisper_model_prep` module. Modify `training.load_dataset` module by KevKibe in https://github.com/KevKibe/African-Whisper/pull/94
* Chore: Add tests for `training.audio_data_processor` module. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/95
* Feat: Faster Whisper backend for `deployment.transcription_pipeline` module, transcription alignment, audio diarization, VAD. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/98
* Chore: Update `DOCS/DEPLOYMENT.MD` and comments, add faster-whisper dependencies to `src/deployments/requirements.txt`.`. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/100
* Bump rustls from 0.22.3 to 0.22.4 in /src/deployment/rustinference by dependabot in https://github.com/KevKibe/African-Whisper/pull/99
* Refactors: Change filenames for easier identification of peft code and non-peft. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/102
* Fix: 'argparse` argument name, update version number for new release by KevKibe in https://github.com/KevKibe/African-Whisper/pull/107
* Fix: `whisper_arch` parameter in `load_asr_model` in `convert_model_to_optimized_format` method. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/110
* Chore: Add tests for `deployment.peft_speech_inference` module, fix `src/deployment/speech_inference_cli.py`. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/111
* Feat: Add `num_samples` parameter to `training.load_data` module to load a section of the training data. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/112
* Fix: `src/deployment/faster_whisper/mel_filters.npz` file in build. by KevKibe in https://github.com/KevKibe/African-Whisper/pull/113
**Full Changelog**: https://github.com/KevKibe/African-Whisper/compare/v0.9.0...v0.10.0