Pyhearingai

Latest version: v0.1.0

Safety actively analyzes 722832 Python packages for vulnerabilities to keep your Python projects secure.

0.1.0

Added

Core Functionality
- Audio transcription engine with high accuracy text-to-speech conversion
- Advanced speaker diarization for multi-speaker conversations
- GPT-based speaker assignment for improved speaker identification
- Multiple formats support for transcription output (JSON, TEXT, SRT)
- Segment-level timestamps and speaker identification
- Support for multiple audio formats (WAV, MP3, M4A, etc.)

User Interfaces
- Command-line interface (CLI) for easy access to transcription features
- Programmatic API for integration into other Python applications
- Simple example scripts demonstrating library usage

Model Integrations
- OpenAI API integration for transcription and speaker assignment
- SpeechBrain integration for speech recognition
- Hugging Face models support
- Pyannote.audio integration for speaker diarization

Developer Features
- Comprehensive configuration options via environment variables
- Debug logging capabilities
- Error handling and graceful degradation
- Structured output format with segments and speaker information
- Multiple output file formats supported

Changed
- Initial release, no changes to document

Fixed
- Initial release, no fixes to document

Releases

Has known vulnerabilities

0.1.0

Pyhearingai

Page 1 of 1

0.1.0

Page 1 of 1

Links

Releases