- [NumPy](https://numpy.org/) (>= 1.17.2)

  `pip install numpy`
- [SciPy](https://www.scipy.org/) (>= 1.3.1)

  `pip install scipy`
- [FFmpeg](https://www.ffmpeg.org/)

  `sudo apt install ffmpeg` (on Debian/Ubuntu)

Installation
-------------
If you already have a working installation of NumPy and SciPy, you can install pydiogment using pip:

`pip install -U pydiogment`

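
Before installing, it can help to confirm that the prerequisites listed above are visible to the Python interpreter and on the `PATH`. The following is a minimal sketch using only the standard library. `check_prerequisites` is a hypothetical helper, not part of pydiogment, and it only checks that each prerequisite is present, not that the minimum versions are met.

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report which prerequisites are visible (hypothetical helper, not part of pydiogment)."""
    return {
        # find_spec returns None when a top-level module cannot be located.
        "numpy": importlib.util.find_spec("numpy") is not None,
        "scipy": importlib.util.find_spec("scipy") is not None,
        # shutil.which returns None when no executable named "ffmpeg" is on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

Anything reported as missing can be installed with the commands from the list of prerequisites.
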
How to use
==========
- Amplitude-related augmentation
  - Apply a fade-in and fade-out effect

    ```python3
    from pydiogment.auga import fade_in_and_out

    test_file = "path/test.wav"
    fade_in_and_out(test_file)
    ```
  - Apply gain to file

    ```python3
    from pydiogment.auga import apply_gain

    test_file = "path/test.wav"
    apply_gain(test_file, -100)
    apply_gain(test_file, -50)
    ```
  - Add random Gaussian noise to file based on a target SNR

    ```python3
    from pydiogment.auga import add_noise

    test_file = "path/test.wav"
    add_noise(test_file, 10)
    ```

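
To make the SNR-based noise example above more concrete, here is a minimal, standard-library-only sketch of how Gaussian noise can be scaled to reach a target SNR. It illustrates the general technique and is not pydiogment's implementation. The function name `add_gaussian_noise`, the assumption that the SNR is expressed in decibels, and the use of plain Python lists instead of NumPy arrays are all choices made for this example.

```python
import math
import random


def add_gaussian_noise(signal, snr_db, seed=None):
    """Return a noisy copy of `signal` at roughly `snr_db` dB SNR (illustrative helper)."""
    rng = random.Random(seed)
    # Average power of the clean signal.
    signal_power = sum(x * x for x in signal) / len(signal)
    # SNR_dB = 10 * log10(P_signal / P_noise)  =>  P_noise = P_signal / 10^(SNR_dB / 10)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    # Zero-mean Gaussian noise with standard deviation sigma has expected power sigma^2.
    return [x + rng.gauss(0.0, sigma) for x in signal]


# One second of a 440 Hz sine wave sampled at 8 kHz.
fs = 8000
clean = [math.sin(2 * math.pi * 440 * n / fs) for n in range(fs)]
noisy = add_gaussian_noise(clean, snr_db=10, seed=0)

# Measure the SNR that was actually achieved on this realization.
noise = [b - a for a, b in zip(clean, noisy)]
measured = 10 * math.log10(sum(a * a for a in clean) / sum(e * e for e in noise))
print(f"measured SNR: {measured:.2f} dB")
```

Because the noise is random, the measured SNR only approximates the target; the gap shrinks as the signal gets longer.
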
- Frequency-related augmentation
  - Change file tone

    ```python3
    from pydiogment.augf import change_tone

    test_file = "path/test.wav"
    change_tone(test_file, 0.9)
    change_tone(test_file, 1.1)
    ```

- Time-related augmentation
  - Slow down or speed up file

    ```python3
    from pydiogment.augt import slowdown, speed

    test_file = "path/test.wav"
    slowdown(test_file, coefficient=0.8)
    speed(test_file, coefficient=1.2)
    ```
  - Apply random cropping to the file

    ```python3
    from pydiogment.augt import random_cropping

    test_file = "path/test.wav"
    random_cropping(test_file, 1)
    ```
  - Shift data on the time axis in a given direction

    ```python3
    from pydiogment.augt import shift_time

    test_file = "path/test.wav"
    shift_time(test_file, 1, "right")
    shift_time(test_file, 1, "left")
    ```

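
As a rough illustration of what shifting data on the time axis can mean, the sketch below circularly shifts a list of samples to the left or right. It is an assumption-laden example, not pydiogment's implementation: `shift_samples` is a hypothetical helper, the shift is assumed to be given in seconds, and samples that fall off one end are assumed to wrap around to the other (a library could just as well pad with silence).

```python
def shift_samples(samples, shift_seconds, fs, direction="right"):
    """Circularly shift `samples` by `shift_seconds` (illustrative helper, wrap-around assumed)."""
    if direction not in ("left", "right"):
        raise ValueError("direction must be 'left' or 'right'")
    if not samples:
        return []
    # Convert the shift from seconds to a whole number of samples.
    n = int(shift_seconds * fs) % len(samples)
    if n == 0:
        return list(samples)
    if direction == "left":
        # Samples leaving the start re-enter at the end.
        return samples[n:] + samples[:n]
    # direction == "right": samples leaving the end re-enter at the start.
    return samples[-n:] + samples[:-n]


ramp = list(range(8))
print(shift_samples(ramp, 0.5, fs=4, direction="right"))  # [6, 7, 0, 1, 2, 3, 4, 5]
print(shift_samples(ramp, 0.5, fs=4, direction="left"))   # [2, 3, 4, 5, 6, 7, 0, 1]
```

With a sampling rate of 4 Hz, a 0.5 s shift corresponds to two samples, which is why each output is rotated by two positions.
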
Documentation
==============
Thorough documentation of the library is available at [pydiogment.readthedocs.io](https://pydiogment.readthedocs.io/en/latest/index.html).

Contributing
============
Contributions are welcome and encouraged. To learn more about how to contribute to pydiogment, please refer to the [Contributing guidelines](https://github.com/SuperKogito/pydiogment/blob/master/CONTRIBUTING.md).

Acknowledgment and credits
============================
- The test file used in the pytest suite is [OSR_us_000_0060_8k.wav](https://www.voiptroubleshooter.com/open_speech/american/OSR_us_000_0060_8k.wav) from the [Open Speech Repository](https://www.voiptroubleshooter.com/open_speech/american.html).