Torchaudio

Latest version: v2.5.1

Safety actively analyzes 688552 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 15 of 16

0.5.1

Highlights

* Updated pinned version of PyTorch to [`v1.5.1`](https://github.com/pytorch/pytorch/releases/tag/v1.5.1)

0.5.0

Highlights

torchaudio includes new transforms (e.g. Griffin-Lim and inverse Mel scale), new filters (e.g. all pass, fade, band pass/reject, band, treble, deemph, riaa), and datasets (LJ Speech and SpeechCommands).

Backwards Incompatible Changes

* torchaudio no longer supports python 2. We removed future and six imports. We added inline typing. (413, 478, 479, 482, 486)
* We fixed CommonVoice dataset download, and updated to the latest version. (498)
* We now skip data point with missing data in VCTK dataset. (484)

New Features

* We now have the Vol transforms, and DB_to_amplitude.(468, 469)
* We now have the InverseMelScale (448)
* We now have the Griffin-Lim functional. (365)
* We now support allpass, fade, bandpass, bandreject, band, treble, deemph, riaa. (444, 449, 464, 470, 508)
* We now offer LJSpeech and SpeechCommands datasets. (439, 437)

Improvements

* We added inline typing to SoxEffects and Kaldi compliance. (490, 497)
* We refactored the tests. (480, 485, 496, 491, 501, 502, 503, 506, 507, 509)
* We now run tests with sox only when sox is available. (419)
* We extended batch support to MelScale, MelSpectrogram, MFCC, Resample. (391, 435)
* The speed of torchaudio.functional.istft was improved. (471)
* We now have transform and functional tests for AmplitudeToDB. (463)
* We now ignore pycharm and OSX files in git. (461)
* TimeStretch now has a batch test. (459)
* Docstrings in transforms were polished. (442)
* TimeStretch and AmplitudeToDB are now torch.nn.Module. (456)
* Resample is now jitable. (441)
* We support python 3.8. (397)
* Add cuda test for complex norm. (421)
* Dither is jitable with the latest version of pytorch. (417)
* Batching uses view instead of reshape. (409)
* We refactored the jitability test. (395)
* In .circleci, we removed a conditional block that wasn't doing anything. (399)
* We now have Windows CI for building. (394 and 398)
* We corrected the use of standard variable names in code. (393)
* We adopted native-Python code generation convention. (378)
* torchaudio.istft creates tensors directly on device. (377)
* torchaudio.compliance.kaldi.resample_waveform is now jitable. (362)
* The runtime of torchaudio.functional.lfilter was decreased. (374)

Bug Fixes

* We fixed flake8 errors. (504, 505)
* We fixed Windows test by only testing with cpu-only binaries. (489)
* Spelling correction in docstrings for transforms.FrequencyMasking and transforms.TimeMasking. (474)
* In .circleci, we switched to use token for conda uploads. (460)
* The default value of dither parameter was changed. (453)
* TimeStretch moves device correctly. (457)
* Adding dev-other option in librispeech. (433)
* In build script, we install the correct version of pytorch for pip. (412)
* Upgrading dataset DeprecationWarning to UserWarning so that the user gets the warning. (402)
* Make power of spectrogram a float to work with complex norm. (392)
* Fix random seed for flaky test_griffinlim test. (388)
* Apply 'nightly' branch filter to binary uploads. (385)
* Fixed build errors: added explicitly utf8 decoration, added explicit utf_8_encoder definition if not available, explicitly cast to int. (380)

Deprecations

* None

0.4

* We introduce an interactive speech recognition demo. (266, 229, 248)
* SoX is now optional, and a new extensible backend dispatch mechanism exposes SoundFile as an alternative to SoX.
* The interface for datasets has been unified. This enables the addition of two large datasets: LibriSpeech and Common Voice.
* New filters such as biquad, data augmentation such as time and frequency masking, and transforms such as gain and dither, and new feature computation such as deltas, are now available.
* Transformations now support batches and are jitable.

We would like to thank again our contributors and the wider community for their significant contributions to this release. In particular we'd like to thank keunwoochoi, ksanjeevan, and all the other maintainers and contributors of torchaudio-contrib for their significant and valuable additions around augmentations (285) and batching (327).

Breaking Changes

* torchaudio now requires PyTorch 1.3.0 or newer, see https://pytorch.org/ for installation instructions. (#312)
* We make jit compilation optional for functions and use nn.Module where possible. (314, 326, 342, 369)
* By unifying the interface for datasets, we changed the interface for VCTK and YESNO (303, 316). In particular, the construction parameters `downsample`, `transform`, `target_transform`, and `return_dict` are being deprecated.
* SoxEffectsChain.EFFECTS_AVAILABLE replaced by SoxEffectsChain().EFFECTS_AVAILABLE (355)
* This is the last version to support Python 2.

New Features

* SoX is now optional, and a new extensible backend dispatch mechanism exposes SoundFile as an alternative to SoX. This makes it possible to use torchaudio even when SoX or SoundFile are not installed or available. (355)
* We now have a unified dataset interface that loads in memory only one item at a time enabling new large datasets: LibriSpeech and CommonVoice. (303, 316, 330)
* We introduce a pitch detection algorithm: `torchaudio.functional.detect_pitch_frequency`. (313, 322)
* We offer data augmentations in `torchaudio.transforms`: `TimeStretch`, `FrequencyMasking`, `TimeMasking`. (285, 333, 348)
* We introduce a complex norm transform: `torchaudio.transform.ComplexNorm`. (285, 333)
* We now have a new audio feature generation for computing deltas: `torchaudio.functional.compute_deltas`. (268, 326)
* We introduce `torchaudio.functional.gain` and `torchaudio.functional.dither` (319, 360). We welcome work to continue the effort to implement features available in SoX, see 260.
* We now include `equalizer_biquad` (315, 340), `lowpass_biquad`, `highpass_biquad` (275), `lfilter`, and `biquad` (275, 291, 326) in `torchaudio.functional`.
* MFCC is available as `torchaudio.functional.mfcc`. (228)

Improvements

* We now support batching in transforms. (327, 337, 404)
* Functions are now jitable, and nn.Module is used where possible. (314, 326, 342, 362, 369, 395)
* Downloads of large files are now automatically resumed with new download function. (320)
* New tests for ISTFT are added. (279)
* We introduce nightly builds. (301)
* We now have smoke tests for builds. (346, 359)

Bug Fixes

* Fix mismatch between `MelScale` and librosa. (294)
* Fix `torchaudio.compliance.kaldi.resample_waveform` where internal variables where not moved to the GPU when used. (277)
* Fix a bug that occurred when importing torchaudio built outside of a git repository. (276)
* Fix `istft` where the `dtype` and `device` of parameters were not created on the same device as the tensor provided by the user. (264)
* Fix size mismatch when saving and loading from state dictionary (`load_state_dict`). (246)
* Clarified internal naming convention within transforms and functionals. (298)
* Fix build script to be more tolerant to download drops. (280, 284, 305)
* Correct documentation for SoxEffectsChain. (283)
* Fix resample error with cuda tensors. (277)
* Fix error when importing version outside of git. (276)
* Fix missing asound in linux build. (254)
* Fix deprecated torch. (254)
* Fix link in README. (253)
* Fix window device in ISTFT. (240)
* Documentation: Fix range in documentation for `torchaudio.load` to [-1, 1]. (283)

0.4.0

0.3.2

This release is to update the dependency to PyTorch 1.3.1.

0.3.1

This release is to update the dependency to PyTorch 1.3.0.

Minor Fix

* Updated settings for curl in build scripts (280, 284, 297).

Page 15 of 16

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.