HEAR 2021 Competition Version
- Minor fix: unpins the `transformers` version.
v2021.0.5-release
* The wav2vec2 baseline uses the `facebook/wav2vec2-large-100k-voxpopuli` model by default and uses different hop sizes for scene and timestamp embeddings.
* The torchcrepe baseline uses different hop sizes for scene and timestamp embeddings.