- Make Hyperion pip installable
- Adds installation instructions
- Configuration files and command-line arguments are handled using jsonarparse, not argparse anymore. This allows us to use yaml files and overrides the values in the yaml file from the command line.
- First version using nn.DistributedDataParallel instead of nn.DataParallel
- Supports FairScale Sharded DataParallel, we didn't observe significant memory gains in our models using this so far
- Added SpineNet, Spine2Net and TSE-Spine2Net x-vector architectures from our IS21 paper
- Added SpeAugment PyTorch Layer
- Added numpy speed augment class
- Fixed make_voxceleb2cat.pl, all speakers extracted from the same video were getting the same spkid, the script did not take into account that more than one speaker could be extracted from each video
- New recipes:
- Recipe for classifying adversarial attacks algorithms and threat models from our IS21 paper (voxceleb/adv.v2)
- Recipe for adv attacks aginst spk verif renamed as voxceleb/adv.v1 and adv.v1.1, these recipes have been updated and cleaned up
- Recipe for SRE19-AV Audio part with AHC diarization (sre19-av-a/v2.1)
- Recipe for Chime5 speaker verification setup chime5/v1
- Recipes for SRE19-AV Face Recognition using pretrained RetinaFace face detector and ArcFace embeddings from InsightFace MX-Net repository (sre19-av-v/v0.1) and Insightface-Pytorch (sre19-av-v/v0.2)
- Added VOiCES challenge recipe
- Adds SRE20-CTS recipe v1
- Added Spine2Net results in voxceleb/v1.1 recipe