Faster metadata upload, bugfixes and improvements
Faster metadata upload
The upload of custom metadata is now done async and with multiple workers in parallel, allowing to speed up the upload process by up to 30 times.
Bugfixes
- When there is a failure uploading a file to a signed url, now the status code is printed correctly.
- Creating a `LightlyDataset` with an `input_dir` with videos will now raise all errors scanning the input directory instead of ignoring them. This means e.g. that if a subfolder without read permissions is encountered, a `PermissionError` will be raised instead of silently ignoring the subfolder.
- When embedding, the order of the embeddings in the output will now be the order of the samples in the dataset, even if multiple workers are used in the dataloader. Thus also the embeddings in the embedding file are in the sorted order. This is not directly a bugfix but might prevent problems later on.
Improvements
- The usage of resnet backbones in the example models is now consistent. Thanks for bringing this up JeanKaddour!
- The SimCLR example now does not use Gaussian blur anymore, just like in the paper. Thanks for pointing this out littleolex!
- The BarlowTwins example now also uses an input size of 32 to make it consistent with the other examples. Thanks for bringing this up heytitle!
- The documentation for setting up Azure as cloud storage for the Lightly Platform has been improved.
Models
- [Bootstrap your own latent: A new approach to self-supervised Learning, 2020](https://arxiv.org/abs/2006.07733)
- [Barlow Twins: Self-Supervised Learning via Redundancy Reduction, 2021](https://arxiv.org/abs/2103.03230)
- [SimSiam: Exploring Simple Siamese Representation Learning, 2020](https://arxiv.org/abs/2011.10566)
- [MoCo: Momentum Contrast for Unsupervised Visual Representation Learning, 2019](https://arxiv.org/abs/1911.05722)
- [SimCLR: A Simple Framework for Contrastive Learning of Visual Representations, 2020](https://arxiv.org/abs/2002.05709)
- [NNCLR: Nearest-Neighbor Contrastive Learning of Visual Representations, 2021](https://arxiv.org/pdf/2104.14548.pdf)
- [SwAV: Unsupervised Learning of Visual Features by Contrasting Cluster Assignments, M. Caron, 2020](https://arxiv.org/abs/2006.09882)