Skada

Latest version: v0.3.0

Safety actively analyzes 682387 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.3.0

First release of SKADA!

The following algorithms are currently implemented.

Domain adaptation algorithms

- Sample reweighting methods (Gaussian [1], Discriminant [2], KLIEPReweight [3],
DensityRatio [4], TarS [21], KMMReweight [23])
- Sample mapping methods (CORAL [5], Optimal Transport DA OTDA [6], LinearMonge [7], LS-ConS [21])
- Subspace methods (SubspaceAlignment [8], TCA [9], Transfer Subspace Learning [27])
- Other methods (JDOT [10], DASVM [11], OT Label Propagation [28])

Any methods that can be cast as an adaptation of the input data can be used in one of two ways:
- a scikit-learn transformer (Adapter) which provides both a full Classifier/Regressor estimator
- or an `Adapter` that can be used in a DA pipeline with `make_da_pipeline`.
Refer to the examples below and visit [the gallery](https://scikit-adaptation.github.io/auto_examples/index.html) for more details.

Deep learning domain adaptation algorithms

- Deep Correlation alignment (DeepCORAL [12])
- Deep joint distribution optimal (DeepJDOT [13])
- Divergence minimization (MMD/DAN [14])
- Adversarial/discriminator based DA (DANN [15], CDAN [16])

DA metrics

- Importance Weighted [17]
- Prediction entropy [18]
- Soft neighborhood density [19]
- Deep Embedded Validation (DEV) [20]
- Circular Validation [11]

References

[1] Shimodaira Hidetoshi. ["Improving predictive inference under covariate shift by weighting the log-likelihood function."](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=235723a15c86c369c99a42e7b666dfe156ad2cba) Journal of statistical planning and inference 90, no. 2 (2000): 227-244.

[2] Sugiyama Masashi, Taiji Suzuki, and Takafumi Kanamori. ["Density-ratio matching under the Bregman divergence: a unified framework of density-ratio estimation."](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=f1467208a75def8b2e52a447ab83644db66445ea) Annals of the Institute of Statistical Mathematics 64 (2012): 1009-1044.

[3] Sugiyama Masashi, Taiji Suzuki, Shinichi Nakajima, Hisashi Kashima, Paul Von Bünau, and Motoaki Kawanabe. ["Direct importance estimation for covariate shift adaptation."](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=af14e09a9f829b9f0952eac244b0ac0c8bda2ca8) Annals of the Institute of Statistical Mathematics 60 (2008): 699-746.

[4] Sugiyama Masashi, and Klaus-Robert Müller. ["Input-dependent estimation of generalization error under covariate shift."](https://web.archive.org/web/20070221112234id_/http://sugiyama-www.cs.titech.ac.jp:80/~sugi/2005/IWSIC.pdf) (2005): 249-279.

[5] Sun Baochen, Jiashi Feng, and Kate Saenko. ["Correlation alignment for unsupervised domain adaptation."](https://arxiv.org/pdf/1612.01939.pdf) Domain adaptation in computer vision applications (2017): 153-171.

[6] Courty Nicolas, Flamary Rémi, Tuia Devis, and Alain Rakotomamonjy. ["Optimal transport for domain adaptation."](https://arxiv.org/pdf/1507.00504.pdf) IEEE Trans. Pattern Anal. Mach. Intell 1, no. 1-40 (2016): 2.

[7] Flamary, R., Lounici, K., & Ferrari, A. (2019). [Concentration bounds for linear monge mapping estimation and optimal transport domain adaptation](https://arxiv.org/pdf/1905.10155.pdf). arXiv preprint arXiv:1905.10155.

[8] Fernando, B., Habrard, A., Sebban, M., & Tuytelaars, T. (2013). [Unsupervised visual domain adaptation using subspace alignment](https://openaccess.thecvf.com/content_iccv_2013/papers/Fernando_Unsupervised_Visual_Domain_2013_ICCV_paper.pdf). In Proceedings of the IEEE international conference on computer vision (pp. 2960-2967).

[9] Pan, S. J., Tsang, I. W., Kwok, J. T., & Yang, Q. (2010). [Domain adaptation via transfer component analysis](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=4823e52161ec339d4d3526099a5477321f6a9a0f). IEEE transactions on neural networks, 22(2), 199-210.

[10] Courty, N., Flamary, R., Habrard, A., & Rakotomamonjy, A. (2017). [Joint distribution optimal transportation for domain adaptation](https://proceedings.neurips.cc/paper_files/paper/2017/file/0070d23b06b1486a538c0eaa45dd167a-Paper.pdf). Advances in neural information processing systems, 30.

[11] Bruzzone, L., & Marconcini, M. (2009). [Domain adaptation problems: A DASVM classification technique and a circular validation strategy.](https://rslab.disi.unitn.it/papers/R82-PAMI.pdf) IEEE transactions on pattern analysis and machine intelligence, 32(5), 770-787.

[12] Sun, B., & Saenko, K. (2016). [Deep coral: Correlation alignment for deep domain adaptation](https://arxiv.org/pdf/1607.01719.pdf). In Computer Vision–ECCV 2016 Workshops: Amsterdam, The Netherlands, October 8-10 and 15-16, 2016, Proceedings, Part III 14 (pp. 443-450). Springer International Publishing.

[13] Damodaran, B. B., Kellenberger, B., Flamary, R., Tuia, D., & Courty, N. (2018). [Deepjdot: Deep joint distribution optimal transport for unsupervised domain adaptation](https://openaccess.thecvf.com/content_ECCV_2018/papers/Bharath_Bhushan_Damodaran_DeepJDOT_Deep_Joint_ECCV_2018_paper.pdf). In Proceedings of the European conference on computer vision (ECCV) (pp. 447-463).

[14] Long, M., Cao, Y., Wang, J., & Jordan, M. (2015, June). [Learning transferable features with deep adaptation networks](https://proceedings.mlr.press/v37/long15.pdf). In International conference on machine learning (pp. 97-105). PMLR.

[15] Ganin, Y., Ustinova, E., Ajakan, H., Germain, P., Larochelle, H., Laviolette, F., ... & Lempitsky, V. (2016). [Domain-adversarial training of neural networks](https://www.jmlr.org/papers/volume17/15-239/15-239.pdf). Journal of machine learning research, 17(59), 1-35.

[16] Long, M., Cao, Z., Wang, J., & Jordan, M. I. (2018). [Conditional adversarial domain adaptation](https://proceedings.neurips.cc/paper_files/paper/2018/file/ab88b15733f543179858600245108dd8-Paper.pdf). Advances in neural information processing systems, 31.

[17] Sugiyama, M., Krauledat, M., & Müller, K. R. (2007). [Covariate shift adaptation by importance weighted cross validation](https://www.jmlr.org/papers/volume8/sugiyama07a/sugiyama07a.pdf). Journal of Machine Learning Research, 8(5).

[18] Morerio, P., Cavazza, J., & Murino, V. (2017).[ Minimal-entropy correlation alignment for unsupervised deep domain adaptation](https://arxiv.org/pdf/1711.10288.pdf). arXiv preprint arXiv:1711.10288.

[19] Saito, K., Kim, D., Teterwak, P., Sclaroff, S., Darrell, T., & Saenko, K. (2021). [Tune it the right way: Unsupervised validation of domain adaptation via soft neighborhood density](https://openaccess.thecvf.com/content/ICCV2021/papers/Saito_Tune_It_the_Right_Way_Unsupervised_Validation_of_Domain_Adaptation_ICCV_2021_paper.pdf). In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 9184-9193).

[20] You, K., Wang, X., Long, M., & Jordan, M. (2019, May). [Towards accurate model selection in deep unsupervised domain adaptation](https://proceedings.mlr.press/v97/you19a/you19a.pdf). In International Conference on Machine Learning (pp. 7124-7133). PMLR.

[21] Zhang, K., Schölkopf, B., Muandet, K., Wang, Z. (2013). [Domain Adaptation under Target and Conditional Shift](http://proceedings.mlr.press/v28/zhang13d.pdf). In International Conference on Machine Learning (pp. 819-827). PMLR.

[22] Loog, M. (2012). Nearest neighbor-based importance weighting. In 2012 IEEE International Workshop on Machine Learning for Signal Processing, pages 1–6. IEEE (https://arxiv.org/pdf/2102.02291.pdf)

[23] Domain Adaptation Problems: A DASVM ClassificationTechnique and a Circular Validation StrategyLorenzo Bruzzone, Fellow, IEEE, and Mattia Marconcini, Member, IEEE (https://rslab.disi.unitn.it/papers/R82-PAMI.pdf)

[24] Loog, M. (2012). Nearest neighbor-based importance weighting. In 2012 IEEE International Workshop on Machine Learning for Signal Processing, pages 1–6. IEEE (https://arxiv.org/pdf/2102.02291.pdf)

[25] J. Huang, A. Gretton, K. Borgwardt, B. Schölkopf and A. J. Smola. Correcting sample selection bias by unlabeled data. In NIPS, 2007. (https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=07117994f0971b2fc2df95adb373c31c3d313442)

[26] Long, M., Wang, J., Ding, G., Sun, J., and Yu, P. (2014). [Transfer joint matching for unsupervised domain adaptation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1410–1417](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=a279f53f386ac78345b67e13c1808880c718efdf)

[27] S. Si, D. Tao and B. Geng. In IEEE Transactions on Knowledge and Data Engineering, (2010) [Bregman Divergence-Based Regularization for Transfer Subspace Learning](https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=4118b4fc7d61068b9b448fd499876d139baeec81)

[28] Solomon, J., Rustamov, R., Guibas, L., & Butscher, A. (2014, January). [Wasserstein propagation for semi-supervised learning](https://proceedings.mlr.press/v32/solomon14.pdf). In International Conference on Machine Learning (pp. 306-314). PMLR.

What's Changed
* Update previously used dataset fixture by kachayev in https://github.com/scikit-adaptation/skada/pull/117
* Remove masked inputs only if estimator does not accept `sample_domain` by kachayev in https://github.com/scikit-adaptation/skada/pull/123
* Fix DiscriminatorReweightDensity and ReweightDensity by antoinedemathelin in https://github.com/scikit-adaptation/skada/pull/118
* [TO_REVIEW] _find_y_type return enum by YanisLalou in https://github.com/scikit-adaptation/skada/pull/125
* [MRG] Implement CircularValidation as a scorer by YanisLalou in https://github.com/scikit-adaptation/skada/pull/124
* No need for mark-as-final operation by kachayev in https://github.com/scikit-adaptation/skada/pull/128
* [MRG] Add TarS method by antoinecollas in https://github.com/scikit-adaptation/skada/pull/93
* Selector to avoid filtering out masked samples when fitting transformer by kachayev in https://github.com/scikit-adaptation/skada/pull/129
* [MRG] Fix tests random seed and TarS by antoinecollas in https://github.com/scikit-adaptation/skada/pull/131
* [MRG] DA deep methods with new API by tgnassou in https://github.com/scikit-adaptation/skada/pull/45
* [MRG] Add LS-ConS method by antoinecollas in https://github.com/scikit-adaptation/skada/pull/103
* Precommit with ruff+codespell by agramfort in https://github.com/scikit-adaptation/skada/pull/130
* Ignore deep/* tests when torch is not installed by kachayev in https://github.com/scikit-adaptation/skada/pull/133
* [MRG] Separate Lint and Tests + codecov configuration by rflamary in https://github.com/scikit-adaptation/skada/pull/135
* `SelectSource` and `SelectTarget` selectors by kachayev in https://github.com/scikit-adaptation/skada/pull/142
* `SelectSourceTarget` selector by kachayev in https://github.com/scikit-adaptation/skada/pull/145
* [MRG] Implementation of 1NN reweighting and reweighting example implementation by BuenoRuben in https://github.com/scikit-adaptation/skada/pull/108
* [MRG] Add predict_proba for jdot by YanisLalou in https://github.com/scikit-adaptation/skada/pull/153
* Update readme and add unique references in docstring by ambroiseodt in https://github.com/scikit-adaptation/skada/pull/154
* [MRG] CircularValidation: Re-encode y labels before training the estimator on y_source_pred by YanisLalou in https://github.com/scikit-adaptation/skada/pull/155
* [MRG] Fix Tars & MMDSConS by YanisLalou in https://github.com/scikit-adaptation/skada/pull/156
* [MRG] Add the auto/scale mode in KLIEP by YanisLalou in https://github.com/scikit-adaptation/skada/pull/157
* [TO_REVIEW] Make DeepEmbeddedValidation scorer to work with the new API by YanisLalou in https://github.com/scikit-adaptation/skada/pull/47
* [MRG] Add predict_proba to DA_SVM by YanisLalou in https://github.com/scikit-adaptation/skada/pull/158
* [MRG] Update How-to with advanced pipelines examples by rflamary in https://github.com/scikit-adaptation/skada/pull/144
* [MRG] Propagate adaptation output through multiple steps by kachayev in https://github.com/scikit-adaptation/skada/pull/149
* [MRG] Implementation of the TJM method by BuenoRuben in https://github.com/scikit-adaptation/skada/pull/140
* [TO_REVIEW] Small bug fix check_X_y_domain() + source_target_merge() by YanisLalou in https://github.com/scikit-adaptation/skada/pull/165
* Add frank-wolfe solver (v2) by antoinedemathelin in https://github.com/scikit-adaptation/skada/pull/167
* [MRG] Add kwargs to DASVM predict/predict_proba + add score func by YanisLalou in https://github.com/scikit-adaptation/skada/pull/160
* Allow multi dim inputs through the pipeline by kachayev in https://github.com/scikit-adaptation/skada/pull/170
* Fix sign for source/targets in the HowTo docs by kachayev in https://github.com/scikit-adaptation/skada/pull/172
* Make deep DA methods compatible with GPU by Florent-Michel in https://github.com/scikit-adaptation/skada/pull/174
* [MRG] Make `make_da_pipeline` work with deep methods by tgnassou in https://github.com/scikit-adaptation/skada/pull/159
* Fix issue 171 on deep_coral_loss by Florent-Michel in https://github.com/scikit-adaptation/skada/pull/182
* Update documentation by apmellot in https://github.com/scikit-adaptation/skada/pull/176
* Fix CDAN input to domain_classifier and gpu compatibility by Florent-Michel in https://github.com/scikit-adaptation/skada/pull/178
* [MRG] Fix issue of the circular validation with deep models by YanisLalou in https://github.com/scikit-adaptation/skada/pull/169
* [MRG] Major update of Adapter API by kachayev in https://github.com/scikit-adaptation/skada/pull/184
* [MRG] Add TransferSubspaceLearning method by antoinecollas in https://github.com/scikit-adaptation/skada/pull/181
* [MRG] OT Label propagation methods (classical and target shift) by rflamary in https://github.com/scikit-adaptation/skada/pull/195
* [MRG] Fix pack when y is a string by YanisLalou in https://github.com/scikit-adaptation/skada/pull/197
* [MRG] Add officehome dataset by YanisLalou in https://github.com/scikit-adaptation/skada/pull/196
* [MRG] Add Amazon review dataset to skada by YanisLalou in https://github.com/scikit-adaptation/skada/pull/185
* [TO_REVIEW] Add n_neighbours as an arg of NN RW by YanisLalou in https://github.com/scikit-adaptation/skada/pull/199
* [MRG][FIX] Add allo_source parameter to JDOTClassifier by rflamary in https://github.com/scikit-adaptation/skada/pull/202
* [MRG] Fix Circular validation for NO_DA_TARGET_ONLY by YanisLalou in https://github.com/scikit-adaptation/skada/pull/201
* [MRG] MMDLS handle case where we have X_source and no y_source by YanisLalou in https://github.com/scikit-adaptation/skada/pull/200
* [MRG] Clean datasets references by ambroiseodt in https://github.com/scikit-adaptation/skada/pull/203
* [MRG] Incorporate sklearn 1.5.0 changes that are incompatible by kachayev in https://github.com/scikit-adaptation/skada/pull/208
* [MRG] Fix subspace alignment by antoinecollas in https://github.com/scikit-adaptation/skada/pull/206
* [MRG] Fix Density Reweight + Deep embedded validation + ImportanceWeightedScorer by YanisLalou in https://github.com/scikit-adaptation/skada/pull/205
* FIX test collection don't run examples by tomMoral in https://github.com/scikit-adaptation/skada/pull/209
* Add a cov_shift_center parameter to move the center of the covariate shift by antoinedemathelin in https://github.com/scikit-adaptation/skada/pull/210
* [WIP] Center data when using transform of coral by antoinecollas in https://github.com/scikit-adaptation/skada/pull/211
* Add n_iter_max to OTLabelProp by antoinecollas in https://github.com/scikit-adaptation/skada/pull/212
* Fix test_cv np.bincount error by YanisLalou in https://github.com/scikit-adaptation/skada/pull/213
* [FIX][ENH] Fix and add more explanation to "DA validation procedures" examples by apmellot in https://github.com/scikit-adaptation/skada/pull/183
* [FIX][ENH] Improve "DA methods" examples by vloison in https://github.com/scikit-adaptation/skada/pull/188
* [MRG] Release 0.3.0, pyproject.toml an Citations files by rflamary in https://github.com/scikit-adaptation/skada/pull/215

New Contributors
* antoinecollas made their first contribution in https://github.com/scikit-adaptation/skada/pull/93
* tgnassou made their first contribution in https://github.com/scikit-adaptation/skada/pull/45
* agramfort made their first contribution in https://github.com/scikit-adaptation/skada/pull/130
* ambroiseodt made their first contribution in https://github.com/scikit-adaptation/skada/pull/154
* Florent-Michel made their first contribution in https://github.com/scikit-adaptation/skada/pull/174
* apmellot made their first contribution in https://github.com/scikit-adaptation/skada/pull/176
* tomMoral made their first contribution in https://github.com/scikit-adaptation/skada/pull/209
* vloison made their first contribution in https://github.com/scikit-adaptation/skada/pull/188

**Full Changelog**: https://github.com/scikit-adaptation/skada/compare/0.2.3...0.3.0

0.2.3

Documentatrion pre release

0.2.1

Documentation update tag

0.2

This is a first tag for SKADA. The library is still under heavy development and should not be used in production. API will definitely change in the future.

What's Changed
* Fix merge conflic in losses module by kachayev in https://github.com/scikit-adaptation/skada/pull/23
* Follow github block-quote markdown syntax for the README by kachayev in https://github.com/scikit-adaptation/skada/pull/22
* [MRG] CirleCI documentation by rflamary in https://github.com/scikit-adaptation/skada/pull/26
* [MRG] Doc circleCI by rflamary in https://github.com/scikit-adaptation/skada/pull/30
* Pipeline to respect `default_selector` parameters by kachayev in https://github.com/scikit-adaptation/skada/pull/29
* [MRG] Debug examples by rflamary in https://github.com/scikit-adaptation/skada/pull/31
* The selector to pass the params to the base estimator by kachayev in https://github.com/scikit-adaptation/skada/pull/35
* Add code cells markup for dataset examples to make them interactive by kachayev in https://github.com/scikit-adaptation/skada/pull/39
* Fix label masking for training dataset by kachayev in https://github.com/scikit-adaptation/skada/pull/40
* `make_da_pipeline` helper to allow named estimators by kachayev in https://github.com/scikit-adaptation/skada/pull/37
* Rename `pack_flatten` to `pack_for_lodo` by kachayev in https://github.com/scikit-adaptation/skada/pull/27
* [DOC] Corrections README and string for DomainAware dataset by YanisLalou in https://github.com/scikit-adaptation/skada/pull/42
* Rename Bunch keys by YanisLalou in https://github.com/scikit-adaptation/skada/pull/44
* [DOC] Small README fix by YanisLalou in https://github.com/scikit-adaptation/skada/pull/52
* [WIP] Add testing with minimal install and update it to use pip by rflamary in https://github.com/scikit-adaptation/skada/pull/48
* [FIX] Switch NotImplementedError to ValueError + New test cases by YanisLalou in https://github.com/scikit-adaptation/skada/pull/50
* Flake8 correction for _samples_generator.py by YanisLalou in https://github.com/scikit-adaptation/skada/pull/54
* Additional flake8 fixes by kachayev in https://github.com/scikit-adaptation/skada/pull/55
* Remove version for POT by kachayev in https://github.com/scikit-adaptation/skada/pull/57
* DomainAwareDataset str repr edge case handling by YanisLalou in https://github.com/scikit-adaptation/skada/pull/49
* Switch from dev0 to stable sklearn 1.4.0 by kachayev in https://github.com/scikit-adaptation/skada/pull/60
* [FIX] Unwrap expliticly given selector before generating the name for the pipeline by YanisLalou in https://github.com/scikit-adaptation/skada/pull/51
* [MRG] Make sure all API methods accept sample_domain as None by YanisLalou in https://github.com/scikit-adaptation/skada/pull/53
* Fix flake8 errors by kachayev in https://github.com/scikit-adaptation/skada/pull/62
* Doc fix by YanisLalou in https://github.com/scikit-adaptation/skada/pull/66
* [MRG] Add test cases for the Reweight class by YanisLalou in https://github.com/scikit-adaptation/skada/pull/70
* [TO_REVIEW] Using global variables instead of number by YanisLalou in https://github.com/scikit-adaptation/skada/pull/68
* [TO_REVIEW] Switch allow_source to True by default by YanisLalou in https://github.com/scikit-adaptation/skada/pull/64
* [Fix] Update flake8.yaml to actually run! by rflamary in https://github.com/scikit-adaptation/skada/pull/72
* Fix flake8 for utils and tests by kachayev in https://github.com/scikit-adaptation/skada/pull/73
* [MRG] Regression label for 2d classification data generation by BuenoRuben in https://github.com/scikit-adaptation/skada/pull/69
* [WIP] Update test suite for base selector functionality by kachayev in https://github.com/scikit-adaptation/skada/pull/74
* Fix masked inputs filtering in the base selector for regression tasks by kachayev in https://github.com/scikit-adaptation/skada/pull/86
* [MRG] JDOT Regressor by rflamary in https://github.com/scikit-adaptation/skada/pull/76
* Properly process `sample_weight` when using reweight adapters by kachayev in https://github.com/scikit-adaptation/skada/pull/90
* Remove target labels in the method comparison example by antoinedemathelin in https://github.com/scikit-adaptation/skada/pull/92
* [MRG] Modification source_target_merge function behaviours by YanisLalou in https://github.com/scikit-adaptation/skada/pull/71
* [MRG] PredictionEntropyScorer output negative scores by YanisLalou in https://github.com/scikit-adaptation/skada/pull/63

New Contributors
* kachayev made their first contribution in https://github.com/scikit-adaptation/skada/pull/23
* rflamary made their first contribution in https://github.com/scikit-adaptation/skada/pull/26
* YanisLalou made their first contribution in https://github.com/scikit-adaptation/skada/pull/42
* BuenoRuben made their first contribution in https://github.com/scikit-adaptation/skada/pull/69
* antoinedemathelin made their first contribution in https://github.com/scikit-adaptation/skada/pull/92

**Full Changelog**: https://github.com/scikit-adaptation/skada/commits/0.2

Links

Releases

Has known vulnerabilities

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.