Pykeen

Latest version: v1.11.0

Safety actively analyzes 723607 Python packages for vulnerabilities to keep your Python projects secure.

Page 1 of 2

1.7.0

-------------------------------------------------------------------
This release is only compatible with PyTorch 1.10+.

New Models
~~~~~~~~~~
- Add BoxE by ralphabb in https://github.com/pykeen/pykeen/pull/618
- Add TripleRE by mberr in https://github.com/pykeen/pykeen/pull/712
- Add AutoSF by mberr in https://github.com/pykeen/pykeen/pull/713
- Add Transformer by mberr in https://github.com/pykeen/pykeen/pull/714
- Add Canonical Tensor Decomposition by mberr in https://github.com/pykeen/pykeen/pull/663
- Add (novel) Fixed Model by cthoyt in https://github.com/pykeen/pykeen/pull/691
- Add NodePiece model by mberr in https://github.com/pykeen/pykeen/pull/621

Updated Models
~~~~~~~~~~~~~~
- Update R-GCN configuration by mberr in https://github.com/pykeen/pykeen/pull/610
- Update ConvKB to ERModel by cthoyt in https://github.com/pykeen/pykeen/pull/425
- Update ComplEx to ERModel by mberr in https://github.com/pykeen/pykeen/pull/639
- Rename TranslationalInteraction to NormBasedInteraction by mberr in https://github.com/pykeen/pykeen/pull/651
- Fix generic slicing dimension by mberr in https://github.com/pykeen/pykeen/pull/683
- Rename UnstructuredModel to UM and StructuredEmbedding to SE by cthoyt in https://github.com/pykeen/pykeen/pull/721
- Allow to pass unresolved loss to `ERModel`'s `__init__` by mberr in https://github.com/pykeen/pykeen/pull/717

Representations and Initialization
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
- Add low-rank embeddings by mberr in https://github.com/pykeen/pykeen/pull/680
- Add NodePiece representation by mberr in https://github.com/pykeen/pykeen/pull/621
- Add label-based initialization using a transformer (e.g., BERT) by mberr in https://github.com/pykeen/pykeen/pull/638 and https://github.com/pykeen/pykeen/pull/652
- Add label-based representation (e.g., to update language model using KGEM) by mberr in https://github.com/pykeen/pykeen/pull/652
- Remove literal representations (use label-based initialization instead) by mberr in https://github.com/pykeen/pykeen/pull/679

Training
~~~~~~~~
- Fix displaying previous epoch's loss by mberr in https://github.com/pykeen/pykeen/pull/627
- Fix kwargs transmission on MultiTrainingCallback by Rodrigo-A-Pereira in https://github.com/pykeen/pykeen/pull/645
- Extend Callbacks by mberr in https://github.com/pykeen/pykeen/pull/609
- Add gradient clipping by mberr in https://github.com/pykeen/pykeen/pull/607
- Fix negative score shape for sLCWA by mberr in https://github.com/pykeen/pykeen/pull/624
- Fix epoch loss for loss reduction != "mean" by mberr in https://github.com/pykeen/pykeen/pull/623
- Add sLCWA support for Cross Entropy Loss by mberr in https://github.com/pykeen/pykeen/pull/704

Inference
~~~~~~~~~
- Add uncertainty estimate functions via MC dropout by mberr in https://github.com/pykeen/pykeen/pull/688
- Fix predict top k by mberr in https://github.com/pykeen/pykeen/pull/690
- Fix indexing in `predict_*` methods when using inverse relations by mberr in https://github.com/pykeen/pykeen/pull/699
- Move tensors to device for `predict_*` methods by mberr in https://github.com/pykeen/pykeen/pull/658

Trackers
~~~~~~~~
- Fix wandb logging by mberr in https://github.com/pykeen/pykeen/pull/647
- Add multi-result tracker by mberr in https://github.com/pykeen/pykeen/pull/682
- Add Python result tracker by mberr in https://github.com/pykeen/pykeen/pull/681
- Update file trackers by cthoyt in https://github.com/pykeen/pykeen/pull/629

Evaluation
~~~~~~~~~~
- Store rank count by mberr in https://github.com/pykeen/pykeen/pull/672
- Extend `evaluate()` for easier relation filtering by mberr in https://github.com/pykeen/pykeen/pull/391
- Rename sklearn evaluator and refactor evaluator code by cthoyt in https://github.com/pykeen/pykeen/pull/708
- Add additional classification metrics via `rexmex` by cthoyt in https://github.com/pykeen/pykeen/pull/668

Triples and Datasets
~~~~~~~~~~~~~~~~~~~~
- Add helper dataset with internal batching for Schlichtkrull sampling by mberr in https://github.com/pykeen/pykeen/pull/616
- Refactor splitting code and improve documentation by mberr in https://github.com/pykeen/pykeen/pull/709
- Switch `np.loadtxt` to `pandas.read_csv` by mberr in https://github.com/pykeen/pykeen/pull/695
- Add binary I/O to triples factories cthoyt in https://github.com/pykeen/pykeen/pull/665

Torch Usage
~~~~~~~~~~~
- Use `torch.finfo` to determine suitable epsilon values by mberr in https://github.com/pykeen/pykeen/pull/626
- Use `torch.isin` instead of own implementation by mberr in https://github.com/pykeen/pykeen/pull/635
- Switch to using `torch.inference_mode` instead of `torch.no_grad` by sbonner0 in https://github.com/pykeen/pykeen/pull/604

Miscellaneous
~~~~~~~~~~~~~
- Add YAML experiment format by mberr in https://github.com/pykeen/pykeen/pull/612
- Add comparison with reproduction results during replication, if available by mberr in https://github.com/pykeen/pykeen/pull/642
- Adapt hello_world notebook to API changes by dobraczka in https://github.com/pykeen/pykeen/pull/649
- Add testing configuration for Jupyter notebooks by mberr in https://github.com/pykeen/pykeen/pull/650
- Add empty default `loss_kwargs` by mali-git in https://github.com/pykeen/pykeen/pull/656
- Optional extra config for reproduce by mberr in https://github.com/pykeen/pykeen/pull/692
- Store pipeline configuration in pipeline result by mberr in https://github.com/pykeen/pykeen/pull/685
- Fix upgrade to sequence by mberr in https://github.com/pykeen/pykeen/pull/697
- Fix pruner use in `hpo_pipeline` by mberr in https://github.com/pykeen/pykeen/pull/724

Housekeeping
~~~~~~~~~~~~
- Automatically lint with black by cthoyt in https://github.com/pykeen/pykeen/pull/605
- Documentation and style guide cleanup by cthoyt in https://github.com/pykeen/pykeen/pull/606

1.6.0

-------------------------------------------------------------------
This release is only compatible with PyTorch 1.9+. Because of some changes,
it's now pretty non-trivial to support both, so moving forwards PyKEEN will
continue to support the latest version of PyTorch and try its best to keep
backwards compatibility.

New Models
~~~~~~~~~~
- DistMA (https://github.com/pykeen/pykeen/pull/507)
- TorusE (https://github.com/pykeen/pykeen/pull/510)
- Frequency Baselines (https://github.com/pykeen/pykeen/pull/514)
- Gated Distmult Literal (https://github.com/pykeen/pykeen/pull/591, thanks Rodrigo-A-Pereira)

New Datasets
~~~~~~~~~~~~
- WD50K (https://github.com/pykeen/pykeen/pull/511)
- Wikidata5M (https://github.com/pykeen/pykeen/pull/528)
- BioKG (https://github.com/pykeen/pykeen/pull/585, thanks sbonner0)

New Losses
~~~~~~~~~~
- Double Margin Loss (https://github.com/pykeen/pykeen/pull/539)
- Focal Loss (https://github.com/pykeen/pykeen/pull/542)
- Pointwise Hinge Loss (https://github.com/pykeen/pykeen/pull/540)
- Soft Pointwise Hinge Loss (https://github.com/pykeen/pykeen/pull/540)
- Pairwise Logistic Loss (https://github.com/pykeen/pykeen/pull/540)

Added
~~~~~
- Tutorial in using checkpoints when bringing your own data (https://github.com/pykeen/pykeen/pull/498)
- Learning rate scheduling (https://github.com/pykeen/pykeen/pull/492)
- Checkpoints include entity/relation maps (https://github.com/pykeen/pykeen/pull/498)
- QuatE reproducibility configurations (https://github.com/pykeen/pykeen/pull/486)

Changed
~~~~~~~
- Reimplment SE (https://github.com/pykeen/pykeen/pull/521)
and NTN (https://github.com/pykeen/pykeen/pull/522) with new-style models
- Generalize pairwise loss and pointwise loss hierarchies (https://github.com/pykeen/pykeen/pull/540)
- Update to use PyTorch 1.9 functionality (https://github.com/pykeen/pykeen/pull/489)
- Generalize generator strategies in LCWA (https://github.com/pykeen/pykeen/pull/602)

Fixed
~~~~~
- FileNotFoundError on Windows/Anaconda (https://github.com/pykeen/pykeen/pull/503, thanks Hao-666)
- Fixed docstring for ComplEx interaction (https://github.com/pykeen/pykeen/pull/504)
- Make DistMult the default interaction function for R-GCN (https://github.com/pykeen/pykeen/pull/548)
- Fix gradient error in CompGCN buffering (https://github.com/pykeen/pykeen/pull/573)
- Fix splitting of numeric triples factories (https://github.com/pykeen/pykeen/pull/594, thanks Rodrigo-A-Pereira)
- Fix determinism in spitting of triples factory (https://github.com/pykeen/pykeen/pull/500)
- Fix documentation and improve HPO suggestion (https://github.com/pykeen/pykeen/pull/524, thanks kdutia)

1.5.0

--------------------------------------------------------------------------------
New Metrics
~~~~~~~~~~~
- Adjusted Arithmetic Mean Rank Index (https://github.com/pykeen/pykeen/pull/378)
- Add harmonic, geometric, and median rankings (https://github.com/pykeen/pykeen/pull/381)

New Trackers
~~~~~~~~~~~~
- Console Tracker (https://github.com/pykeen/pykeen/pull/440)
- Tensorboard Tracker (https://github.com/pykeen/pykeen/pull/416; thanks sbonner0)

New Models
~~~~~~~~~~
- QuatE (https://github.com/pykeen/pykeen/pull/367)
- CompGCN (https://github.com/pykeen/pykeen/pull/382)
- CrossE (https://github.com/pykeen/pykeen/pull/467)
- Reimplementation of LiteralE with arbitrary combination (g) function (https://github.com/pykeen/pykeen/pull/245)

New Negative Samplers
~~~~~~~~~~~~~~~~~~~~~
- Pseudo-typed Negative Sampler (https://github.com/pykeen/pykeen/pull/412)

Datasets
~~~~~~~~
- Removed invalid datasets (OpenBioLink filtered sets; https://github.com/pykeen/pykeen/pull/https://github.com/pykeen/pykeen/pull/439)
- Added WK3k-15K (https://github.com/pykeen/pykeen/pull/403)
- Added WK3l-120K (https://github.com/pykeen/pykeen/pull/403)
- Added CN3l (https://github.com/pykeen/pykeen/pull/403)

Added
~~~~~
- Documentation on using PyKEEN in Google Colab and Kaggle (https://github.com/pykeen/pykeen/pull/379,
thanks `jerryIsHere <https://github.com/jerryIsHere>`_)
- Pass custom training loops to pipeline (https://github.com/pykeen/pykeen/pull/334)
- Compatibility later for the fft module (https://github.com/pykeen/pykeen/pull/288)
- Official Python 3.9 support, now that PyTorch has it (https://github.com/pykeen/pykeen/pull/223)
- Utilities for dataset analysis (https://github.com/pykeen/pykeen/pull/16, https://github.com/pykeen/pykeen/pull/392)
- Filtering of negative sampling now uses a bloom filter by default (https://github.com/pykeen/pykeen/pull/401)
- Optional embedding dropout (https://github.com/pykeen/pykeen/pull/422)
- Added more HPO suggestion methods and docs (https://github.com/pykeen/pykeen/pull/446)
- Training callbacks (https://github.com/pykeen/pykeen/pull/429)
- Class resolver for datasets (https://github.com/pykeen/pykeen/pull/473)

Updated
~~~~~~~
- R-GCN implementation now uses new-style models and is super idiomatic (https://github.com/pykeen/pykeen/pull/110)
- Enable passing of interaction function by string in base model class (https://github.com/pykeen/pykeen/pull/384,
https://github.com/pykeen/pykeen/pull/387)
- Bump scipy requirement to 1.5.0+
- Updated interfaces of models and negative samplers to enforce kwargs (https://github.com/pykeen/pykeen/pull/445)
- Reorganize filtering, negative sampling, and remove triples factory from most objects (
https://github.com/pykeen/pykeen/pull/400, https://github.com/pykeen/pykeen/pull/405,
https://github.com/pykeen/pykeen/pull/406, https://github.com/pykeen/pykeen/pull/409,
https://github.com/pykeen/pykeen/pull/420)
- Update automatic memory optimization (https://github.com/pykeen/pykeen/pull/404)
- Flexibly define positive triples for filtering (https://github.com/pykeen/pykeen/pull/398)
- Completely reimplemented negative sampling interface in training loops (https://github.com/pykeen/pykeen/pull/427)
- Completely reimplemented loss function in training loops (https://github.com/pykeen/pykeen/pull/448)
- Forward-compatibility of embeddings in old-style models and updated docs on
how to use embeddings (https://github.com/pykeen/pykeen/pull/474)

Fixed
~~~~~
- Regularizer passing in the pipeline and HPO (https://github.com/pykeen/pykeen/pull/345)
- Saving results when using multimodal models (https://github.com/pykeen/pykeen/pull/349)
- Add missing diagonal constraint on MuRE Model (https://github.com/pykeen/pykeen/pull/353)
- Fix early stopper handling (https://github.com/pykeen/pykeen/pull/419)
- Fixed saving results from pipeline (https://github.com/pykeen/pykeen/pull/428, thanks kantholtz)
- Fix OOM issues with early stopper and AMO (https://github.com/pykeen/pykeen/pull/433)
- Fix ER-MLP functional form (https://github.com/pykeen/pykeen/pull/444)

1.4.0

--------------------------------------------------------------------------------
New Datasets
~~~~~~~~~~~~
- Countries (https://github.com/pykeen/pykeen/pull/314)
- DB100K (https://github.com/pykeen/pykeen/issues/316)

New Models
~~~~~~~~~~
- MuRE (https://github.com/pykeen/pykeen/pull/311)
- PairRE (https://github.com/pykeen/pykeen/pull/309)
- Monotonic affine transformer (https://github.com/pykeen/pykeen/pull/324)

New Algorithms
~~~~~~~~~~~~~~
If you're interested in any of these, please get in touch with us
regarding an upcoming publication.

- Dataset Similarity (https://github.com/pykeen/pykeen/pull/294)
- Dataset Deterioration (https://github.com/pykeen/pykeen/pull/295)
- Dataset Remix (https://github.com/pykeen/pykeen/pull/296)

Added
~~~~~
- New-style models (https://github.com/pykeen/pykeen/pull/260) for direct usage of interaction
modules
- Ability to train ``pipeline()`` using an Interaction module rather than a Model
(https://github.com/pykeen/pykeen/pull/326, https://github.com/pykeen/pykeen/pull/330).

Changes
~~~~~~~
- Lookup of assets is now mediated by the ``class_resolver`` package (https://github.com/pykeen/pykeen/pull/321,
https://github.com/pykeen/pykeen/pull/327)
- The ``docdata`` package is now used to parse structured information out of the model and dataset documentation
in order to make a more informative README with links to citations (https://github.com/pykeen/pykeen/pull/303).

1.3.0

--------------------------------------------------------------------------------
We skipped version 1.2.0 because we made an accidental release before this version
was ready. We're only human, and are looking into improving our release workflow
to live in CI/CD so something like this doesn't happen again. However, as an end user,
this won't have an effect on you.

New Datasets
~~~~~~~~~~~~
- CSKG (https://github.com/pykeen/pykeen/pull/249)
- DBpedia50 (https://github.com/pykeen/pykeen/issues/278)

New Trackers
~~~~~~~~~~~~
- General file-based Tracker (https://github.com/pykeen/pykeen/pull/254)
- CSV Tracker (https://github.com/pykeen/pykeen/pull/254)
- JSON Tracker (https://github.com/pykeen/pykeen/pull/254)

Fixed
~~~~~
- Fixed ComplEx's implementation (https://github.com/pykeen/pykeen/pull/313)
- Fixed OGB's reuse entity identifiers (https://github.com/pykeen/pykeen/pull/318, thanks tgebhart)

Added
~~~~~
- ``pykeen version`` command for more easily reporting your environment in issues
(https://github.com/pykeen/pykeen/issues/251)
- Functional forms of all interaction models (e.g., TransE, RotatE) (https://github.com/pykeen/pykeen/issues/238,
`pykeen.nn.functional documentation <https://pykeen.readthedocs.io/en/latest/reference/nn/functional.html>`_). These
can be generally reused, even outside of the typical PyKEEN workflows.
- Modular forms of all interaction models (https://github.com/pykeen/pykeen/issues/242,
`pykeen.nn.modules documentation <https://pykeen.readthedocs.io/en/latest/reference/nn/modules.html>`_). These wrap
the functional forms of interaction models and store hyper-parameters such as the ``p`` value for the L_p norm in
TransE.
- The initializer, normalizer, and constrainer for the entity and relation embeddings are now exposed through the
``__init__()`` function of each KGEM class and can be configured. A future update will enable HPO on these as well
(https://github.com/pykeen/pykeen/issues/282).

Refactoring and Future Preparation
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This release contains a few big refactors. Most won't affect end-users, but if you're writing your own PyKEEN
models, these are important. Many of them are motivated to make it possible to introduce a new interface that makes
it much easier for researchers (who shouldn't have to understand the inner workings of PyKEEN) to make new models.

- The regularizer has been refactored (https://github.com/pykeen/pykeen/issues/266,
https://github.com/pykeen/pykeen/issues/274). It no longer accepts a ``torch.device`` when instantiated.
- The ``pykeen.nn.Embedding`` class has been improved in several ways:
- Embedding Specification class makes it easier to write new classes (https://github.com/pykeen/pykeen/issues/277)
- Refactor to make shape of embedding explicit (https://github.com/pykeen/pykeen/issues/287)
- Specification of complex datatype (https://github.com/pykeen/pykeen/issues/292)
- Refactoring of the loss model class to provide a meaningful class hierarchy
(https://github.com/pykeen/pykeen/issues/256, https://github.com/pykeen/pykeen/issues/262)
- Refactoring of the base model class to provide a consistent interface (https://github.com/pykeen/pykeen/issues/246,
https://github.com/pykeen/pykeen/issues/248, https://github.com/pykeen/pykeen/issues/253,
https://github.com/pykeen/pykeen/issues/257). This allowed for simplification of the loss computation based on
the new hierarchy and also new implementation of regularizer class.
- More automated testing of typing with MyPy (https://github.com/pykeen/pykeen/issues/255) and automated checking
of documentation with ``doctests`` (https://github.com/pykeen/pykeen/issues/291)

Triples Loading
~~~~~~~~~~~~~~~
We've made some improvements to the ``pykeen.triples.TriplesFactory`` to facilitate loading even larger datasets
(https://github.com/pykeen/pykeen/issues/216). However, this required an interface change. This will affect any
code that loads custom triples. If you're loading triples from a path, you should now use:

.. code-block:: python

path = ...

Old (doesn't work anymore)
tf = TriplesFactory(path=path)

New
tf = TriplesFactory.from_path(path)

Predictions
~~~~~~~~~~~
While refactoring the base model class, we excised the prediction functionality to a new module
``pykeen.models.predict`` (docs: https://pykeen.readthedocs.io/en/latest/reference/predict.html#functions).
We also renamed some of the prediction functions inside the base model to make them more consistent, but we now
recommend you use the functions from ``pykeen.models.predict`` instead.

- ``Model.predict_heads()`` -> ``Model.get_head_prediction_df()``
- ``Model.predict_relations()`` -> ``Model.get_head_prediction_df()``
- ``Model.predict_tails()`` -> ``Model.get_head_prediction_df()``
- ``Model.score_all_triples()`` -> ``Model.get_all_prediction_df()``

Fixed
~~~~~
- Do not create inverse triples for validation and testing factory (https://github.com/pykeen/pykeen/issues/270)
- Treat nonzero applied to large tensor error as OOM for batch size search (https://github.com/pykeen/pykeen/issues/279)
- Fix bug in loading ConceptNet (https://github.com/pykeen/pykeen/issues/290). If your experiments relied on this
dataset, you should rerun them.

1.1.0

--------------------------------------------------------------------------------
New Datasets
~~~~~~~~~~~~
- CoDEx (https://github.com/pykeen/pykeen/pull/154)
- DRKG (https://github.com/pykeen/pykeen/pull/156)
- OGB (https://github.com/pykeen/pykeen/pull/159)
- ConceptNet (https://github.com/pykeen/pykeen/pull/160)
- Clinical Knowledge Graph (https://github.com/pykeen/pykeen/pull/209)

New Trackers
~~~~~~~~~~~~
- Neptune.ai (https://github.com/pykeen/pykeen/pull/183)

Added
~~~~~
- Add MLFlow set tags function (https://github.com/pykeen/pykeen/pull/139; thanks sunny1401)
- Add score_t/h function for ComplEx (https://github.com/pykeen/pykeen/pull/150)
- Add proper testing for literal datasets and literal models (https://github.com/pykeen/pykeen/pull/199)
- Checkpoint functionality (https://github.com/pykeen/pykeen/pull/123)
- Random triple generation (https://github.com/pykeen/pykeen/pull/201)
- Make negative sampler corruption scheme configurable (https://github.com/pykeen/pykeen/pull/209)
- Add predict with inverse tripels pipeline (https://github.com/pykeen/pykeen/pull/208)
- Add generalize p-norm to regularizer (https://github.com/pykeen/pykeen/pull/225)

Changed
~~~~~~~
- New harness for resetting parameters (https://github.com/pykeen/pykeen/pull/131)
- Modularize embeddings (https://github.com/pykeen/pykeen/pull/132)
- Update first steps documentation (https://github.com/pykeen/pykeen/pull/152; thanks TobiasUhmann )
- Switched testing to GitHub Actions (https://github.com/pykeen/pykeen/pull/165 and
https://github.com/pykeen/pykeen/pull/194)
- No longer support Python 3.6
- Move automatic memory optimization (AMO) option out of model and into
training loop (https://github.com/pykeen/pykeen/pull/176)
- Improve hyper-parameter defaults and HPO defaults (https://github.com/pykeen/pykeen/pull/181
and https://github.com/pykeen/pykeen/pull/179)
- Switch internal usage to ID-based triples (https://github.com/pykeen/pykeen/pull/193 and
https://github.com/pykeen/pykeen/pull/220)
- Optimize triples splitting algorithm (https://github.com/pykeen/pykeen/pull/187)
- Generalize metadata storage in triples factory (https://github.com/pykeen/pykeen/pull/211)
- Add drop_last option to data loader in training loop (https://github.com/pykeen/pykeen/pull/217)

Fixed
~~~~~
- Whitelist support in HPO pipeline (https://github.com/pykeen/pykeen/pull/124)
- Improve evaluator instantiation (https://github.com/pykeen/pykeen/pull/125; thanks kantholtz)
- CPU fallback on AMO (https://github.com/pykeen/pykeen/pull/232)
- Fix HPO save issues (https://github.com/pykeen/pykeen/pull/235)
- Fix GPU issue in plotting (https://github.com/pykeen/pykeen/pull/207)

Page 1 of 2

Releases

Has known vulnerabilities

Pykeen

Page 1 of 2

1.7.0

1.6.0

1.5.0

1.4.0

1.3.0

1.1.0

Page 1 of 2

Links

Releases