The v0.8.0 release of Captum offers new influence functions for data attribution, improvements to feature attribution methods (including LLM prompt attribution), enhanced type annotations for modern Python type checking, and a variety of other small changes. Note that support for Python 3.8 and PyTorch 1.10 has been dropped, and Captum Insights will be deprecated in the next major release.
## Data Attribution: New Influence Functions
This version offers two different implementations that both calculate the "infinitesimal" influence score as defined in the paper ["Understanding Black-box Predictions via Influence Functions"](https://arxiv.org/pdf/1703.04730.pdf).
- `NaiveInfluenceFunction`: a computationally slow but exact implementation that is useful for obtaining "ground truth" (though note that influence scores are themselves only an approximation of the effect of removing a training example and retraining). Several papers use this approach, e.g. ["Learning Augmentation Network via Influence Functions"](https://openaccess.thecvf.com/content_CVPR_2020/papers/Lee_Learning_Augmentation_Network_via_Influence_Functions_CVPR_2020_paper.pdf), ["Quantifying and Mitigating the Impact of Label Errors on Model Disparity Metrics"](https://openreview.net/forum?id=RUzSobdYy0V), and ["Achieving Fairness at No Utility Cost via Data Reweighting with Influence"](https://proceedings.mlr.press/v162/li22p/li22p.pdf) (PR https://github.com/pytorch/captum/pull/1214)
- `ArnoldiInfluenceFunction`: a computationally efficient implementation described in the paper ["Scaling Up Influence Functions"](https://arxiv.org/pdf/2112.03052.pdf) by Schioppa et al. (PR https://github.com/pytorch/captum/pull/1187)
Example:
```python
from captum.influence._core.influence_function import NaiveInfluenceFunction
from torch import nn
from torch.utils.data import DataLoader

train_dl = DataLoader(your_dataset, batch_size=8)  # your dataloader
criterion = nn.MSELoss(reduction="none")

influence = NaiveInfluenceFunction(
    net,
    train_dl,
    checkpoint_path,  # path to your model checkpoint
    loss_fn=criterion,
    batch_size=batch_size,
)

# compute pairwise influences using the influence implementation
influence_train_test_influences = influence.influence(
    (test_samples, test_labels)  # your test data (Tensors)
)
```
### What is the "infinitesimal" influence score?
This "infinitesimal" influence score approximately answers the question: if a given training example were infinitesimally down-weighted and the model re-trained to optimality, how much would the loss on a given test example change? Mathematically, this influence score is given by $\nabla_\theta L(x)' H^{-1} \nabla_\theta L(z)$, where $\nabla_\theta L(x)$ is the gradient of the loss on a training example $x$ with respect to (a subset of) the model parameters $\theta$, $\nabla_\theta L(z)$ is the analogous quantity for a test example $z$, and $H$ is the Hessian of the loss with respect to the (subset of) model parameters at a given model checkpoint.
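As a concrete illustration of this formula, here is a minimal, self-contained sketch that computes the score directly for a toy linear model; the model, data, and variable names are made up for illustration and are not part of the Captum API.

```python
import torch

torch.manual_seed(0)

# Toy setup: parameters theta of a small linear model, with squared-error loss
theta = torch.randn(3, requires_grad=True)

def loss(example, params):
    features, label = example
    return 0.5 * (params @ features - label) ** 2

# Hypothetical training set and a single test example
train_examples = [(torch.randn(3), torch.randn(())) for _ in range(10)]
test_example = (torch.randn(3), torch.randn(()))

# Hessian H of the total training loss with respect to the parameters
def total_loss(params):
    return sum(loss(ex, params) for ex in train_examples)

H = torch.autograd.functional.hessian(total_loss, theta.detach())

def grad_loss(example):
    # Gradient of the loss on a single example w.r.t. the parameters
    g, = torch.autograd.grad(loss(example, theta), theta)
    return g

# Influence of each training example x on the test example z:
# grad L(x)' H^{-1} grad L(z)
g_test = grad_loss(test_example)
influences = [grad_loss(x) @ torch.linalg.solve(H, g_test) for x in train_examples]
print(influences)
```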
### What the two implementations have in common
Both implementations compute a low-rank approximation of the inverse Hessian, i.e. a tall and skinny matrix $R$ with width $k$ such that $H^{-1} \approx RR'$, where $k$ is small. In particular, let $L$ be the matrix of width $k$ whose columns contain the top-$k$ eigenvectors of $H$, and let $V$ be the $k \times k$ diagonal matrix whose diagonal contains the corresponding eigenvalues. Both implementations let $R = LV^{-1/2}$, so that $RR' = LV^{-1}L' \approx H^{-1}$. Thus, the core computational step is computing the top-$k$ eigenvalues / eigenvectors (a small numerical sketch of this approximation appears after the list below).
This approximation is useful for several reasons:
- It avoids numerical issues associated with inverting small eigenvalues
- Since the influence score is given by $\nabla_\theta L(x)' H^{-1} \nabla_\theta L(z)$, which is approximated by $(\nabla_\theta L(x)' R)(\nabla_\theta L(z)' R)'$, we can compute an "influence embedding" for a given example $x$, $\nabla_\theta L(x)' R$, such that the influence score of one example on another is approximately the dot product of their respective embeddings. Because $k$ is small, e.g. 50, these influence embeddings are low-dimensional.
- Even for large models, we can store $R$ in memory, provided $k$ is small. This means influence embeddings (and thus influence scores) can be efficiently computed by doing a backwards pass to compute $\nabla_\theta L(x)$ and then multiplying by $R'$. This is orders of magnitude faster than the earlier LiSSA approach of Koh et al., which, to compute the influence score involving a given example, needs to compute Hessian-vector products involving on the order of $10^4$ examples.
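Below is a minimal numerical sketch of this low-rank scheme, using a random symmetric positive-definite matrix as a stand-in for the Hessian and random vectors as stand-ins for gradients; none of the names here come from the Captum API.

```python
import torch

torch.manual_seed(0)
d, k = 100, 5  # parameter dimension, rank of the approximation

# Stand-in for the Hessian: a random symmetric positive-definite matrix
A = torch.randn(d, d)
H = A @ A.T + d * torch.eye(d)

# Top-k eigendecomposition: columns of L_mat are eigenvectors, V the eigenvalues
eigenvalues, eigenvectors = torch.linalg.eigh(H)  # ascending order
L_mat = eigenvectors[:, -k:]                      # top-k eigenvectors, shape (d, k)
V = eigenvalues[-k:]                              # top-k eigenvalues, shape (k,)

# R = L V^{-1/2}, so that R R' = L V^{-1} L' approximates H^{-1}
R = L_mat * V.rsqrt()

# Influence embeddings: grad' R is only k-dimensional per example
grad_x = torch.randn(d)  # stand-in for a training-example gradient
grad_z = torch.randn(d)  # stand-in for a test-example gradient
emb_x = grad_x @ R
emb_z = grad_z @ R

approx_influence = emb_x @ emb_z                  # dot product of embeddings
exact_influence = grad_x @ torch.linalg.solve(H, grad_z)
print(approx_influence.item(), exact_influence.item())
```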
The implementations differ in how they compute the top-k eigenvalues / eigenvectors.
### How `NaiveInfluenceFunction` computes the top-k eigenvalues / eigenvectors
It is "naive" in that it computes the top-k eigenvalues / eigenvectors by explicitly forming the Hessian, converting it to a 2D tensor, computing its eigenvectors / eigenvalues, and then sorting. See documentation of the `_set_projections_naive_influence_function` method for more details.
### How `ArnoldiInfluenceFunction` computes the top-k eigenvalues / eigenvectors
The key novelty of the approach by Schioppa et al. is that it uses the Arnoldi iteration to find the top-k eigenvalues / eigenvectors of the Hessian without explicitly forming the Hessian. In more detail, the approach first runs the Arnoldi iteration, which only requires the ability to compute Hessian-vector products, to find a Krylov subspace of moderate dimension, e.g. 200. It then finds the top-k eigenvalues / eigenvectors of the restriction of the Hessian to that subspace, where k is small, e.g. 50. Finally, it expresses those eigenvectors in the original basis. This approach is justified by a property of the Arnoldi iteration: the Krylov subspace it returns tends to contain the top eigenvectors.
This implementation does incur some one-time overhead in `__init__`, where it runs the Arnoldi iteration to calculate $R$. After that overhead, calculating influence scores is fast, requiring only a backwards pass and a multiplication per example.
Unlike `NaiveInfluenceFunction`, this implementation does not flatten any parameters, as the 2D Hessian is never formed, and PyTorch's Hessian-vector product implementation (`torch.autograd.functional.hvp`) allows the input and output vectors to be tuples of tensors. Avoiding flattening / unflattening parameters brings scalability gains.
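For intuition, here is a minimal sketch of the Arnoldi idea using only Hessian-vector products via `torch.autograd.functional.hvp`; it is not the Captum implementation, and the toy loss, dimensions, and helper names are illustrative.

```python
import torch

torch.manual_seed(0)
d = 50
xs, ys = torch.randn(64, d), torch.randn(64)

def total_loss(w):
    return torch.nn.functional.mse_loss(xs @ w, ys)

w0 = torch.randn(d)  # point at which the (implicit) Hessian is taken

def hvp(v):
    # Hessian-vector product without ever forming the d x d Hessian
    _, hv = torch.autograd.functional.hvp(total_loss, w0, v)
    return hv

def arnoldi(hvp_fn, dim, n_iter):
    # Build an orthonormal basis Q of a Krylov subspace and the small projected
    # matrix T = Q' H Q (for a symmetric H this is essentially Lanczos)
    Q = [torch.randn(dim)]
    Q[0] = Q[0] / Q[0].norm()
    T = torch.zeros(n_iter + 1, n_iter)
    for j in range(n_iter):
        v = hvp_fn(Q[j])
        for i in range(j + 1):
            T[i, j] = Q[i] @ v
            v = v - T[i, j] * Q[i]
        T[j + 1, j] = v.norm()
        Q.append(v / T[j + 1, j])
    return torch.stack(Q[:-1], dim=1), T[:n_iter, :]  # (dim, n_iter), (n_iter, n_iter)

Q, T = arnoldi(hvp, d, n_iter=20)

# Top-k eigenpairs of the restriction, expressed back in the original basis
k = 5
vals, vecs = torch.linalg.eigh((T + T.T) / 2)  # symmetrize against numerical error
top_eigenvalues = vals[-k:]
top_eigenvectors = Q @ vecs[:, -k:]
print(top_eigenvalues)
```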
## Feature Attribution Improvements
- Added initial support for asynchronous attribution (PyTorch [futures](https://pytorch.org/docs/stable/futures.html)) for the following methods (PRs https://github.com/pytorch/captum/pull/1295, https://github.com/pytorch/captum/pull/1316, https://github.com/pytorch/captum/pull/1317, https://github.com/pytorch/captum/pull/1314, https://github.com/pytorch/captum/pull/1320, https://github.com/pytorch/captum/pull/1326, https://github.com/pytorch/captum/pull/1335, https://github.com/pytorch/captum/pull/1487):
- FeatureAblation
- FeaturePermutation
- ShapleyValueSampling
- ShapleyValues
- Added support for additional gradient-based LLM attribution methods (PRs https://github.com/pytorch/captum/pull/1337, https://github.com/pytorch/captum/pull/1420):
- LayerGradientXActivation
- LayerGradientShap
- Added support to perturbation-based LLM attribution for “key and value” [caching](https://huggingface.co/docs/transformers/main/en/kv_cache) (PRs https://github.com/pytorch/captum/pull/1224, https://github.com/pytorch/captum/pull/1341, https://github.com/pytorch/captum/pull/1343, https://github.com/pytorch/captum/pull/1353)
- Added support for passing gradient keyword arguments to the following `captum.attr` methods through the `grad_kwargs` argument (PRs https://github.com/pytorch/captum/pull/1286, https://github.com/pytorch/captum/pull/1294, https://github.com/pytorch/captum/pull/1435):
- LayerGradCam
- InternalInfluence
- LayerConductance
- LayerDeepLift
- LayerGradientShap
- NeuronConductance
- LayerGradientXActivation
- LayerIntegratedGradients
- Added a tutorial for perturbation- and gradient-based LLM attribution (tutorials/Llama2_LLM_Attribution.ipynb) (PRs https://github.com/pytorch/captum/pull/1228, https://github.com/pytorch/captum/pull/1333, https://github.com/pytorch/captum/pull/1445)
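As an illustration of the perturbation-based LLM attribution workflow covered in that tutorial, the sketch below loosely follows it; the model checkpoint, prompt template, and target text are placeholders, and the method names should be checked against the current `captum.attr` API.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from captum.attr import FeatureAblation, LLMAttribution, TextTemplateInput

# Placeholder checkpoint; the tutorial uses a Llama 2 chat model
model_name = "meta-llama/Llama-2-7b-chat-hf"
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Wrap a perturbation-based attribution method for prompt-level attribution
llm_attr = LLMAttribution(FeatureAblation(model), tokenizer)

# Attribute the target text to the interchangeable parts of the prompt template
inp = TextTemplateInput(
    template="{} lives in {}, {} and is a {}. Personal interests include",
    values=["Dave", "Palm Coast", "FL", "lawyer"],
)
attr_result = llm_attr.attribute(inp, target="playing golf, hiking, and cooking.")
attr_result.plot_token_attr(show=True)  # visualize per-token attributions
```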

## Changes to Requirements
- We have dropped support for Python < 3.8 and PyTorch < 1.10 (PRs https://github.com/pytorch/captum/pull/1460, https://github.com/pytorch/captum/pull/1298, https://github.com/pytorch/captum/pull/1305)
- We plan to deprecate Captum Insights in the next major release (PR https://github.com/pytorch/captum/pull/1498)
## Improvements to Type Annotations
Greatly improved typing throughout the library, now supporting and complying with the latest versions of both pyre and mypy type checking (PRs https://github.com/pytorch/captum/pull/1371, https://github.com/pytorch/captum/pull/1318, https://github.com/pytorch/captum/pull/1319, https://github.com/pytorch/captum/pull/1324, https://github.com/pytorch/captum/pull/1247, https://github.com/pytorch/captum/pull/1270, https://github.com/pytorch/captum/pull/1299, https://github.com/pytorch/captum/pull/1330, https://github.com/pytorch/captum/pull/1356, https://github.com/pytorch/captum/pull/1359, https://github.com/pytorch/captum/pull/1377, https://github.com/pytorch/captum/pull/1389, https://github.com/pytorch/captum/pull/1381, https://github.com/pytorch/captum/pull/1382, https://github.com/pytorch/captum/pull/1383, https://github.com/pytorch/captum/pull/1406, https://github.com/pytorch/captum/pull/1405, https://github.com/pytorch/captum/pull/1404, https://github.com/pytorch/captum/pull/1403, https://github.com/pytorch/captum/pull/1402, https://github.com/pytorch/captum/pull/1401, https://github.com/pytorch/captum/pull/1400, https://github.com/pytorch/captum/pull/1399, https://github.com/pytorch/captum/pull/1398, https://github.com/pytorch/captum/pull/1397, https://github.com/pytorch/captum/pull/1396, https://github.com/pytorch/captum/pull/1395, https://github.com/pytorch/captum/pull/1394, https://github.com/pytorch/captum/pull/1393, https://github.com/pytorch/captum/pull/1392, https://github.com/pytorch/captum/pull/1391, https://github.com/pytorch/captum/pull/1390, https://github.com/pytorch/captum/pull/1385, https://github.com/pytorch/captum/pull/1412, https://github.com/pytorch/captum/pull/1409, https://github.com/pytorch/captum/pull/1411, https://github.com/pytorch/captum/pull/1418, https://github.com/pytorch/captum/pull/1416, https://github.com/pytorch/captum/pull/1415, https://github.com/pytorch/captum/pull/1414, https://github.com/pytorch/captum/pull/1421, https://github.com/pytorch/captum/pull/1424, https://github.com/pytorch/captum/pull/1365, https://github.com/pytorch/captum/pull/1427, https://github.com/pytorch/captum/pull/1425, https://github.com/pytorch/captum/pull/1428, https://github.com/pytorch/captum/pull/1433, https://github.com/pytorch/captum/pull/1434, https://github.com/pytorch/captum/pull/1431, https://github.com/pytorch/captum/pull/1437, https://github.com/pytorch/captum/pull/1438, https://github.com/pytorch/captum/pull/1439, https://github.com/pytorch/captum/pull/1441, https://github.com/pytorch/captum/pull/1448, https://github.com/pytorch/captum/pull/1453, https://github.com/pytorch/captum/pull/1455, https://github.com/pytorch/captum/pull/1459, https://github.com/pytorch/captum/pull/1457, https://github.com/pytorch/captum/pull/1458, https://github.com/pytorch/captum/pull/1461, https://github.com/pytorch/captum/pull/1462, https://github.com/pytorch/captum/pull/1463, https://github.com/pytorch/captum/pull/1464, https://github.com/pytorch/captum/pull/1465, https://github.com/pytorch/captum/pull/1466, https://github.com/pytorch/captum/pull/1467, https://github.com/pytorch/captum/pull/1469, https://github.com/pytorch/captum/pull/1470, https://github.com/pytorch/captum/pull/1471, https://github.com/pytorch/captum/pull/1472, https://github.com/pytorch/captum/pull/1474, https://github.com/pytorch/captum/pull/1475, https://github.com/pytorch/captum/pull/1476, https://github.com/pytorch/captum/pull/1477, https://github.com/pytorch/captum/pull/1479, 
https://github.com/pytorch/captum/pull/1480, https://github.com/pytorch/captum/pull/1481, https://github.com/pytorch/captum/pull/1482, https://github.com/pytorch/captum/pull/1503, https://github.com/pytorch/captum/pull/1502)
## Minor Changes and Fixes
- Added a fix to IntegratedGradients to fully support the MPS backend (PR https://github.com/pytorch/captum/pull/1227)
- Added support for the latest version of the black code formatter (PR https://github.com/pytorch/captum/pull/1241)
- Improved the test case coverage, logic, stability, and speed across Captum, especially for layer-based attribution methods, LLM attribution, and captum.influence methods and utilities (PRs https://github.com/pytorch/captum/pull/1250, https://github.com/pytorch/captum/pull/1251, https://github.com/pytorch/captum/pull/1253, https://github.com/pytorch/captum/pull/1258, https://github.com/pytorch/captum/pull/1243, https://github.com/pytorch/captum/pull/1249, https://github.com/pytorch/captum/pull/1252, https://github.com/pytorch/captum/pull/1259, https://github.com/pytorch/captum/pull/1260, https://github.com/pytorch/captum/pull/1262, https://github.com/pytorch/captum/pull/1264, https://github.com/pytorch/captum/pull/1265, https://github.com/pytorch/captum/pull/1272, https://github.com/pytorch/captum/pull/1300, https://github.com/pytorch/captum/pull/1301, https://github.com/pytorch/captum/pull/1302, https://github.com/pytorch/captum/pull/1323, https://github.com/pytorch/captum/pull/1352, https://github.com/pytorch/captum/pull/1362, https://github.com/pytorch/captum/pull/1364, https://github.com/pytorch/captum/pull/1388, https://github.com/pytorch/captum/pull/1408, https://github.com/pytorch/captum/pull/1410, https://github.com/pytorch/captum/pull/1419, https://github.com/pytorch/captum/pull/1422, https://github.com/pytorch/captum/pull/1436, https://github.com/pytorch/captum/pull/1454, https://github.com/pytorch/captum/pull/1484, https://github.com/pytorch/captum/pull/1485, https://github.com/pytorch/captum/pull/1492)
- Improved LLM attribution plotting aesthetics and text readability (PRs https://github.com/pytorch/captum/pull/1348, https://github.com/pytorch/captum/pull/1349, https://github.com/pytorch/captum/pull/1351, https://github.com/pytorch/captum/pull/1354, https://github.com/pytorch/captum/pull/1355, https://github.com/pytorch/captum/pull/1360, https://github.com/pytorch/captum/pull/1417)
- Freed autograd graphs between LLM attribution calls (PR https://github.com/pytorch/captum/pull/1347)
- Fixed a data type bug in the Titanic tutorial (tutorials/Titanic_Basic_Interpret.ipynb) (PR https://github.com/pytorch/captum/pull/1331)
- Fixed multiple device-related bugs for feature ablation/permutation masks and LLM attribution (PRs https://github.com/pytorch/captum/pull/1245, https://github.com/pytorch/captum/pull/1307)
- Reduced the complexity of various functions throughout Captum (PRs https://github.com/pytorch/captum/pull/1368, https://github.com/pytorch/captum/pull/1372, https://github.com/pytorch/captum/pull/1369, https://github.com/pytorch/captum/pull/1370, https://github.com/pytorch/captum/pull/1374, https://github.com/pytorch/captum/pull/1375, https://github.com/pytorch/captum/pull/1376, https://github.com/pytorch/captum/pull/1378, https://github.com/pytorch/captum/pull/1380, https://github.com/pytorch/captum/pull/1384, https://github.com/pytorch/captum/pull/1407)
- Fixed a bug in the tutorial parsing script (PR https://github.com/pytorch/captum/pull/1268)