Sae-lens

Latest version: v3.7.0

Safety actively analyzes 641872 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 7

3.7.0

Feature

* feat: new saes for gemma-2b-it and feature splitting on gpt2-small-layer-8 (195) ([`5cfe382`](https://github.com/jbloomAus/SAELens/commit/5cfe382d43f028c2e4f4e7cb21a1c19abb5471d0))

3.6.0

Feature

* feat: Support Gated-SAEs (188)

* Initial draft of encoder

* Second draft of Gated SAE implementation

* Added SFN loss implementation

* Latest modification of SFN loss training setup

* fix missing config use

* dont have special sfn loss

* add hooks and reshape

* sae error term not working, WIP

* make tests pass

* add benchmark for gated

---------

Co-authored-by: Joseph Bloom <jbloomausgmail.com> ([`232c39c`](https://github.com/jbloomAus/SAELens/commit/232c39cea709ae9c4b68b204cf027fbaab385f64))

Unknown

* fix hook z loader (194) ([`cb30996`](https://github.com/jbloomAus/SAELens/commit/cb30996cf36c80bbf6f0fd529bd27262bbce13ce))

3.5.0

Feature

* feat: trigger release ([`1a4663b`](https://github.com/jbloomAus/SAELens/commit/1a4663b7eadb42682586697e72192346c66cf430))

Unknown

* Performance improvements + using multiple GPUs. (189)

* fix: no grads when filling cache

* trainer should put activations on sae device

* hack to allow sae device to be specific gpu when model is on multiple devices

* add some tests (not in CI, which check multiple GPU performance

* make formatter typer happy

* make sure SAE calls move data between devices as needed ([`400474e`](https://github.com/jbloomAus/SAELens/commit/400474eaf758d57ea4090fd08e84cbdd91d55cc4))

3.4.1

Fix

* fix: allow settings trust_remote_code for new huggingface version (187)

* fix: allow settings trust_remote_code for new huggingface version

* default to True, not none

---------

Co-authored-by: jbloomAus <jbloomausgmail.com> ([`33a612d`](https://github.com/jbloomAus/SAELens/commit/33a612d7f694390a5a4596f7e15e0c51657634ba))

3.4.0

Feature

* feat: Adding Mistral SAEs (178)

Note: normalize_activations is now a string and should be either &39;none&39;, &39;expected_average_only_in&39; (Anthropic April Update, not yet folded), &39;constant_norm_rescale&39; (Anthropic Feb update).

* Adding code to load mistral saes

* Black formatting

* Removing library changes that allowed forward pass normalization

* feat: support feb update style norm scaling for mistral saes

* Adding code to load mistral saes

* Black formatting

* Removing library changes that allowed forward pass normalization

* Adding code to load mistral saes

* Black formatting

* Removing library changes that allowed forward pass normalization

* feat: support feb update style norm scaling for mistral saes

* remove accidental inclusion

---------
Co-authored-by: jbloomAus <jbloomausgmail.com> ([`227d208`](https://github.com/jbloomAus/SAELens/commit/227d2089f6fdadb54b5554056eb7721574608b58))

Unknown

* Update README.md Slack Link Expired (this one shouldn&39;t expire) ([`209696a`](https://github.com/jbloomAus/SAELens/commit/209696a4f74007559a650ad5357c4fd923205923))

* add expected perf for pretrained (179)

Co-authored-by: jbloom-md <josephmassdynamics.com> ([`10bd9c5`](https://github.com/jbloomAus/SAELens/commit/10bd9c58fd4d731bd453e49943d40f2ac01ff0fc))

* fix progress bar updates (171) ([`4d92975`](https://github.com/jbloomAus/SAELens/commit/4d92975cc1aa8cd7485e73597fd52e13a0f8e44e))

3.3.0

Feature

* feat: updating docs and standardizing PretokenizeRunner export (176) ([`03f071b`](https://github.com/jbloomAus/SAELens/commit/03f071b41e77ef07c8f8b892e52969337f5e94aa))

Unknown

* add tutorial (175) ([`8c67c23`](https://github.com/jbloomAus/SAELens/commit/8c67c2355211910bc5054ba9bc140e98424fa026))

Page 1 of 7

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.