Explanations

Proper nouns

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.15.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted

Negative Logits

"olta": -0.08

"ته": -0.08

" Ко": -0.06

"تهم": -0.06

" आप": -0.06

"ряд": -0.06

" Abram": -0.06

" гли": -0.06

" beni": -0.06

"帝": -0.06

Positive Logits

"=<?=$": 0.07

"[d": 0.06

" maize": 0.06

"notes": 0.06

".createElement": 0.06

" Symphony": 0.06

" resetting": 0.06

"<ArrayList": 0.06

"++++++++++++++++++++++++++++++++": 0.06

"зано": 0.06

Activations Density: 0.049%
