INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
38.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
phrases related to definitions and explanations
New Auto-Interp
Negative Logits
-0.38
person
-0.31
-
-0.31
)
-0.29
Familienname
-0.29
le
-0.29
kick
-0.28
as
-0.28
outlawed
-0.28
"
-0.28
POSITIVE LOGITS
rrggbb
0.90
setVerticalGroup
0.78
<unused52>
0.77
<unused74>
0.77
<unused14>
0.76
<unused28>
0.76
<unused41>
0.76
<unused43>
0.76
<unused47>
0.76
<unused51>
0.76
Activations Density 9.243%