INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
41
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
references to HTML elements and programming syntax
New Auto-Interp
Negative Logits
-0.47
)
-0.39
<eos>
-0.36
-
-0.36
↵
-0.34
↵↵
-0.31
(
-0.31
person
-0.31
)
-0.30
\
-0.30
POSITIVE LOGITS
<unused8>
0.90
<unused41>
0.90
<pad>
0.90
<unused3>
0.90
<unused14>
0.90
<unused28>
0.90
<unused43>
0.90
<unused51>
0.90
<unused52>
0.90
<unused74>
0.90
Activations Density 3.086%