INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
39
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
punctuation marks and formatting characters
New Auto-Interp
Negative Logits
nawr
-0.31
humild
-0.28
olvides
-0.27
\
-0.26
anlam
-0.26
Legături
-0.26
of
-0.25
expectativa
-0.24
-]
-0.24
ne
-0.24
POSITIVE LOGITS
propOrder
0.79
Administrativna
0.77
setVerticalGroup
0.71
wiſſen
0.69
verſch
0.69
Weiſe
0.69
unſer
0.68
<unused8>
0.67
<unused51>
0.67
<unused52>
0.67
Activations Density 3.385%