Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
57.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
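The hook and strength fields above describe an additive intervention on the residual stream entering layer 20. A minimal sketch of that idea in plain Python follows. It assumes the vector is simply scaled by the strength and added at every token position; whether the page normalizes the vector first is not stated here. The function name `steer_residual` and the toy values are hypothetical.

```python
from typing import List

HOOK_NAME = "blocks.20.hook_resid_pre"  # hook point listed above
STRENGTH = 57.75                        # steering strength listed above


def steer_residual(resid: List[List[float]],
                   vector: List[float],
                   strength: float = STRENGTH) -> List[List[float]]:
    """Return a copy of resid with strength * vector added at every position.

    resid  : one row per token position, each of length d_model
    vector : steering direction of length d_model (placeholder values here)
    """
    for row in resid:
        if len(row) != len(vector):
            raise ValueError("residual width must match the vector length")
    return [[x + strength * v for x, v in zip(row, vector)] for row in resid]


# Toy example with d_model = 3 and two token positions (not real model data).
toy_resid = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]
toy_vector = [1.0, 0.0, -1.0]
steered = steer_residual(toy_resid, toy_vector, strength=2.0)
```

In a real run the same addition would be registered as a forward hook at `HOOK_NAME`, with the raw vector from the page in place of `toy_vector`.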
Explanations
distinct punctuation marks and formatting in written content
Negative Logits
Token               Logit
[token not shown]   -0.35
)                   -0.31
'                   -0.30
comp                -0.29
\                   -0.28
energy              -0.28
bool                -0.28
switch              -0.28
/                   -0.28
st                  -0.27
Positive Logits
Token               Logit
+#+#                0.78
kasarigan           0.75
queſta              0.75
ſelf                0.74
majánló             0.73
laſſen              0.73
<unused79>          0.73
<unused8>           0.72
<unused28>          0.72
<unused41>          0.72
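Lists like the negative and positive logits above are commonly produced by projecting the vector through the model's unembedding matrix and keeping the most extreme tokens at each end. The sketch below illustrates that ranking on a toy vocabulary. It assumes a plain dot product with no normalization, which may differ from how the page computes its values; `token_logits`, `top_and_bottom`, and the toy unembedding are hypothetical.

```python
from typing import Dict, List, Tuple


def token_logits(vector: List[float],
                 unembed: Dict[str, List[float]]) -> Dict[str, float]:
    """Dot the vector with each token's unembedding column."""
    return {tok: sum(v * w for v, w in zip(vector, col))
            for tok, col in unembed.items()}


def top_and_bottom(logits: Dict[str, float],
                   k: int) -> Tuple[List[Tuple[str, float]],
                                    List[Tuple[str, float]]]:
    """Return the k most positive and the k most negative (token, logit) pairs."""
    ranked = sorted(logits.items(), key=lambda kv: kv[1])
    bottom = ranked[:k]        # most negative first
    top = ranked[::-1][:k]     # most positive first
    return top, bottom


# Toy 2-dimensional example (not real model weights).
toy_unembed = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [0.5, 0.5], "d": [2.0, 0.0]}
toy_logits = token_logits([1.0, -1.0], toy_unembed)
top2, bottom2 = top_and_bottom(toy_logits, k=2)
```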
Activations Density 2.082%
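A density figure of this kind can be read as the share of sampled tokens on which the direction is active. The sketch below assumes "active" means an activation strictly above a threshold of zero; the exact definition and the sample behind the figure are not given here, and `activation_density` is a hypothetical helper.

```python
from typing import Sequence


def activation_density(activations: Sequence[float],
                       threshold: float = 0.0) -> float:
    """Fraction of activations strictly greater than threshold."""
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample: one active token out of four (not real data).
toy_density = activation_density([0.0, 0.0, 1.5, 0.0])
toy_density_text = f"{toy_density:.3%}"
```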