INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
53.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
sentences containing punctuation or formatting elements
New Auto-Interp
Negative Logits
nawr
-0.52
switch
-0.38
abestanden
-0.37
RTSC
-0.37
exerting
-0.37
cesz
-0.36
culada
-0.36
exert
-0.36
-0.35
exerts
-0.35
POSITIVE LOGITS
+#+#
0.59
niająca
0.47
nød
0.47
انتهای
0.47
noDo
0.46
ContentAlignment
0.46
desmotivaciones
0.45
betweenstory
0.45
laſſen
0.45
majánló
0.44
Activations Density 1.657%