INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
44.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
text that makes reference to statistics or numerical data
New Auto-Interp
Negative Logits
recognised
-0.41
-0.41
):
-0.39
)
-0.38
"
-0.36
ag
-0.36
current
-0.35
'
-0.34
newArrayList
-0.34
switch
-0.34
POSITIVE LOGITS
'\\;'
0.72
ainfi
0.70
noDo
0.70
PerformLayout
0.69
queſta
0.68
LookAnd
0.67
ſelf
0.62
addCriterion
0.62
ſtand
0.61
Infórmanos
0.61
Activations Density 0.790%