INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
52
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
various forms of punctuation and sentence endings
New Auto-Interp
Negative Logits
recognised
-0.37
frein
-0.37
switch
-0.36
getDoctrine
-0.36
foot
-0.35
specialise
-0.35
wield
-0.34
fertiliser
-0.33
recognise
-0.32
tua
-0.32
POSITIVE LOGITS
noDo
0.69
betweenstory
0.66
LookAnd
0.65
purpoſe
0.61
Италијани
0.59
queſta
0.59
LEGGI
0.58
sprüche
0.54
geſ
0.53
PerformLayout
0.52
Activations Density 1.561%