INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
48.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
specific legal terminology and references
New Auto-Interp
Negative Logits
Isn
-0.33
-0.29
mi
-0.29
-
-0.29
\
-0.27
Isn
-0.27
démo
-0.27
MD
-0.26
off
-0.26
nawr
-0.26
POSITIVE LOGITS
betweenstory
0.88
setVerticalGroup
0.85
0.75
autorytatywna
0.68
kasarigan
0.66
<unused1>
0.66
IntoConstraints
0.66
<unused8>
0.65
<unused28>
0.65
<unused41>
0.65
Activations Density 4.184%