INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
36
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
phrases that indicate legal concepts or procedural terms
New Auto-Interp
Negative Logits
miteinander
-0.32
nawr
-0.30
\
-0.30
foot
-0.29
pungkas
-0.28
)
-0.26
nyingi
-0.25
Gericht
-0.24
-0.24
↵
-0.24
POSITIVE LOGITS
betweenstory
0.84
rrggbb
0.81
<unused1>
0.80
<unused3>
0.80
<unused14>
0.80
<unused23>
0.80
<unused28>
0.80
<unused43>
0.80
<unused47>
0.80
<unused51>
0.80
Activations Density 2.521%