INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
37.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
formal definitions and structured statements
New Auto-Interp
Negative Logits
nawr
-0.34
↑
-0.30
başar
-0.29
pungkas
-0.28
mümkün
-0.27
kullanım
-0.27
cedo
-0.27
reszcie
-0.27
ateş
-0.26
söyl
-0.26
POSITIVE LOGITS
setVerticalGroup
0.68
+#+#
0.66
0.65
<pad>
0.63
<unused14>
0.63
<unused41>
0.63
<unused51>
0.63
<unused79>
0.63
[@BOS@]
0.63
<unused3>
0.63
Activations Density 2.374%