INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
42.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
sequences of punctuation marks or brackets
New Auto-Interp
Negative Logits
Hipp
-0.41
wierd
-0.38
şört
-0.37
nerfed
-0.37
-0.36
Town
-0.35
thulhu
-0.35
ToBounds
-0.35
town
-0.35
grips
-0.34
POSITIVE LOGITS
LookAnd
0.77
InstrumentedTest
0.63
resourceCulture
0.59
kasarigan
0.54
queſta
0.54
Савезне
0.53
Administrativna
0.52
setVerticalGroup
0.50
PerformLayout
0.49
rativa
0.48
Activations Density 0.131%