INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
56.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
instances of punctuation and formatting in written text
New Auto-Interp
Negative Logits
nawr
-0.43
recognised
-0.39
indra
-0.35
referenties
-0.34
isième
-0.33
nitrous
-0.33
Tuchel
-0.32
allAfrica
-0.32
fach
-0.32
thulhu
-0.32
POSITIVE LOGITS
'\\;'
0.56
LookAnd
0.55
+#+#
0.54
InstrumentedTest
0.53
kasarigan
0.53
ValueStyle
0.50
betweenstory
0.50
fromnode
0.49
ContentAlignment
0.49
transQ
0.49
Activations Density 2.052%