INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
51.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
sentence-ending punctuation marks or transitional phrases
New Auto-Interp
Negative Logits
brute
-0.41
vapour
-0.39
taha
-0.39
nawr
-0.38
recognised
-0.38
isième
-0.37
Brute
-0.36
Jeg
-0.36
ndale
-0.36
iddhar
-0.36
POSITIVE LOGITS
+#+#
0.54
kasarigan
0.53
setVerticalGroup
0.53
noDo
0.51
transQ
0.49
ſelf
0.46
InstrumentedTest
0.46
andererseits
0.45
Infórmanos
0.45
".";
0.45
Activations Density 1.801%