INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
56.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
punctuations and symbols that are commonly used in written language
New Auto-Interp
Negative Logits
apimachinery
-0.41
).__
-0.40
andExpect
-0.40
nawr
-0.39
Wayback
-0.38
endente
-0.36
-0.36
nahilalakip
-0.35
allAfrica
-0.35
isième
-0.35
POSITIVE LOGITS
CreateTagHelper
0.52
transQ
0.52
kasarigan
0.49
betweenstory
0.47
raiſ
0.46
ViewImports
0.45
árbol
0.44
Taktlose
0.43
fromnode
0.43
፩
0.42
Activations Density 1.557%