Model: gemma-2-9b-it
Layer #: 20
Steering Hook: blocks.20.hook_resid_pre
Steering Strength: 53.75
Uploader: bot-neuronpedia
Created At: 2/15/2025 1:06:43 AM
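As a rough illustration of what the Steering Hook and Steering Strength fields mean together, the sketch below adds a scaled steering direction to the residual-stream activation at every token position. It is a toy under stated assumptions, not Neuronpedia's implementation: `steer_residual` is a hypothetical helper, the vector is assumed to be normalized to unit length before scaling, and the 3-dimensional values are made up (the real residual stream is far wider).

```python
import math


def steer_residual(resid, vector, strength):
    """Return a copy of `resid` with strength * unit(vector) added at each position.

    resid:    list of token positions, each a list of floats (one residual vector).
    vector:   steering direction, same width as each residual vector.
    strength: scalar multiplier (53.75 on this page).
    Assumption: the direction is normalized before scaling.
    """
    norm = math.sqrt(sum(v * v for v in vector))
    if norm == 0:
        raise ValueError("steering vector must be non-zero")
    unit = [v / norm for v in vector]
    return [[x + strength * u for x, u in zip(pos, unit)] for pos in resid]


# Toy data: 2 token positions, width 3. All numbers are hypothetical.
resid = [[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]]
vector = [3.0, 0.0, 4.0]  # norm 5, so the unit direction is [0.6, 0.0, 0.8]
steered = steer_residual(resid, vector, strength=53.75)
```

In a real setup this addition would run inside a forward hook on the input to layer 20's residual stream, which is what the hook name blocks.20.hook_resid_pre refers to.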
Explanations
words related to programming terminology involving strings, numbers, and data structures
Negative Logits
ویکیپدی    -0.66
)':    -0.60
]]:    -0.59
})$}    -0.59
}}$}    -0.58
"]))    -0.57
"]]    -0.56
)}-    -0.55
")"    -0.55
"}},    -0.55
Positive Logits
betweenstory    0.49
Infór    0.46
gedrag    0.45
MessageTagHelper    0.44
tagHelper    0.44
desnuda    0.43
Története    0.43
skolan    0.42
őzés    0.41
transQ    0.41
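One plausible way to read the negative and positive logit lists: each token's score is the dot product of the vector with that token's unembedding column, and the lists show the lowest and highest scores. The sketch below illustrates that reading only. It assumes this projection is how the scores were produced, `top_logits` is a hypothetical helper, and the four-token vocabulary and its weights are invented.

```python
def top_logits(vector, unembed, k=2):
    """Score every token by dot(vector, unembedding column) and return
    (k most negative, k most positive) as lists of (token, score) pairs.

    unembed: dict mapping token -> list of floats (toy stand-in for W_U columns).
    """
    scores = {
        tok: sum(v * w for v, w in zip(vector, col))
        for tok, col in unembed.items()
    }
    ranked = sorted(scores.items(), key=lambda kv: kv[1])  # ascending by score
    return ranked[:k], ranked[::-1][:k]


# Hypothetical 2-dimensional vector and 4-token vocabulary.
vector = [1.0, -1.0]
unembed = {
    "a": [0.5, 0.0],
    "b": [0.0, 0.5],
    "c": [0.25, 0.25],
    "d": [-0.5, 0.5],
}
negative, positive = top_logits(vector, unembed)
```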
Activations Density: 0.004%
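To put the density figure in concrete terms, 0.004% corresponds to roughly 1 active token in 25,000. The sketch below assumes density means the fraction of sampled tokens whose activation exceeds zero; that definition, the `activation_density` helper, and the toy activations are assumptions for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is strictly above `threshold`."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy sample: 25,000 tokens, exactly one of which activates.
acts = [0.0] * 24999 + [1.7]
density = activation_density(acts)
formatted = f"{density:.3%}"
```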