INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
57.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to health metrics and outcomes
New Auto-Interp
Negative Logits
ModelExpression
-0.69
Personendaten
-0.69
WriteTagHelper
-0.64
Бахар
-0.61
:✨
-0.61
lenker
-0.59
وتسجيلات
-0.59
DockStyle
-0.57
***!
-0.57
AssemblyCulture
-0.54
POSITIVE LOGITS
health
0.35
durs
0.34
ſur
0.32
duros
0.32
desvi
0.32
lucha
0.31
þat
0.30
ſtate
0.30
individuals
0.29
purpoſe
0.29
Activations Density 0.021%