INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
0
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
Personendaten
-0.69
:✨
-0.50
simplest
-0.41
ctrons
-0.41
grond
-0.40
barata
-0.40
cheaper
-0.40
generali
-0.40
mila
-0.39
ویکیپدیای
-0.39
POSITIVE LOGITS
irrit
0.46
ValueStyle
0.45
IContainer
0.43
felt
0.42
Feeling
0.42
irritation
0.42
feeling
0.41
Feeling
0.41
felt
0.41
headache
0.40
Activations Density 0.000%