INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
61.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
concepts related to choices and alternatives
New Auto-Interp
Negative Logits
:✨
-0.82
GEBURTSDATUM
-0.69
MessageOf
-0.60
WriteTagHelper
-0.54
IRUS
-0.54
zeera
-0.54
httphttps
-0.54
LabelTagHelper
-0.54
censiti
-0.54
TextAppearance
-0.53
POSITIVE LOGITS
either
0.70
Either
0.62
either
0.58
Either
0.56
choices
0.52
entweder
0.51
scelte
0.48
choice
0.47
might
0.44
tantôt
0.44
Activations Density 0.000%