INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
40.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms associated with research methodology and analysis
New Auto-Interp
Negative Logits
-0.41
supers
-0.39
immunos
-0.39
bluetooth
-0.38
adulte
-0.37
cheaper
-0.37
OfWeek
-0.37
Hipp
-0.36
ModelExpression
-0.36
unj
-0.36
POSITIVE LOGITS
LookAnd
0.77
Савезне
0.57
InstrumentedTest
0.57
setVerticalGroup
0.54
Normdatei
0.53
PerformLayout
0.51
MessageBoxIcon
0.51
astify
0.49
propOrder
0.49
PreExecute
0.49
Activations Density 0.030%