INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
45.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
technical terminology and uncertainty expressions in policy or regulatory contexts
New Auto-Interp
Negative Logits
Schild
-0.36
koop
-0.32
Weblinks
-0.31
asum
-0.30
platte
-0.30
janja
-0.30
esternos
-0.30
fundido
-0.30
pukul
-0.29
Wunsch
-0.29
POSITIVE LOGITS
LookAnd
0.69
fromnode
0.69
帖最后由
0.62
queſta
0.54
:+:
0.51
Administrativna
0.51
setVerticalGroup
0.50
PerformLayout
0.49
InstrumentedTest
0.49
ioneer
0.47
Activations Density 4.208%