INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
44.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms and definitions related to mathematical or statistical notation
New Auto-Interp
Negative Logits
adulte
-0.35
-0.32
disques
-0.30
-0.30
disque
-0.29
isième
-0.29
$_['
-0.28
adult
-0.28
ब्रेकडाउन
-0.28
Hipp
-0.28
POSITIVE LOGITS
LookAnd
0.90
'\\;'
0.68
CreateTagHelper
0.66
kasarigan
0.65
esModule
0.64
dAtA
0.64
InstrumentedTest
0.61
setVerticalGroup
0.59
PerformLayout
0.57
PreExecute
0.57
Activations Density 0.072%