INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
57.75
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to notation and variable definitions in mathematical contexts
New Auto-Interp
Negative Logits
ब्रेकडाउन
-0.47
-0.35
</blockquote>
-0.34
-0.34
immoral
-0.34
مغ
-0.34
UpInside
-0.34
'
-0.34
CDs
-0.33
unków
-0.33
POSITIVE LOGITS
InstrumentedTest
0.68
principalTable
0.66
LookAnd
0.63
PerformLayout
0.57
esModule
0.57
'\\;'
0.55
kasarigan
0.52
تقاوى
0.50
Италијани
0.49
fromnode
0.48
Activations Density 0.004%