INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
53.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
references to quantities or measurements
New Auto-Interp
Negative Logits
ModelExpression
-0.73
незавершена
-0.72
OGND
-0.68
Drapeau
-0.65
pinulongan
-0.61
transférez
-0.59
:✨
-0.59
➟
-0.59
IndentedString
-0.59
Paglinawan
-0.59
POSITIVE LOGITS
beginning
0.37
instead
0.36
íté
0.33
while
0.32
rather
0.31
éndolo
0.31
while
0.31
instead
0.29
inSlope
0.29
failure
0.28
Activations Density 0.000%