INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
46
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
function definitions and their signatures in code
New Auto-Interp
Negative Logits
hendes
-0.45
Italijanski
-0.38
někdo
-0.37
Stimmung
-0.36
setelan
-0.36
cintura
-0.35
bubuk
-0.35
nawr
-0.35
peringatan
-0.34
bentar
-0.34
POSITIVE LOGITS
propOrder
0.86
исленность
0.66
rrggbb
0.66
setVerticalGroup
0.59
InstrumentedTest
0.59
PostInfinity
0.56
Clau
0.55
MessageTagHelper
0.54
LayoutStyle
0.53
phazard
0.53
Activations Density 0.271%