INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
44
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
function definitions and their parameters in code
New Auto-Interp
Negative Logits
Stimmung
-0.36
dapur
-0.34
}{*}{-0.32
Personendaten
-0.31
peringatan
-0.30
Unterkunft
-0.30
někdo
-0.30
zumal
-0.30
))^
-0.29
hendes
-0.29
POSITIVE LOGITS
propOrder
0.84
0.73
InstrumentedTest
0.72
rrggbb
0.61
setVerticalGroup
0.59
IBOutlet
0.58
<unused15>
0.57
uxxxx
0.57
<unused1>
0.57
<unused26>
0.57
Activations Density 0.182%