INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
78
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
code structure and template definitions in programming languages
New Auto-Interp
Negative Logits
commandement
-0.34
réglage
-0.32
measurement
-0.30
lässlich
-0.30
nutrición
-0.29
<>",
-0.29
Explicación
-0.28
condena
-0.28
fashion
-0.28
comando
-0.28
POSITIVE LOGITS
autorytatywna
0.70
IContainer
0.65
évaluateur
0.65
principalTable
0.63
ViewInit
0.62
transQ
0.59
class
0.57
ंदीखरीदारी
0.55
styleable
0.55
Espèce
0.54
Activations Density 0.603%