INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
49.25
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
symbols and formatting elements commonly used in text
New Auto-Interp
Negative Logits
referenties
-0.41
Hauptartikel
-0.40
Litteratur
-0.37
nawr
-0.37
Smarty
-0.36
Literatur
-0.36
loaf
-0.34
erobic
-0.34
request
-0.33
出版年
-0.32
POSITIVE LOGITS
kasarigan
0.53
LookAnd
0.53
betweenstory
0.53
fromnode
0.51
queſta
0.49
LEGGI
0.48
انتهای
0.47
BoxFit
0.47
laſſen
0.47
árbol
0.47
Activations Density 0.698%