INDEX
Model: gemma-2-9b-it
Layer #: 20
Steering Hook: blocks.20.hook_resid_pre
Steering Strength: 62.75
Uploader: bot-neuronpedia
Created At: 2/15/2025 1:06:43 AM
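The hook and strength listed above suggest that the vector is added, scaled, to the residual stream entering block 20. The following is only a minimal sketch of that idea, assuming additive steering with a unit-normalized vector; the page does not state how the strength is applied. The function `steer` and the toy 4-dimensional values are hypothetical and are not the real vector.

```python
import math

STEERING_STRENGTH = 62.75  # value listed on this page


def steer(resid, vector, strength=STEERING_STRENGTH):
    """Return resid + strength * unit(vector).

    Assumption: steering is additive and the vector is normalized first.
    """
    norm = math.sqrt(sum(v * v for v in vector))
    if norm == 0:
        raise ValueError("steering vector must be non-zero")
    return [r + strength * v / norm for r, v in zip(resid, vector)]


# Toy residual activation and steering direction (illustrative only).
resid = [0.0, 1.0, 0.0, 0.0]
vector = [3.0, 0.0, 4.0, 0.0]
steered = steer(resid, vector)
```

In a real model this addition would be applied at every token position of the activation named by the steering hook, typically through a forward hook in the interpretability library being used.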
Explanations
words related to recognition and achievement
Negative Logits
Personendaten     -0.59
Audiodateien      -0.56
tovers            -0.55
المشاركات         -0.51
deus              -0.49
$_['              -0.49
WriteTagHelper    -0.47
cheaper           -0.46
baratos           -0.46
grine             -0.46
Positive Logits
recognition       0.53
reconhecimento    0.53
Anerkennung       0.52
acknowledgment    0.48
reconocimiento    0.47
riconoscimento    0.47
praise            0.47
reconocer         0.44
achievement       0.43
recognition       0.43
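The page does not describe how the logit lists are produced. A common approach is to project the vector through the model's unembedding matrix and report the highest- and lowest-scoring tokens. The sketch below illustrates that approach under this assumption; `top_logits`, the four-token vocabulary, and the 2-dimensional weights are hypothetical toy values, not data from the model.

```python
def top_logits(vector, unembed, vocab, k=2):
    """Rank tokens by the dot product of `vector` with each unembedding row.

    Returns (top-k positive, top-k negative) as lists of (token, score).
    """
    scores = [
        (tok, sum(v * w for v, w in zip(vector, row)))
        for tok, row in zip(vocab, unembed)
    ]
    ranked = sorted(scores, key=lambda pair: pair[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Toy vocabulary and unembedding rows (illustrative only).
vocab = ["recognition", "praise", "cheaper", "deus"]
unembed = [[1.0, 0.0], [0.8, 0.1], [-0.9, 0.0], [-0.5, 0.2]]
vector = [0.5, 0.0]

positive, negative = top_logits(vector, unembed, vocab)
```

Two tokens with the same surface text can appear in such a list when the tokenizer distinguishes variants, for example with and without a leading space.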
Activations Density: 0.000%