INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
72
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to proximity and closeness in various contexts
New Auto-Interp
Negative Logits
WriteTagHelper
-0.59
tanleria
-0.56
estacks
-0.50
OFDb
-0.49
Personendaten
-0.49
httphttps
-0.48
Hauptartikel
-0.47
➟
-0.47
:✨
-0.46
$_['
-0.46
POSITIVE LOGITS
proximity
0.60
close
0.59
Close
0.55
closeness
0.55
0.54
CLOSE
0.52
CLOSE
0.52
élo
0.52
close
0.50
Close
0.50
Activations Density 0.026%