INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
0
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
Wikimedijinoj
-0.65
şört
-0.50
himo
-0.49
WriteTagHelper
-0.49
醐
-0.49
THEORY
-0.49
èdia
-0.49
metast
-0.48
biodegradable
-0.48
tyimages
-0.48
POSITIVE LOGITS
carefully
0.59
careful
0.52
ValueStyle
0.47
memperhatikan
0.44
soigneusement
0.43
caution
0.42
attenzione
0.42
dikkat
0.40
duquel
0.40
WaitGroup
0.39
Activations Density 0.000%