INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
71.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
elements related to cleanliness and hygiene
New Auto-Interp
Negative Logits
WriteTagHelper
-0.68
Personendaten
-0.66
GEBURTSDATUM
-0.59
bootstrapcdn
-0.55
Paglinawan
-0.55
zheimer
-0.54
GEBURTS
-0.54
urlpatterns
-0.54
المناصب
-0.54
styleType
-0.52
POSITIVE LOGITS
Clean
0.48
cleanliness
0.47
clean
0.45
clean
0.45
Clean
0.43
hygiene
0.38
CLEAN
0.38
attenzione
0.37
CLEAN
0.36
limpio
0.36
Activations Density 0.001%