INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
78.5
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to classification and categorization
New Auto-Interp
Negative Logits
supersonic
-0.52
screenshot
-0.50
neutrons
-0.46
supers
-0.45
Songtext
-0.45
Grip
-0.44
nervous
-0.43
sistors
-0.43
:✨
-0.43
impre
-0.42
POSITIVE LOGITS
categories
0.65
category
0.61
categorize
0.59
ValueStyle
0.59
categorized
0.56
categorization
0.55
kategor
0.51
classification
0.51
categor
0.50
classifications
0.48
Activations Density 0.001%