INDEX
Model
gemma-2-9b-it
Layer #
20
Steering Hook
blocks.20.hook_resid_pre
Steering Strength
69
Uploader
bot-neuronpedia
Created At
2/15/2025 1:06:43 AM
Raw Vector
Actions
Explanations
terms related to classification and organization
New Auto-Interp
Negative Logits
raulic
-0.50
DCHECK
-0.50
redient
-0.46
Audiodateien
-0.46
Smarty
-0.46
himo
-0.45
Songtext
-0.45
Daredevil
-0.43
ècie
-0.43
tanleria
-0.42
POSITIVE LOGITS
ValueStyle
0.55
classification
0.48
categorization
0.47
categorized
0.47
categorize
0.43
categories
0.42
temáticas
0.42
category
0.42
classé
0.41
classifications
0.41
Activations Density 0.000%