INDEX
Explanations
The neuron activates on past-tense verb tokens (e.g., “created,” “built,” “generated,” “made”).
New Auto-Interp
Negative Logits
Sud
-0.06
trials
-0.06
prominent
-0.06
bdsm
-0.06
commanders
-0.06
ashion
-0.06
-filter
-0.06
parece
-0.06
.mime
-0.06
-ring
-0.06
POSITIVE LOGITS
.Col
0.07
mẽ
0.07
”?
0.07
Ã
0.07
तरफ
0.06
.")
0.06
swirling
0.06
ية
0.06
')==
0.06
.'"
0.06
Activations Density 0.031%