INDEX
Explanations
This neuron activates on list item numbers (the numeric markers at the start of enumerated entries).
New Auto-Interp
Negative Logits
004
-0.07
زیست
-0.07
saison
-0.07
candid
-0.06
Grupo
-0.06
Realty
-0.06
.Cancel
-0.06
να
-0.06
Hizmet
-0.06
_variant
-0.06
POSITIVE LOGITS
etooth
0.07
aaa
0.06
mé
0.06
STANCE
0.06
shops
0.06
ΕΤ
0.06
十一
0.06
τε
0.06
n
0.06
mites
0.06
Activations Density 0.018%