Explanations
Citations
This neuron strongly activates on personal names in bibliographic citations (i.e. author names in reference lists).
Negative Logits
Вт    -0.07
IPP    -0.06
shops    -0.06
exig    -0.06
systemic    -0.06
toBe    -0.06
/datatables    -0.06
Maya    -0.06
(Art    -0.06
수가    -0.06
Positive Logits
Ά    0.07
_ALLOWED    0.07
donn    0.06
.exe    0.06
devlet    0.06
idot    0.06
legalArgumentException    0.06
COL    0.06
لیم    0.06
apest    0.06
Activations Density: 0.035%
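
The density figure can be read as the share of tokens on which the neuron fires at all. The sketch below assumes that reading (activation strictly above a threshold of 0); the dashboard's exact definition is not stated here. The function name `activation_density` and the sample data are hypothetical, chosen only to reproduce a value of the same size.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    neuron is active; this is an illustrative definition, not one taken
    from the dashboard itself.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 7 active tokens out of 20,000.
sample = [0.0] * 19_993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 0.6]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low means the neuron is silent on nearly all tokens, which fits a feature tied to a narrow context such as author names in reference lists.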