INDEX
Explanations
Locations
The neuron detects proper-place names or geographical locations in the text.
New Auto-Interp
Negative Logits
Baptist
-0.07
.reducer
-0.07
inşa
-0.07
lại
-0.07
azing
-0.07
عليها
-0.06
alah
-0.06
SHARES
-0.06
цій
-0.06
_UNDEFINED
-0.06
POSITIVE LOGITS
죽
0.07
康
0.06
Gre
0.06
iants
0.06
\Json
0.06
険
0.06
heat
0.05
وان
0.05
porno
0.05
puede
0.05
Activations Density 0.018%