Explanations
This neuron detects mentions of proper-noun organization names (e.g., companies, agencies, and institutional titles).
Negative Logits
Token         Logit
Sanford       -0.06
uzzi          -0.06
Rings         -0.06
ратно         -0.06
021           -0.06
960           -0.06
Duis          -0.06
Christopher   -0.06
すれば        -0.06
PREFIX        -0.06
Positive Logits
Token         Logit
rb            0.08
_G            0.07
ัจ            0.07
CONF          0.07
_P            0.06
-п            0.06
DUCT          0.06
перей         0.06
avait         0.06
영향을        0.06
Activations Density: 0.122%