INDEX
Explanations
beginning
The main thing this neuron does is detect references to specific historical time periods and associated place or event names.
New Auto-Interp
Negative Logits
traffic
-0.07
ans
-0.06
(**
-0.06
gods
-0.06
politics
-0.06
protest
-0.06
project
-0.06
_inode
-0.06
butto
-0.06
Pager
-0.06
POSITIVE LOGITS
sitesinde
0.07
وغير
0.07
ConfigurationException
0.07
acion
0.06
أبريل
0.06
đ
0.06
astonishing
0.06
در
0.06
_bg
0.06
↵
0.06
Activations Density 0.089%