INDEX
Explanations
Locations
The neuron activates on place names and geographic region mentions.
New Auto-Interp
Negative Logits
ARC
-0.07
مشاركة
-0.07
inality
-0.06
National
-0.06
声音
-0.06
ائد
-0.06
.FragmentManager
-0.06
physician
-0.06
discrimination
-0.06
International
-0.06
POSITIVE LOGITS
leniyor
0.06
atrib
0.06
不会
0.06
fix
0.06
adm
0.06
infiltration
0.05
ák
0.05
σκεται
0.05
ाएग
0.05
loa
0.05
Activations Density 0.014%