INDEX
Explanations
instances of place names or geographic locations
New Auto-Interp
Negative Logits
llx
-0.16
oppins
-0.15
ãģ¤ãģ¶
-0.15
ENC
-0.14
enc
-0.14
.Condition
-0.14
جز
-0.14
μβ
-0.13
iliz
-0.13
γο
-0.13
POSITIVE LOGITS
Grim
0.16
cion
0.15
Frid
0.15
grave
0.15
earer
0.14
335
0.14
combe
0.14
رÙĪÙħ
0.14
unlike
0.13
ameda
0.13
Activations Density 0.009%