INDEX
Explanations
ancient place names or languages
New Auto-Interp
Negative Logits
圩
-0.71
agawa
-0.71
infine
-0.70
็ง
-0.69
[]}
-0.69
DBObject
-0.69
λλ
-0.68
شلوار
-0.68
Rivers
-0.68
samurai
-0.67
POSITIVE LOGITS
insurg
0.75
EDP
0.72
เบ
0.71
Mero
0.70
arsis
0.70
ciato
0.69
竞
0.68
Saudi
0.67
Zinc
0.66
Judaism
0.66
Activations Density 0.042%