INDEX
Explanations
words in a non-Latin script, possibly Cyrillic
characters or letters from a specific non-Latin script
New Auto-Interp
Negative Logits
ometimes
-0.82
GOODMAN
-0.77
oppable
-0.76
DragonMagazine
-0.74
bucks
-0.71
ierrez
-0.71
emort
-0.70
emonium
-0.69
otaur
-0.69
agonist
-0.68
POSITIVE LOGITS
Ñı
1.19
н
1.18
ÑĤ
1.08
¶
1.08
м
1.07
в
1.07
Ð
1.06
¾
1.06
л
1.06
Ñ
1.04
Activations Density 0.005%