INDEX
NEGATIVE LOGITS
mêmes
-0.50
autorytatywna
-0.47
contenus
-0.45
conseguenza
-0.43
новништво
-0.42
Ante
-0.42
aikaa
-0.41
aussieht
-0.41
вещей
-0.41
Ad
-0.40
POSITIVE LOGITS
are
0.76
>{@
0.74
ويكيميديا
0.73
externi
0.69
they
0.69
DrawerToggle
0.68
were
0.65
allAfrica
0.63
UnusedPrivate
0.60
HttpHeaders
0.59
Activations Density: 0.001%