INDEX
Negative Logits
^{*}\0.45
sortes
0.44
ripe
0.43
ítulos
0.43
प्लान
0.41
unsurprisingly
0.41
roč
0.39
गुरूवार
0.39
प्रभावित
0.39
byshire
0.39
POSITIVE LOGITS
worthwhile
0.49
worth
0.49
IMO
0.47
effort
0.44
lohnt
0.44
Worth
0.43
warto
0.43
imo
0.41
deserves
0.41
enswert
0.41
Activations Density 0.033%