INDEX
Negative Logits
æus
-0.98
fuls
-0.95
tubers
-0.91
roaches
-0.87
nicity
-0.85
waits
-0.84
sticks
-0.84
joys
-0.84
roughs
-0.83
vrolet
-0.83
POSITIVE LOGITS
coscienza
0.62
noastre
0.61
themselves
0.59
Suomessa
0.59
that
0.59
neler
0.59
olev
0.58
stesse
0.58
célèbres
0.57
hip
0.57
Activations Density 0.147%