INDEX
Negative Logits
theless
-0.77
æĺ¯
-0.73
Interstitial
-0.70
sticks
-0.67
hide
-0.64
Õ
-0.64
kat
-0.64
schild
-0.64
naire
-0.62
handedly
-0.62
POSITIVE LOGITS
herty
1.51
ignt
1.05
apon
0.97
issance
0.95
uthor
0.89
keye
0.89
eland
0.88
essions
0.88
urry
0.86
ipers
0.85
Activations Density 0.002%