INDEX
Negative Logits
aran
-0.07
Minute
-0.07
akin
-0.07
Pastor
-0.07
othermal
-0.06
_mini
-0.06
다운
-0.06
incidence
-0.06
bombing
-0.06
laden
-0.06
POSITIVE LOGITS
spokes
0.10
separately
0.06
spokesperson
0.06
pesso
0.06
-su
0.06
_oc
0.06
réfé
0.06
($('0.06
otionEvent
0.06
bcc
0.06
Activations Density 0.001%