INDEX
Negative Logits
slightly
0.64
nifty
0.63
légèrement
0.61
trochę
0.61
yummy
0.58
somewhat
0.57
nieco
0.54
sometimes
0.54
చక్క
0.54
delightful
0.53
POSITIVE LOGITS
警告
0.89
abhor
0.88
horrified
0.86
horrifying
0.84
warning
0.84
appalled
0.84
WARNING
0.83
WARNING
0.83
चेतावनी
0.82
horrific
0.81
Activations Density 0.074%