INDEX
Negative Logits
Jaw
-0.08
வே
-0.08
aptly
-0.07
UW
-0.07
Expose
-0.07
-0.07
've
-0.07
멘
-0.07
azvo
-0.07
наб
-0.07
POSITIVE LOGITS
desagrad
0.11
pleasant
0.11
unpleasant
0.10
pleasant
0.10
agradable
0.10
stroll
0.09
strolling
0.09
Gespräch
0.09
breeze
0.09
agradável
0.09
Activations Density 0.016%