INDEX
Explanations
words related to negative emotions or opinions towards someone or something
expressions of negative emotions, particularly those related to disdain, distrust, contempt, and hatred
New Auto-Interp
Negative Logits
inventoryQuantity
-0.71
Explan
-0.70
Horizon
-0.69
iday
-0.64
Blueprint
-0.64
helicop
-0.63
Grac
-0.62
ramid
-0.62
gow
-0.61
Appendix
-0.60
POSITIVE LOGITS
uous
0.96
toward
0.91
lust
0.90
rence
0.89
towards
0.89
uously
0.87
wart
0.80
vengeance
0.79
hatred
0.77
algia
0.76
Activations Density 0.101%