INDEX
Explanations
words related to negative emotions, such as fear, anxiety, and frustration
expressions of emotional distress and psychological states
Negative Logits
arsen               -0.81
soDeliveryDate      -0.75
natureconservancy   -0.73
çīĪ                 -0.69
vernment            -0.67
Unch                -0.64
backdoor            -0.63
folio               -0.62
è£                  -0.62
itures              -0.62
Positive Logits
sadness      0.94
inducing     0.93
loneliness   0.83
anguish      0.80
sorrow       0.79
feelings     0.79
disbelief    0.77
palpable     0.77
nausea       0.77
melancholy   0.75
Activations Density: 0.513%
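
A density figure like this is commonly read as the fraction of sampled tokens on which the feature is active. The sketch below illustrates that reading only. It assumes "active" means an activation strictly above a threshold of zero, which the source does not state, and both the function name activation_density and the sample values are hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: a token counts as "active" when its activation exceeds
    the threshold. The dashboard's exact definition is not given.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 5 of which activate the feature.
sample = [0.0] * 995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.513% would mean the feature fires on roughly 5 of every 1000 sampled tokens.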