INDEX
Explanations
emotional words associated with feelings
expressions of emotions and feelings
New Auto-Interp
Negative Logits
annis
-0.85
culosis
-0.69
Accounting
-0.67
zan
-0.66
wald
-0.63
err
-0.62
iary
-0.62
Amend
-0.61
lder
-0.60
amn
-0.60
POSITIVE LOGITS
feelings
1.04
terness
0.94
sensations
0.86
æĦ
0.82
aversion
0.80
pring
0.77
affection
0.76
sentiments
0.76
ual
0.75
emotions
0.75
Activations Density 0.020%