INDEX
Explanations
the word "feelings."
descriptions containing emotions or sentiments
references to emotions and feelings
Negative Logits
annis     -0.72
ded       -0.68
err       -0.63
photos    -0.62
agher     -0.60
Dou       -0.60
design    -0.59
lder      -0.59
wald      -0.59
Twin      -0.58
Positive Logits
feelings      1.10
terness       0.98
terday        0.87
sensations    0.86
aversion      0.85
affection     0.84
sentiments    0.84
otions        0.80
uated         0.79
emotions      0.78
Activations Density 0.009%