INDEX
Explanations
words related to emotions or deep feelings, specifically related to the heart
references to "hearts" in emotional contexts
Negative Logits
Attorney          -0.69
Sweeney           -0.64
aggress           -0.63
Discrimination    -0.62
dq                -0.62
traumatic         -0.61
pmwiki            -0.60
outpatient        -0.60
antidepressants   -0.60
ress              -0.58
Positive Logits
hearts            1.17
chool             1.02
mith              0.94
strings           0.85
pring             0.85
pots              0.84
core              0.82
ight              0.81
borough           0.79
endor             0.78
Activations Density: 0.006%