INDEX
Explanations
phrases related to physical well-being and personal feelings
expressions related to personal well-being and health status
Negative Logits
Differences   -0.73
intrusion     -0.70
Opposition    -0.67
allic         -0.67
Critics       -0.67
Whale         -0.65
opposes       -0.65
Influence     -0.64
cific         -0.64
Adds          -0.64
Positive Logits
thankful      1.35
enjoying      1.30
grateful      1.29
happier       1.26
happiest      1.20
exhausted     1.20
happy         1.19
glad          1.17
ready         1.17
recovering    1.16
Activations Density: 0.350%