INDEX
Explanations
positive emotional expressions related to physical well-being
expressions of positive emotions and well-being
New Auto-Interp
Negative Logits
stakes
-0.79
stakes
-0.75
enge
-0.74
xus
-0.70
atal
-0.68
pmwiki
-0.67
biases
-0.67
fallacy
-0.66
unintended
-0.66
andestine
-0.66
POSITIVE LOGITS
cheerful
1.28
calm
1.27
healthy
1.24
upbeat
1.24
happy
1.21
refreshed
1.21
relaxed
1.19
stable
1.15
fine
1.15
comfortable
1.12
Activations Density 0.585%