INDEX
Explanations
phrases related to well-being and mental health
the term "being" and related concepts of existence and well-being
New Auto-Interp
Negative Logits
Brave
-0.64
My
-0.64
Cour
-0.63
Select
-0.62
lur
-0.61
Central
-0.61
straw
-0.61
Rec
-0.60
Court
-0.59
Guide
-0.59
POSITIVE LOGITS
being
3.53
having
1.49
doing
1.23
still
1.11
yip
1.10
taking
1.07
tons
1.06
together
1.05
moving
1.01
violence
1.01
Activations Density 0.018%