INDEX
Explanations
discussions about mental health and well-being
New Auto-Interp
Negative Logits
client
-0.16
Yelp
-0.15
clients
-0.15
avior
-0.15
client
-0.15
ettel
-0.14
ComputedStyle
-0.14
clients
-0.13
Vict
-0.13
_intr
-0.13
POSITIVE LOGITS
mental
0.43
Mental
0.41
mental
0.36
mentally
0.28
suicide
0.26
MENT
0.25
MH
0.24
Samar
0.23
Mind
0.22
Ment
0.22
Activations Density 0.043%