INDEX
Explanations
mentions of mental health and well-being
terms related to mental health and psychological status
Negative Logits
Token            Logit
Intent           -0.72
eer              -0.66
publication      -0.65
Commissioners    -0.64
ries             -0.63
Prospect         -0.63
Aval             -0.62
Cullen           -0.62
destinations     -0.61
Juliet           -0.61
Positive Logits
Token            Logit
ependent         0.86
retarded         0.85
onding           0.81
defic            0.80
pecially         0.78
osure            0.77
@#&              0.77
challenged       0.75
otropic          0.75
rypt             0.74
Activations Density 0.010%