INDEX
Explanations
mentions of mental health related terms
references to mental health issues
New Auto-Interp
Negative Logits
advertisement
-0.82
gger
-0.75
ICA
-0.74
IRD
-0.73
ded
-0.70
aday
-0.70
eled
-0.68
ding
-0.66
oulos
-0.65
VEN
-0.64
POSITIVE LOGITS
faculties
0.96
disorders
0.91
illness
0.86
defic
0.83
disorder
0.82
itary
0.80
wellbeing
0.79
retard
0.79
awar
0.79
disabilities
0.79
Activations Density 0.011%