INDEX
Explanations
mentions of mental health-related terms and phrases
references to mental health topics
New Auto-Interp
Negative Logits
advertisement
-0.79
ICA
-0.73
ded
-0.71
IRD
-0.71
gger
-0.70
ered
-0.68
aday
-0.67
eled
-0.67
ELS
-0.65
Clive
-0.64
POSITIVE LOGITS
faculties
0.96
disorders
0.92
illness
0.89
defic
0.87
disorder
0.85
wellbeing
0.82
izing
0.82
ising
0.82
itary
0.79
retard
0.79
Activations Density 0.014%