INDEX
Explanations
references to the concept of care and responsibility in relation to mental health
New Auto-Interp
Negative Logits
OTHERWISE
-0.15
ombs
-0.15
verture
-0.15
ama
-0.15
ieren
-0.15
woes
-0.14
.ie
-0.14
ivery
-0.14
ãĥ³ãĤ¯
-0.14
ÑĢезÑĥлÑĮÑĤ
-0.13
POSITIVE LOGITS
reason
0.27
possibility
0.27
limit
0.26
chance
0.26
saying
0.24
need
0.23
difference
0.23
danger
0.21
temptation
0.21
eed
0.20
Activations Density 0.087%