INDEX
Explanations
phrases related to risk factors in health and social scenarios
New Auto-Interp
Negative Logits
bedo
-0.16
алом
-0.16
roe
-0.15
Regents
-0.15
vrier
-0.15
éĥİ
-0.15
rove
-0.14
Hindered
-0.14
麦
-0.14
Rated
-0.14
POSITIVE LOGITS
risk
0.91
risks
0.77
Risk
0.72
risk
0.71
-risk
0.66
Risk
0.64
Ris
0.61
é£İéĻ©
0.59
risking
0.52
ris
0.51
Activations Density 0.235%