INDEX
Explanations
concepts and terms related to health, safety, and well-being
New Auto-Interp
Negative Logits
afil
-0.07
ipel
-0.07
lesh
-0.06
анÑĤи
-0.06
eated
-0.06
achu
-0.06
Bast
-0.06
iren
-0.06
Declared
-0.06
lemen
-0.06
POSITIVE LOGITS
welfare
0.10
needs
0.08
elfare
0.08
both
0.08
wellbeing
0.08
interests
0.08
Welfare
0.07
isOk
0.07
ment
0.07
safety
0.07
Activations Density 0.017%