INDEX
Explanations
phrases and words related to health and safety topics
New Auto-Interp
Negative Logits
Health
-0.32
Health
-0.32
health
-0.32
health
-0.31
HEALTH
-0.31
-health
-0.28
_health
-0.27
.health
-0.26
_HEALTH
-0.25
.Health
-0.23
POSITIVE LOGITS
well
0.31
Well
0.28
well
0.27
welfare
0.27
fitness
0.27
Well
0.27
wellbeing
0.26
wel
0.25
safety
0.24
WELL
0.24
Activations Density 0.035%