INDEX
    Explanations

    phrases and words related to health and safety topics

    New Auto-Interp
    Negative Logits
     Health
    -0.32
    Health
    -0.32
     health
    -0.32
    health
    -0.31
     HEALTH
    -0.31
    -health
    -0.28
    _health
    -0.27
    .health
    -0.26
    _HEALTH
    -0.25
    .Health
    -0.23
    POSITIVE LOGITS
     well
    0.31
     Well
    0.28
    well
    0.27
     welfare
    0.27
     fitness
    0.27
    Well
    0.27
     wellbeing
    0.26
     wel
    0.25
     safety
    0.24
     WELL
    0.24
    Act Density 0.035%

    No Known Activations