    Explanations

    phrases relating to the impact of human actions on health and societal issues

    Negative Logits
    Token               Logit
    "."                 -0.63
    "Naissance"         -0.53
    " ويكيميديا"        -0.50
    "izzata"            -0.48
    "lgari"             -0.42
    "していきます"      -0.42
    " s"                -0.42
    "都不是"            -0.42
    "!"                 -0.41
    "kosi"              -0.41

    Positive Logits
    Token               Logit
    " itſelf"           0.87
    "ConstraintMaker"   0.86
    " بلکه"             0.82
    " myſelf"           0.79
    " iſt"              0.71
    " himſelf"          0.71
    " sondern"          0.70
    " arşivlendi"       0.69
    " estekak"          0.69
    " Monfieur"         0.67
    Activation Density: 0.281%

    No Known Activations