INDEX
    Explanations

    terms related to alleviating issues, particularly poverty and health problems, via social justice initiatives

    New Auto-Interp
    Negative Logits
     freely
    -0.70
     cringe
    -0.66
     Origin
    -0.65
     boldly
    -0.65
     Odin
    -0.65
     ultras
    -0.65
     oats
    -0.64
     passionately
    -0.62
     unprotected
    -0.62
     blindly
    -0.61
    POSITIVE LOGITS
    uating
    1.27
    icating
    1.16
    uate
    1.12
    uated
    1.11
    uates
    1.10
    ving
    1.08
    iating
    1.06
    pling
    1.06
    inished
    1.05
    ating
    1.05
    Act Density 0.079%

    No Known Activations