INDEX
    Explanations

    general terms referring to collections of people

    phrases emphasizing inclusivity and the well-being of various groups of people

    Negative Logits
    Token                 Logit
    " Collider"           -0.69
    " bluff"              -0.69
    "USS"                 -0.67
    "OLOG"                -0.65
    " brisk"              -0.64
    "inventoryQuantity"   -0.64
    "Alias"               -0.63
    " Haunted"            -0.61
    " Jump"               -0.60
    " caution"            -0.59
    Positive Logits
    Token                 Logit
    " irrespective"        0.84
    " alike"               0.82
    "soever"               0.80
    " harmed"              0.77
    "selves"               0.75
    "dden"                 0.73
    " effected"            0.73
    "igent"                0.73
    "folk"                 0.73
    "omever"               0.73
    Activation Density: 0.301%

    No Known Activations