INDEX
    Explanations

    phrases related to taking action or making decisions

    phrases indicating actions taken against significant entities or situations

    Negative Logits
    Token           Logit
    "ouf"           -0.83
    "nces"          -0.81
    "ascript"       -0.79
    "arella"        -0.76
    "aught"         -0.74
    "quartered"     -0.74
    "chell"         -0.74
    " teasp"        -0.73
    "locks"         -0.73
    "ivas"          -0.73
    Positive Logits
    Token             Logit
    " unsuspecting"   0.94
    " whoever"        0.81
    " whichever"      0.79
    " anybody"        0.75
    " everybody"      0.70
    " anyone"         0.68
    " whatever"       0.68
    " ourselves"      0.67
    " behalf"         0.66
    " hordes"         0.65
    Act Density 0.506%

    No Known Activations