INDEX
    Explanations

    past tense verbs

    statements related to accountability and consequences

    Negative Logits
    Token        Logit
    "ELS"        -0.57
    "thodox"     -0.55
    "Il"         -0.52
    "los"        -0.52
    "Bridge"     -0.49
    "airs"       -0.49
    "Eastern"    -0.48
    "updated"    -0.48
    "Balt"       -0.48
    "Ń"          -0.48
    Positive Logits
    Token            Logit
    " yourself"      1.30
    " yourselves"    1.19
    " Yourself"      0.89
    " your"          0.76
    " YOUR"          0.70
    "poke"           0.67
    "your"           0.63
    "Your"           0.61
    " panties"       0.60
    " smack"         0.59
    Act Density 0.872%

    No Known Activations