INDEX
    Explanations

    sentences that start with "we."

    the word "We" to indicate collective actions or statements

    New Auto-Interp
    Negative Logits
     Mehran
    -0.69
     guiActiveUnfocused
    -0.64
    cum
    -0.63
    SPONSORED
    -0.62
     PUBLIC
    -0.59
     totality
    -0.59
     steroids
    -0.59
     Publication
    -0.57
    uates
    -0.57
     LSD
    -0.57
    POSITIVE LOGITS
    've
    1.15
    're
    1.10
    'll
    1.07
    ldon
    1.05
    bley
    0.99
    akening
    0.99
    ighed
    0.99
    eping
    0.98
    selves
    0.94
    alth
    0.93
    Act Density 0.168%

    No Known Activations