INDEX
    Explanations

    possessive pronouns combined with rankings or evaluations

    New Auto-Interp
    Negative Logits
    umd
    -0.07
    antino
    -0.07
    pong
    -0.07
    239
    -0.07
    issance
    -0.07
    ovah
    -0.07
     Claus
    -0.06
    akit
    -0.06
    adero
    -0.06
    /////////////////////////////////////////////////////////////////////////////↵
    -0.06
    POSITIVE LOGITS
     thoughts
    0.10
     attempt
    0.07
     Thoughts
    0.07
     picks
    0.07
     view
    0.07
     contribution
    0.07
    rant
    0.06
     pick
    0.06
     reasons
    0.06
    _complete
    0.06
    Act Density 0.016%

    No Known Activations