INDEX
    Explanations

    sentences indicating negative events or situations

    sentence endings that convey impactful or conclusive statements

    New Auto-Interp
    Negative Logits
    ',"
    -0.61
    lled
    -0.59
    Thumbnail
    -0.58
    inguishable
    -0.58
    '."
    -0.54
    ucer
    -0.53
    untarily
    -0.53
     Cup
    -0.52
    Instance
    -0.51
    Mobil
    -0.49
    POSITIVE LOGITS
    ↵Âł
    1.18
     Âł
    1.13
     Âł Âł
    1.09
     ³³
    1.09
    ³³
    1.05
     americ
    0.94
     Secondly
    0.92
    Âł
    0.88
    tumblr
    0.84
    ↵↵
    0.83
    Act Density 0.493%

    No Known Activations