INDEX
    Explanations

    sentences conveying lack of understanding or the desire to hold people accountable

    sentences or punctuation marks indicating the end of statements, particularly those that are marked with a period

    Negative Logits
    Token             Logit
    " challeng"       -0.86
    " withd"          -0.80
    " immersion"      -0.76
    " inqu"           -0.75
    " manif"          -0.74
    " nodd"           -0.74
    " dimensional"    -0.74
    " dispos"         -0.70
    " nuts"           -0.70
    " glim"           -0.70
    Positive Logits
    Token             Logit
    " Photograph"     1.41
    " Photo"          1.30
    " Retrieved"      1.29
    "jpg"             1.28
    " REUTERS"        1.20
    " Courtesy"       1.15
    " Image"          1.12
    " Provided"       1.09
    " Accessed"       1.09
    " PHOTO"          1.07
    Activation Density: 0.220%

    No Known Activations