INDEX
    Explanations

    capital letters or special characters in the middle of words

    instances of placeholders or incomplete thoughts in the text

    Negative Logits
    Token               Logit
    " eleph"            -1.01
    "ò"                 -0.94
    " pione"            -0.91
    "aditional"         -0.90
    "Þ"                 -0.90
    " exting"           -0.86
    " practition"       -0.85
    "ThumbnailImage"    -0.84
    "Ý"                 -0.83
    "senal"             -0.82
    Positive Logits
    Token                 Logit
    "Anyway"              0.71
    "³³³"                 0.68
                          0.67
    "NULL"                0.67
    "BUT"                 0.65
    "Honestly"            0.63
    "³³³³³³³³³³³³³³³³"    0.62
    "OH"                  0.60
    " Imran"              0.60
    "³³³³"                0.59
    Activation Density: 0.546%

    No Known Activations