INDEX
    Explanations

    terminal punctuation, specifically periods

    New Auto-Interp
    Negative Logits
    wie
    -0.16
    ecz
    -0.16
    loth
    -0.15
    adera
    -0.15
     Lud
    -0.15
    /forum
    -0.14
     ç©
    -0.14
     æ¢
    -0.14
    reso
    -0.14
    way
    -0.14
    POSITIVE LOGITS
    ilent
    0.17
    itage
    0.15
    bz
    0.15
     exert
    0.15
    uest
    0.15
    æ»
    0.14
    098
    0.14
    oux
    0.14
    aku
    0.14
    ittest
    0.14
    Act Density 0.000%

    No Known Activations