INDEX
    Explanations

    punctuation marks and their frequency in the text

    previous letter followed by capitalized word

    New Auto-Interp
    Negative Logits
     Diſ
    -0.62
    gameserver
    -0.60
    ]")]
    -0.57
     faw
    -0.57
    -0.55
    engelsk
    -0.55
     poffe
    -0.54
     purpoſe
    -0.54
     seta
    -0.54
     incremental
    -0.53
    POSITIVE LOGITS
    twimg
    0.69
     autorytatywna
    0.61
    cherichia
    0.57
     GenerationType
    0.56
     пунктов
    0.56
    0.55
    Życiorys
    0.54
    Đ
    0.52
    ThroughAttribute
    0.52
    0.52
    Act Density 0.221%

    No Known Activations