INDEX
    Explanations

    punctuation marks at the end of sentences

    New Auto-Interp
    Negative Logits
    živ
    -0.14
    uide
    -0.14
    #Region
    -0.14
    (strtolower
    -0.14
     Bros
    -0.14
     иÑħ
    -0.13
    amer
    -0.13
     leur
    -0.13
     dit
    -0.13
    _stderr
    -0.13
    POSITIVE LOGITS
     Im
    0.24
     Dan
    0.22
     Er
    0.21
    Dan
    0.20
     Parallel
    0.20
     Dane
    0.20
     Nach
    0.19
    dan
    0.19
     Dar
    0.19
     Au
    0.19
    Act Density 0.012%

    No Known Activations