INDEX
    Explanations

    punctuation marks, especially quotation marks and periods

    New Auto-Interp
    Negative Logits
    en
    -0.70
    vastava
    -0.68
    ghijkl
    -0.64
    яза
    -0.61
    LOR
    -0.61
     getApplication
    -0.60
     Sait
    -0.59
     snow
    -0.56
     Faul
    -0.56
    inq
    -0.56
    POSITIVE LOGITS
    ?”
    1.05
    .”
    1.01
    .’”
    0.99
    findpost
    0.97
     مشين
    0.97
     nakalista
    0.96
     <=",
    0.94
    ?"
    0.93
    ).”
    0.89
     dessinée
    0.88
    Act Density 0.129%

    No Known Activations