INDEX
    Explanations

    special characters and punctuation marks used in various contexts

    Negative Logits
    ris          -0.17
    foundland    -0.16
    rie          -0.16
    ses          -0.15
    indsight     -0.15
    iê           -0.15
    ish          -0.14
    اÙħا         -0.14
    mitt         -0.14
     Ved         -0.14
    Positive Logits
    页éĿ¢åŃĺæ¡£å¤ĩ份    0.17
    uard         0.16
    htub         0.16
    EAR          0.15
    ottes        0.14
     OnTrigger   0.14
    IFF          0.14
    oux          0.14
    eniable      0.14
    ialog        0.13
    Act Density 0.052%

    No Known Activations