INDEX
    Explanations

    punctuation marks and their associated significance in the text

    New Auto-Interp
    Negative Logits
    794
    -0.15
    /fw
    -0.15
    itbart
    -0.15
     forks
    -0.15
    .scalablytyped
    -0.15
    ζα
    -0.15
     Jad
    -0.14
    .bunifuFlatButton
    -0.14
    viron
    -0.14
    geois
    -0.14
    POSITIVE LOGITS
    çª
    0.16
    kili
    0.15
     Hermes
    0.15
    AVIS
    0.15
    éĽĨ
    0.15
    asca
    0.14
    OTES
    0.14
    ymes
    0.14
    olle
    0.14
    ãĤ·ãĥ¼
    0.14
    Act Density 0.210%

    No Known Activations