INDEX
    Explanations

    punctuation marks and their patterns in the text

    New Auto-Interp
    Negative Logits
    ÏĦζ
    -0.15
    chie
    -0.13
    ishi
    -0.13
    éľ²
    -0.13
    icha
    -0.13
    ãģĵãĤį
    -0.13
     Ñģм
    -0.13
    ãģ«ãģ¤
    -0.13
    Ïĥμ
    -0.13
    baugh
    -0.13
    POSITIVE LOGITS
    atten
    0.15
    _weak
    0.14
    orsk
    0.14
    ryn
    0.14
    rdf
    0.14
     versa
    0.14
    ynch
    0.14
    олаг
    0.13
    arih
    0.13
    bens
    0.13
    Act Density 0.002%

    No Known Activations